[Boards: 3 / a / aco / adv / an / asp / b / bant / biz / c / can / cgl / ck / cm / co / cock / d / diy / e / fa / fap / fit / fitlit / g / gd / gif / h / hc / his / hm / hr / i / ic / int / jp / k / lgbt / lit / m / mlp / mlpol / mo / mtv / mu / n / news / o / out / outsoc / p / po / pol / qa / qst / r / r9k / s / s4s / sci / soc / sp / spa / t / tg / toy / trash / trv / tv / u / v / vg / vint / vip / vp / vr / w / wg / wsg / wsr / x / y ] [Search | Free Show | Home]

AMD reveals a Exascale MEGA APU in a new academic paper

This is a blue board which means that it's for everybody (Safe For Work content only). If you see any adult content, please report it.

Thread replies: 98
Thread images: 14

File: 21065703899l.png (285KB, 787x414px) Image search: [Google]
21065703899l.png
285KB, 787x414px
>According to this paper, AMD wants to get around this "large die issue" by making their Exascale APUs using a large number of smaller dies, which are connected via a silicon interposer. This is similar to how AMD GPUs connect to HBM memory and can, in theory, be used to connect two or more GPU, or in this case CPU and GPU dies, to create what is effectively a larger final chip using several smaller parts.

>In the image below you can see that this APU uses eight different CPU dies/chiplets and eight different GPU dies/chiplets to create an exascale APU that can effectively act like a single unit. If these CPU chiplets use AMD's Ryzen CPU architecture they will have a minimum of 4 CPU cores, giving this hypothetical APU a total of 32 CPU cores and 64 threads.

>This new APU type will also use onboard memory, using a next-generation memory type that can be stacked directly onto a GPU die, rather than be stacked beside a GPU like HBM. Combine this with an external bank of memory (perhaps DDR4) and AMD's new GPU memory architecture and you will have a single APU that can work with a seemingly endless amount of memory and easily compute using both CPU and GPU resources using HSA (Heterogeneous System Architecture).

>In this chip both the CPU and GPU portions can use the packages onboard memory as well as an external memory, opening up a lot of interesting possibilities for the HPC market, possibilities that neither Intel or Nvidia can provide themselves.

https://www.overclock3d.net/news/cpu_mainboard/amd_reveals_a_exascale_mega_apu_in_a_new_academic_paper/1

OMG, OMG, OMG! Intel is fucked again!
>>
File: maxresdefault.jpg (135KB, 1920x1080px) Image search: [Google]
maxresdefault.jpg
135KB, 1920x1080px
>>59058084
>tfw dreams come true

Finally, I was wondering why GPUs were first to receive on-chip memory, considering there are far fewer products which use them.

My only question is, what will board partners do with all the space created by removing RAM?
>>
That thing is gonna be fuckhuge. I wanna see one fully assembled.
>>
>>59058084
Wouldn't clustering all those components together increase the thermal load by a fuckton, though?

Interesting concept, nonetheless.
>>
>>59058175
>Combine this with an external bank of memory (perhaps DDR4)

HPC need a lot RAM
>>
File: Noctua fan on a PS4.jpg (258KB, 882x1052px) Image search: [Google]
Noctua fan on a PS4.jpg
258KB, 882x1052px
>>59058233
just slap a big ass heatsink on it
>>
>>59058240

>Right now this new "Mega APU" is currently in early design stages, with no planned release date. It is clear that this design uses a new GPU design that is beyond Vega, using a next-generation memory standard which offers advantages over both GDDR and HBM.
>>
>>59058175
Put 3 - RX480 gpus together via interposer
system uses it as one gpu with 100 scaling

wew.
>>
>>59058342
>no grid
>horsefly gets in the fan
>>
>>59058084
>chiplets
When will they learn?
>>
Nvidiots on suicide watch
>>
>>59058084
>MEGA APU

I want one now. No, yesterday. GIVE ME THE FUCKING APU.
>>
>>59058437
not everyone lives in a stable.
>>
>>59058240
>>Combine this with an external bank of memory (perhaps DDR4)
Yeah, NOW. There's no reason it can't simply be migrated onto the chip.

This is a stop-gap to something which could be amazing.
>>
>>59058175
>what will board partners do with all the space created by removing RAM?
batteries
no more laptops
>>
>>59058437
>horsefly
>not housecat
it's like you live in a barn
sonypony detected
>>
File: 1438897525061.jpg (53KB, 228x392px) Image search: [Google]
1438897525061.jpg
53KB, 228x392px
>80 chiplets combined into one mega chip
gattai!
>>
File: 1487723437839.jpg (69KB, 775x837px) Image search: [Google]
1487723437839.jpg
69KB, 775x837px
>>59058175
>My only question is, what will board partners do with all the space created by removing RAM?

More space for on-bus flash memory.
>>
File: 1400191020356.jpg (23KB, 175x175px) Image search: [Google]
1400191020356.jpg
23KB, 175x175px
>>59058473
>>59058623
don't forget the six million barns
>>
File: 1486938366648.png (279KB, 898x790px) Image search: [Google]
1486938366648.png
279KB, 898x790px
>>59058452
>tfw too intelligent too kek
>>
>>59058175

> what is a cache
>>
>>59058084
>This new APU type will also use onboard memory, using a next-generation memory type that can be stacked directly onto a GPU die

Oh, so that's that "next gen memory" that they mentioned to use for Navi, their successor for Vega. In 2019.
>>
File: 7drHiqrh.jpg (19KB, 312x213px) Image search: [Google]
7drHiqrh.jpg
19KB, 312x213px
>>59058084
INTEL IS FINISHED
>>
>tfw I was right about AMD moving towards an APU that acts like a desktop grade SoC
>tfw there will come a time when all you need is a very barebones motherboard -acting as a glorified mounting place for VRM, I/O, audio, and optionally additionsl PCIe lanes and memory dimms- and a single chip.
>tfw by nature OEMs will love this for laptops and AiOs and buy them in droves
>tfw we'll eventually be able to pack console level performance into a mac mini sized enclosure

This shit is so fucking cool. I know it'll be quite a while before consumers will be able to get their hands on these, but at least it's something genuinely new.
I also have to wonder how this will affect CPU design and production in general. Using this strategy on a bigger scale could lead to massive yields per wafer, being able to salvage even dies with only single function cores and pairing them together to make up a super budget offering. Depending on how streamlined they can make the interposer mounting process, this could very well revolutionize the way CPUs are designed and produced.

To me, this is even more exciting than Zen and Vega combined, even if I'll likely continue using high end separates for the foreseeable future.
>>
>>59058370
ADORED TV WAS RIGHT NAVI CONFIRMED TO BTFO NVIDIA
>>
File: IMG_0299.jpg (76KB, 840x473px) Image search: [Google]
IMG_0299.jpg
76KB, 840x473px
>>59058240
There is a very good reason AMD chose to refer to the HBM on Vega as a "high bandwidth cache" and why they've invested in designing a memory controller that can access many parts of the system independent of the CPU's memory controller. Vega, like Fiji, is a test bed, proof of concept, and flashy advertisement for new, forward-thinking technology they plan to use in future designs.

This is what I like about AMD. With each new product, they introduce new stuff to rethink the way computing and architecture design and production is currently done. Tessellation, async, HBM, GCN, mantel, freesync, etc. They're working to alter the entire industry in their favor one feature and one architecture at a time. They're the literal antithesis of Intel.
>>
File: 7drHiqr.gif (973KB, 312x213px) Image search: [Google]
7drHiqr.gif
973KB, 312x213px
>>59058836
More battry space my man.

Ronnie will go for days
>>
>what if we put the chips.... ON ANOTHER CHIP

fucking AMD
>>
This not goin to happen anytime soon, prob 2030

Amd should focus on how to crossfire apu and gpu . this could happen in few year if they try hard
>>
Should I buy AMD when the markets open tomorrow?
>>
>>59059302
I recommend not fucking with the stock market at all.
>>
>>59058084
holy fuc
>>
>>59059302
why not shortsell intel?
>>
>>59059213
They already did that. It's called dual graphics. And it sucked.
>>
>>59058084
Holy shit couldn't this tech improve consumer CPU yields by a lot?

>Take a Ryzen "core unit" of 4 cores
>Just make a shit ton of those as their own dies
>Bin defective ones, glue some together and glue an iGPU to it
>Arbitrarily large chips made from small, high yield dies
>>
>>59059213
>crossfire API+GPU
That's already a thing. Can crossfire APU with R7 250.
Also, the multi GPU in DX12/Vulkan will accomplish this sort of task in software.
>>
>>59059213
navi is rumored to be two gpus working on the same transistor

but it's all speculation

get dem heatsinks ready boys
>>
>>59059302
Just suck my dick and I'll give you 20 bucks, which is probably more than you'd make from AMD after tax.
>>
>>59058084
>chiplets

c-cute!!!!
>>
>>59059582
yes, intel is also experimenting with it.
though it's not just binning yields, since the cuts are physically smaller they can more efficiently use each wafer.
>>
>>59059616
>That's already a thing.

But it probably won't suck this time.
>>
>>59059582
that's very nearly what AMD's new server platform, Naples, will be.

> each Zeppelin die has 8c/16t, 16 MB L3, 2 DDR4 channels, and 32 PCIe lanes
> sell 2x MCMs with 16c/32t, 32 MB L3, 4 DDR channels, 64 PCIe lanes
> sell 4x MCMs with 32c/64t, 64 MB L3, 8 DDR channels, 128 PCIe lanes
> support 2 sockets of either of the above

they won't have great AVX capacity for HPC/simulation stuff, but they will be beastly NVMe file and web servers
>>
>>59058437
Once saw a horsefly an inch long

Not in my home though, I don't take my noctua systems to rural sectors
>>
fucking chiplets, when will they learn?
>>
>>59059821
>> sell 2x MCMs with 16c/32t, 32 MB L3, 4 DDR channels, 64 PCIe lanes
consumer naples when
>>
>>59058727
that barn looks really happy/angry
>>
>>59058398
I'm a huge nvidiot, but if AMD could actually demonstrate that working well, with real 100% scaling efficiency, I think nvidia would be btfo.
>>
>>59058398
it not goin to work like that, gpu had alot wasted transistor during "crossfire". they need to redesign everything
>>
>>59059565

As someone who tried it, can confirm.
>>
>>59059986
high inter-die bandwidth is necessary but not sufficient for proper GPU scaling.

you need, at the absolute least, geometry setup engines that can feed a rasterizer on not just another compute block but on potentially a different die.

control issues like this mean that successful designs will take several years to design and validate. navi might still try something like this, but Vega will almost certainly not.
>>
so, the madman are actually going to make navi scaleable?
I'd be really surprised if nvidia doesn't rebrand volta this year, they wanted to make same thing for very long time, can't imagine them being ahead on this.
>>
>>59060041
well, technically that's how it would work
how it would talk with each other that's another story, no idea, if they pull it off it's going to change GPU as we know it
>>
>>59060049
>>59060050
AMD hasn't had Navi described as "scalable" in the last few roadmap slides, so who the fuck knows.

Nvidia does great work with their fixed function units (color compression, tiling rasterizers, etc.) but lagged so much on async compute/graphics shaders that I can't imagine them being first with MCM GPUs.
>>
>>59060050
Volta was just slated to have Hybrid Memory Cube memory

Maxwell was going to have an ARM processor integrated

Of course neither one of those is gonna materialize and they pulled Pascal out their ass later on.
>>
>>59058084
So it's just stacked dies 3D ICs that everyone already has been looking at and testing already? Yeah Intel sure is fucked.
>>
>>59060068
do you think they will sue what they developed for zen with "infinite fabric"?

>>59060071
2 years back it was scaleable in the slides, they changed it, so was navi, that navi pushed back to 2019 is what makes me think they plan on something
>>
>>59060076
Reminder: Intel is GPU subsidiary of AMD.
>>
>>59060041
>>59060060
SLI/CF are fundamentally fucked nowadays since basically every modern engine pipeline uses previous frame data for effects, and AFR rendering was invented with the assumption that frames could be rendered independently.

Every new game with SLI/CF support is basically a gigantic one-off hack, which is not the way to make MCM GPUs succeed.
>>
>>59060095
I want everyone drop CF/SLi support altogether, it wastes development time for damn 1.5% of users skimping on optimization for single GPUs.
Problem is those users are the most obnoxious vocal minority.
>>
>>59060118

They did spend twice on a GPU, I'd be pretty pissed being that stupid too.

Direct X 12 explicit multi-adapter sounds interesting though.
>>
>>59058084
>that chip
THICC
>>
I really love technology. Can't wait to see how much progress is made in 10 years.
>>
>>59060149
>explicit multi-adapter sounds interesting though.
it's hard to program for, again leads to wasted dev time
>>
>>59060071
HMC was always questionable for GPUs. It was always more about capacity and design flexibility than bandwidth or power.
Pascal (the real GP100 one, not the Maxwell shrink GTX 10x0 series) is most of what Volta was supposed to be, assuming substituting HBM for HMC.

>>59060086
Infinity Fabric will be used for Zeppelin-Vega MCM APUs late this year or early next year. But technically IF is more a set of communication design libraries that are even used internally in their newer chips, so it's not clear what capabilities or operational characteristics/semantics it has. A HyperTransport successor doesn't have the same needs as internal pipeline structuring or control paths, etc.
>>
>>59060166
how possible do you think it is to integrate neural net inside a chip to handle all complicated intercommunication? will latency be unbearable or other way around?
>>
>>59059213
If only I could set my Apu to do only "Shadow/effect" and leave the rest for main Gpu.
>>
>>59058084
>tfw I bought AMD stocks at $12
hopefully they go up enough to fund my next upgrade
>>
>>59060211
dude, they will either jump to 20 or drop to 5 after 28th
be careful
>>
>>59058452
But is that natty?
>>
>>59060219
Eh, I only dropped $300 on them. Not a huge deal if the price drops a heap. If they go below $5 with no I'll just sit on them for the long run.
>>
>>59058084
That looks like a cool office park, where is it at?
>>
>>59060184
"neural nets" if you're generous enough to call them that, are only good in a CPU for speculative decisions, which boil down to branch prediction and prefetching at most. something like cache eviction could be done like this in principle too but wouldn't be worth it.

intra- and inter-chip communications are just protocols for buffer management and state transitions, where heuristics don't really have much of a place.
>>
>>59058084
THEY AREN'T MAKING THIS.
>>
>32 CPU cores
*"mOar corez!!1!!" off in the distance*
>>
>>59060429
Wouldn't it be possible to simply build a scheduler into the interposer (say at 28nm to make the traces small but the scheduler not totally suck) and have that run the inter-die communication. I suspect that you could even break it down further with smaller schedulers to tie together multiple gpu dies, and that scheduler+infinity fabric then talks to the cpu scheduler+infinity fabric.

I'm clearly not a CE or EE, but would such a design be workable?
>>
>>59058084
>chiplets

are these like the manlets of the CPU world? im sceptic in that case!
>>
>>59060479
Is it bad, gaymer?
>>
>>59058084
>chiplets
when wil they ever learn
>>
>>59058084
>next-generation memory type that can be stacked directly onto a GPU die
doesn't this cause heat transfer issues?
>>
>>59060554
Should've been called subchip
>>
The term chiplet has a specific meaning, it implies die stacking. Not to be confused with 3D stacking. Die stacking means you fab small parts and connect them via interproser to reduce cost on a low yielding extremely dense node.
Instead of one CPU die with 16 cores they're proposing 4 smaller dies each with 4 cores.
A couple years ago AMD proposed even breaking up the CPU itself into individual components to fab them all on separate processes tailored specifically for the part.
>>
>chiplets

When will they learn?
>>
>>59059152
Neat

>>59059213
2022 it says.

2030 is when Raja says discrete GPUs won't even exist anymore.
>>
>>59061060
>we're moving towards fully modular CPU fabrication

This shit is so cool. This is how technology markets are supposed to work. Everyone's so concerned with reaching the limit in shrinking transistors they've forgotten how many other ways there are to innovate products from the initial design phase up. This is what it looks like when there's real competition: the potential for methods that turn entire pre-established norms on their fucking heads.
>>
would massively increase yield/wafer

cash money
>>
>>59061034
It will require expensive MB and come with WC unit like that 220W CPU ;^)
>>
File: die stacking.jpg (99KB, 1122x631px) Image search: [Google]
die stacking.jpg
99KB, 1122x631px
>>59061477
If they could reliably package it then it would be an enormous cost savings. Individual parts costing perhaps pennies instead of monolithic dies costing upwards of $20-$50 each.
2.5D integration would be pretty big.
>>
>>59061111
Raja is wrong. Gpu would last longer. powerlimit/vrm capacity would be huge factor. unless you willing to pay premium for MB
>>
>>59061782
When you can do 2 TFLOPs of full precision, which is a bit more powerful than a 1080, on a single APU iGPU die (there's 8 of these dies in the server HBM), and that is only 5 years from now, then 13 years from now he's probably right.
>>
>>59059389
If everyone loses money playing the stock market then who gains money? Checkmate atheists.
>>
File: Ryzelander.jpg (109KB, 1280x545px) Image search: [Google]
Ryzelander.jpg
109KB, 1280x545px
>>59058084
>>
File: 1465404794341.jpg (9KB, 256x257px) Image search: [Google]
1465404794341.jpg
9KB, 256x257px
>>59062969
>then who gains money
>>
I wonder how long the PS3 and 360 could've lasted commercially if MS and Sony had waited for something like this instead of going with the Bobcat family, since it was obvious they wanted an APU solution for the PS4 and XB1, but both opted for the only low-power moar coar option they had at the time

Consoles could ACTUALLY be competitive if they had a Ryzen-based HBM APU
>>
>>59062971
i remember jewlander, good movie
>>
>>59063422
Consoles went with the lowest bidder for a credible platform, which ended up being mostly AMD.

Bulldozer was too high-power for the platform, AMD had nothing else to sell except for Bobcat/Jaguar, Intel didn't have credible GPU performance, and Jen-Hsun was still crying salty tears over never being able to get an x86 license.
>>
>>59062303
that what they say with fusion back then, just think 300 watt apu lol hope. discrete GPU will stay like how ddr still use even with hbm
>>
>>59062971

kek
>>
>>59063980
Um. They're projecting 200watts for a 16 TFLOP full precision APU, which sounds reasonable. A 4 TFLOP one won't be 300 watts. Especially in 2030 when it's on 4nm fab or smaller instead of 7nm.
Thread posts: 98
Thread images: 14


[Boards: 3 / a / aco / adv / an / asp / b / bant / biz / c / can / cgl / ck / cm / co / cock / d / diy / e / fa / fap / fit / fitlit / g / gd / gif / h / hc / his / hm / hr / i / ic / int / jp / k / lgbt / lit / m / mlp / mlpol / mo / mtv / mu / n / news / o / out / outsoc / p / po / pol / qa / qst / r / r9k / s / s4s / sci / soc / sp / spa / t / tg / toy / trash / trv / tv / u / v / vg / vint / vip / vp / vr / w / wg / wsg / wsr / x / y] [Search | Top | Home]

I'm aware that Imgur.com will stop allowing adult images since 15th of May. I'm taking actions to backup as much data as possible.
Read more on this topic here - https://archived.moe/talk/thread/1694/


If you need a post removed click on it's [Report] button and follow the instruction.
DMCA Content Takedown via dmca.com
All images are hosted on imgur.com.
If you like this website please support us by donating with Bitcoins at 16mKtbZiwW52BLkibtCr8jUg2KVUMTxVQ5
All trademarks and copyrights on this page are owned by their respective parties.
Images uploaded are the responsibility of the Poster. Comments are owned by the Poster.
This is a 4chan archive - all of the content originated from that site.
This means that RandomArchive shows their content, archived.
If you need information for a Poster - contact them.