RDNA6 going to have unified rentable units, with SMT4 like operation and will quadruple performance and brand new Primitive Shader² ® with advanced Radiance AI Cores ® which going to make Jensenn begg for mercy 😎 😛
approved by MLID ®
RDNA6 going to have unified rentable units, with SMT4 like operation and will quadruple performance and brand new Primitive Shader² ® with advanced Radiance AI Cores ® which going to make Jensenn begg for mercy 😎 😛
You forgot 7ghzRDNA6 going to have unified rentable units, with SMT4 like operation and will quadruple performance and brand new Primitive Shader² ® with advanced Radiance AI Cores ® which going to make Jensenn begg for mercy 😎 😛
approved by MLID ®
You forgot AMD RamDoubler(tm)RDNA6 going to have unified rentable units, with SMT4 like operation and will quadruple performance and brand new Primitive Shader² ® with advanced Radiance AI Cores ® which going to make Jensenn begg for mercy 😎 😛
approved by MLID ®
ML perf and feature set delta between GFX12 and GFX13 is too big. Like Adroc has already said FSR Diamond will be RDNA 5+ exclusive.Unfortunately that likely means FSR5 won't be able to run on RDNA 4 cards.
L1 shared paper via Work Distribution Crossbar implementation under 5.3 provided 140% perf uplift for P-GEMM. Much larger uplift than 16X L1 private or theoretical perf with zero latency and replication with slow mesh interconnect.Looks like this extends the concept from the 2020 Shared L1 paper to register files.
Best case scenario: exclusive to RDNA5+ cardsML perf and feature set delta between GFX12 and GFX13 is too big. Like Adroc has already said FSR Diamond will be RDNA 5+ exclusive.
Hunyh also said "natively optimized for Project Helix". Translation: FSR porting aint gonna happen.
Helix uses AT2.Best case scenario: exclusive to RDNA5+ cards
Worst case scenario: exclusive to Helix family
Helix uses off the shelf AT2.Worst case scenario: exclusive to Helix family
The install base is so marginally tiny they don't have to port anything.AMD should really make FSR4 work on RDNA3 and FSR5 on RDNA4. Compared to not even running it, it is much better when it runs - eventhough very slowly. It just gives the much better impression, even if RDNA3/4 owners might not like the performance hit.
sf4 was supposed to be sonoma valley (mendocino replacement) right ?It looks like there is sufficient capacity for bdie(SF4X?)
It seems the Samsung Foundry team is firmly convinced that AMD won't be giving them any orders
both soundwave & bumblebee on tsmc rather than samsungI suspect it was cancelled, as it seems to have been replaced by Shockwave and Bumblebee
Why would AMD do that?i hope one or two low end zen 7 socs ( especially the 15watt grimlock point 4) find their way to samsung 2nm
What is "CBDIE"?They still seem to think that CBDIE is out of the question.
If by "Korean memory guys" you mean SK Hynix and Samsung, then they probably weren't shocked. I think SK Hynix and Samsung are very well positioned to assess Micron's capabilities.I'm not sure if I should say this here, but Korean memory guys will likely be shocked by the news of Micron supplying hbm4.
Unfortunately, they are denying the claims, stating that the Qual(qualification tests) have not yet been completedWho takes these guys seriously?
Hope they ignore this BS. Even if it becomes flawless I don't want every game to look like the same Photorealistic game (UE5 bland look 2.0). Per-game training is possible but ain't gonna happen.Wonder if consoles with RDNA5 will have enought AI power for AI slop like DLSS5
"Power" is only a problem if they want to offer something similar.Wonder if consoles with RDNA5 will have enought AI power for AI slop like DLSS5
I'm not sure we even have numbers on on-paper TFLOPs yet (at least not for lower-precision GEMM formats), and the new CUs likely have seen some significant changes to the VGPR and/or caches as well, so I doubt anyone can give even a rough estimate except "higher than RDNA4".Do we have any idea about roughly what kind of GEMM throughput RDNA 5 will achieve compared to RDNA 4 and 50 series? Not talking about on paper TFLOPs here but actual perf differences assuming same number of CUs/SMs and clocks.
I think these two videos from coreteks are really good for gauging thatWonder if consoles with RDNA5 will have enought AI power for AI slop like DLSS5