AMD: Zen 2 (Ryzen/Threadripper 3000?, Epyc 8000?) Speculation, Rumours and Discussion

Discussion in 'PC Industry' started by Deleted member 13524, Oct 8, 2018.

Tags:
  1. Gipsel

    Veteran

    Joined:
    Jan 4, 2010
    Messages:
    1,620
    Likes Received:
    264
    Location:
    Hamburg, Germany
    That's how the arrangement looks like:

    [​IMG]
     
  2. DieH@rd

    Legend

    Joined:
    Sep 20, 2006
    Messages:
    6,387
    Likes Received:
    2,411
    Zen 2 info from today's AMD presentation was interesting. Should be a great upgrade.

    On a GPU front, I did not find 7nm Vega to be interesting. They just optimized it a lot for professional use. The wait for Navi continues.
     
  3. fellix

    Veteran

    Joined:
    Dec 4, 2004
    Messages:
    3,552
    Likes Received:
    514
    Location:
    Varna, Bulgaria
    Another angle:

    [​IMG]
     
    hoom and Lightman like this.
  4. cheapchips

    Veteran

    Joined:
    Feb 23, 2013
    Messages:
    2,493
    Likes Received:
    2,665
    Location:
    UK
    Does the MI50 provide the first real world suggestion of next gen transistor count? 13.2bn on a 331mm² chip.
     
  5. CarstenS

    Legend Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,800
    Likes Received:
    3,920
    Location:
    Germany
    And another one. Someone should train an AI the can reduce the margin of error on the die's edges in not-so-optimal fotos....
     

    Attached Files:

    Lightman, iMacmatician and BRiT like this.
  6. mrcorbo

    mrcorbo Foo Fighter
    Veteran

    Joined:
    Dec 8, 2004
    Messages:
    4,024
    Likes Received:
    2,851
    Expanding on this a bit, the "chiplet" architecture for Epyc also caught my attention. Is a next-gen console with standalone CPU and GPU chiplets connected to a shared IO hub chip on a MCM plausible?
     
  7. Esrever

    Regular

    Joined:
    Feb 6, 2013
    Messages:
    846
    Likes Received:
    647
    No matter how you measure, that io die is HUGE. Looks like the io is about the same size as all 64 cores put together.

    I wonder how much cache it contains.
     
    Lightman and iMacmatician like this.
  8. fellix

    Veteran

    Joined:
    Dec 4, 2004
    Messages:
    3,552
    Likes Received:
    514
    Location:
    Varna, Bulgaria
    Using dodgy Photoshop measuring:

    Chiplet -- 71 mm²
    I/O Hub -- 442 mm²

    Assuming the package is the same as FCLGA-4094 for the current EPYC series (58.5 mm × 75.4 mm).
     
    Kej, Alexko, BRiT and 3 others like this.
  9. no-X

    Veteran

    Joined:
    May 28, 2005
    Messages:
    2,451
    Likes Received:
    471
    My result is very close, 73 mm² / 434 mm². Despite the huge 14nm central core, core to mm² ratio is still 67 % better compared to original Epyc.
     
    BRiT and Lightman like this.
  10. anexanhume

    Veteran

    Joined:
    Dec 5, 2011
    Messages:
    2,078
    Likes Received:
    1,535
    Even the Rome IO die doesn’t have the necessary bandwidth for GDDR6. They have to solve that and prove latency won’t kill performance.
     
  11. 3dilettante

    Legend Alpha

    Joined:
    Sep 15, 2003
    Messages:
    8,579
    Likes Received:
    4,799
    Location:
    Well within 3d
    Comparing the diagram that was shown first to the MCM picture indicates the IO die has 4 DDR channels on the top and bottom edge. The left and right sides have a pair of infinity fabric blocks next to each corner, with the middle third of the IO die's side given over to a non-differentiated IO block.
    Elsewhere, it's been stated that Rome has adopted PCIe 4.0.

    The CPU chips are grouped in an interesting fashion compared to the prior EPYC layout. In each quadrant of the MCM, there is a pair of CPU chiplets with much less space between them than there is between them and the IO die, and between them and the nearest chiplet pair below them.
    The horizontal midline of the package may be wider partly from the path the IO must take from the central die to the left and right sides.

    The lack of space between CPU pairs could mean there's a different topology than previously--and perhaps not as uniform as proposed earlier, if there's short-range links between them and the die nearest the IO chip is connected to both fabric links. While that may introduce an incremental amount of additional latency to the other die, this could give each die more overall bandwidth than giving each CPU chip one link to the IO block. If scaling the PCIe 4.0 bandwidth means xGMI is similarly double-bandwidth, raising the bandwidth of the on-package links to match would give each CPU pair enough link bandwith to utilize all 8 channels in some peak demand situation.

    Another possibility is the midline gap is broader in part to let each die's links wend their way to the IO die or to other clients as well.
     
    Alexko, hoom, BRiT and 1 other person like this.
  12. tunafish

    Regular

    Joined:
    Aug 19, 2011
    Messages:
    627
    Likes Received:
    414
    Assuming the die size measures above, and that it is as dense as the L3 of Zeppelin (including the sizes of tag arrays), the die composing of only cache would be ~160MB. 128MB seems reasonable, given inefficiencies and the space needed by IO and presumed directory entries (for dual socket coherency).

    That's actually quite a bit less than I expected.

    Another hypothesis is power. You want the main power and ground lines to go straight through to the chip, and the socket is designed for Zeppelins to be where those chiplets were placed.
     
    Lightman likes this.
  13. anexanhume

    Veteran

    Joined:
    Dec 5, 2011
    Messages:
    2,078
    Likes Received:
    1,535
    #33 anexanhume, Nov 6, 2018
    Last edited: Nov 6, 2018
    Lightman likes this.
  14. Tkumpathenurpahl

    Tkumpathenurpahl Oil Monsieur Geezer
    Veteran

    Joined:
    Apr 3, 2016
    Messages:
    1,910
    Likes Received:
    1,929
    I found it quite interesting that they're touting performance of 7.4TF FP64, 14.8TF FP32, 29.5 FP16, 59.5 INT8, and 118 INT4.

    It puts AMD very close to RTRT hardware by the end of this year.

    Am I right in thinking that some BVH accelerating hardware would take up little of the M160's 331mm2 die, but would put it in the same realm as the RTX2070 and 2080?

    If so, I'm very much coming round to the idea of RTRT in the next gen consoles. I'd still like a two tier launch, and at this rate, it looks like even a low tier option could be looking at some 8-10TF of RTRT capable hardware.

    The 6TF X1X renders Red Dead Redemption 2 at native 4K. Imagine what it would look like with an additional 4TF of RTRT and ML hardware taking care of all of the shadows, upscaling, and AA. Maybe reflections too, if that's not too costly.

    :yep2:

    :yes:

    :runaway::runaway::runaway:
     
    gamervivek likes this.
  15. Not having a memory controller on the CPU chiplet but keeping the PCIe actually leaves it with some interesting options.
    We know Vega 20 uses an out-of-chip Infinity Fabric with 50GB/s full-duplex per link. A future APU could use the same chiplet connected through one or two of these IF links, they'd use the GPU's memory controller (which has HBCC anyways). Most I/O could still be implemented through the chiplet using the PCIe lanes.

    Now, I don't think Sony or Microsoft will go back to multi-chip solutions when AMD seems to be able to mix&match CCXs with modular GPUs with relative ease (like the Sudor SoC). But some PC OEMss could now order MCM APUs to put into SFF PC/console hybrids, without having to pay for the development of a custom chip.


    Yes, though RTRT performance would still be terrible.
     
  16. beyondtest

    Newcomer

    Joined:
    Jun 3, 2018
    Messages:
    58
    Likes Received:
    13
    Could someone give me more insight?

    I've seen many negative comments in Anand saying the new product is 50% more dense but only 20% better in performance?
     
  17. DavidGraham

    Veteran

    Joined:
    Dec 22, 2009
    Messages:
    3,976
    Likes Received:
    5,213
    TF numbers alone are not enough to extrapolate performance between NVIDIA and AMD. NVIDIA has higher performance than their TF number would suggest. Because they compensate by having better polygon throughput, pixel filtrate, texturing and higher effective memory bandwidth. Also they have more advanced Tile Rendering than AMD.
     
  18. Called it. Ha!
    (Ok so it's not 12nm it's 14nm, bleh..)



    Things I want to know but they didn't release:

    1 - Where's the L3 cache? One would think it would belong in the chiplets to reduce latency and the new IF links don't seem to be wide enough to keep up with the bandwidth. OTOH the chiplets are tiny (way less than half of a Zeppelin) and that I/O die is huge. There's no way the I/O die has only DDR4 PHYs and IF glue, it has to have some huge cache for coherency (L4?).

    2 - Chiplet diagram. Is it still two 4-core CCXs? One 8-core CCX? Argh..
     
  19. anexanhume

    Veteran

    Joined:
    Dec 5, 2011
    Messages:
    2,078
    Likes Received:
    1,535
    1.25x performance for the same power. This is on TSMC, not AMD. Interconnect resistivity likely rearing its ugly head. Rumors were that Apple weren’t pleased either.

    Wouldn’t this give atrocious latency?
     
    beyondtest likes this.
  20. Tkumpathenurpahl

    Tkumpathenurpahl Oil Monsieur Geezer
    Veteran

    Joined:
    Apr 3, 2016
    Messages:
    1,910
    Likes Received:
    1,929
    I get that Nvidia's generally considered to be better, but AMD's going to provide the tech behind the PS5, for certain, and probably the XBoxTwo.

    And if a late 2018, 7nm iteration of Vega can be used for ML, their hardware isn't that far away from being comparable in RTRT aptitude.

    So my point isn't that AMD are going to come along with the best RTRT solution, just that it's promising that they're already so close to having one. It bodes well for industry support and inclusion in at least one of the PS5 and XBoxTwo.
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...