NVidia 4000 Series RTX: Lovelace? Speculation, Rumours and Discussion

Discussion in 'Architecture and Products' started by Jawed, Jul 10, 2021.

Tags:
  1. Jawed

    Legend

    Joined:
    Oct 2, 2004
    Messages:
    11,484
    Likes Received:
    1,844
    Location:
    London
    Well, I never did find a speculation thread for NVidia's desktop GPU after Ampere, so here goes:



    I have no idea about Lovelace versus Hopper. So I'm wondering if Hopper is solely a data centre GPU and so Lovelace is consumer/prosumer?
     
  2. Rootax

    Veteran Newcomer

    Joined:
    Jan 2, 2006
    Messages:
    2,292
    Likes Received:
    1,733
    Location:
    France
    I'm curious about the bandwitdh situation... I don't doubt they can increase raw power, but memory bandwitdh for gamers ?

    A nVinfinity cache ? Wider bus (too costly imo ?) ? Or "just" new innovations on compression, culling, tiling, data path,... ?
     
  3. techuse

    Regular Newcomer

    Joined:
    Feb 19, 2013
    Messages:
    992
    Likes Received:
    606
    I’ll take the under on double 3090 performance.
     
  4. Dictator

    Regular Newcomer

    Joined:
    Feb 11, 2011
    Messages:
    591
    Likes Received:
    3,460
    We could see another RT core Iteration.. . Who knows! But Yeah, That sounds realistic
     
    PSman1700 likes this.
  5. pjbliverpool

    pjbliverpool B3D Scallywag
    Legend

    Joined:
    May 8, 2005
    Messages:
    8,928
    Likes Received:
    3,672
    Location:
    Guess...
    I hope it doesn't take that long to come out. There's speculation in the RDNA3 thread of RDNA 3 landing by end of Q1 next year.
     
    PSman1700 likes this.
  6. HLJ

    HLJ
    Regular Newcomer

    Joined:
    Aug 26, 2020
    Messages:
    529
    Likes Received:
    869
    That would be the 3rd iteration then, one could hope they would start going for L3 RT.
     
  7. Dangerman

    Newcomer

    Joined:
    Apr 1, 2014
    Messages:
    40
    Likes Received:
    8
    RedGamingTech said that he heard that Lovelace isn't consumer, overall the near future could just be "Ampere Next" for next year and then "Ampere Next Next".
    I think a AD102/GA202 could have a 384-bit bus with a 192 MB cache, not sure if GDDR6X (hopefully not after the hot and hungry SKUs it's with) or GDDR7 (doable if Computex the earliest) at 24Gbps, would be able to feed the 2-2.3x rumors we have about the top RTX 40 SKUs. Overall I think the RTX 40 lineup could look like this:


    AD102/GA202 SKUs

    RTX Titan II:

    o 144 SMs.
    o 384 Bit Bus.
    o 48 GB of GDDR*
    o 2000-2500 USD

    RTX 4090:

    o 140 SMs.
    o 384 Bit Bus.
    o 24 GB of GDDR
    o 1200 USD

    RTX 4080:

    o 116 SMs.
    o 384 Bit Bus.
    o 20 GB of GDDR (slower than 4090/Titan 2)**
    o 800 USD


    AD104/GA204 SKUs

    RTX 4070:

    o 82 SMs.
    o 257 Bit Bus.
    o 16 GB of GDDR
    o 600 USD

    RTX 4060:

    o 64 SMs.
    o 224 Bit Bus.
    o 14 GB of GDDR (slower than 4070)**
    o 450 USD

    AD106/GA206 SKUs

    RTX 4050:

    o 46 SMs.
    o 192 Bit Bus.
    o 12 GB of GDDR
    o 350 USD

    RTX 4040:

    o 34 SMs.
    o 160 Bit Bus.
    o 10 GB of GDDR (slower than 4050)**
    o 250 USD

    *RTX Titan II would be using double sided memory ala 3090, but I think Nvidia will want to bring back the Titan as the prosumer they can jack up the cost to whatever they want.

    **4080/4060/4040 would have slower memory as it makes it easier for Nvidia to segment the line up in terms of performance and space SKUs. No Tis as Nvidia originally wanted for RTX 30 but AMD forced their hand.
     
  8. TopSpoiler

    Joined:
    Aug 18, 2020
    Messages:
    8
    Likes Received:
    12
    MCM or monolithic?
     
  9. Dangerman

    Newcomer

    Joined:
    Apr 1, 2014
    Messages:
    40
    Likes Received:
    8
    Almost everyone says monolithic, big question is if say a RTX 4090 can hit 2.2x over a 3090 and so on.
     
    TopSpoiler likes this.
  10. Clukos

    Clukos Bloodborne 2 when?
    Veteran Newcomer

    Joined:
    Jun 25, 2014
    Messages:
    4,606
    Likes Received:
    4,156
    Assuming TSMC 5nm I can see this being twice as fast as high-end Ampere. Clock boost and better perf / watt would go a long way even without the architectural improvements.
     
  11. xpea

    Regular Newcomer

    Joined:
    Jun 4, 2013
    Messages:
    480
    Likes Received:
    598
    Assume SEC 4LPP which is very close to TSMC 5P (much more competitive than previous round SEC 8nm vs TSMC 7nm)
     
  12. Dangerman

    Newcomer

    Joined:
    Apr 1, 2014
    Messages:
    40
    Likes Received:
    8
    But some rumors say TSMC N5 is the choice for RTX 40 since SEC's processes aren't yielding well.
     
  13. Davros

    Legend

    Joined:
    Jun 7, 2004
    Messages:
    17,282
    Likes Received:
    4,698
    Are Nvidia seriously going to name a chip after a porn star ?
    Edit:
    Since someone will ask
    [​IMG]
     
    #13 Davros, Jul 10, 2021
    Last edited: Jul 10, 2021
    fehu and Lightman like this.
  14. CarstenS

    Legend Veteran Subscriber

    Joined:
    May 31, 2002
    Messages:
    5,658
    Likes Received:
    3,662
    Location:
    Germany
    Sxotty, T2098, Clukos and 4 others like this.
  15. Frenetic Pony

    Regular Newcomer

    Joined:
    Nov 12, 2011
    Messages:
    797
    Likes Received:
    461
    The three options Nvidia has for bandwidth are: HBM (expensive). Go back to GDDR6, which is now as fast as GDDR6x, and implement a 512bit bus. And go with a big LLC like AMD has done.

    None of them are ideal of course; a 512bit bus has its own costs; as does a big LLC as we can see from RDNA2 having a deficit in deferred titles at high resolutions (wonder what effect a visibility buffer has on that). Regardless, for performance increases they're also heavily constrained by power. Even jumping from Samsung 8nm to TSMC 5nm there isn't enough power savings to double performance, so unless there's a major architecture shift simultaneously that notion is out.

    As a guess, lower end of expectations is a switch to a big LLC and a 25% performance improvement (averaged) with a switch to GDDR6 to save money. But better is always possible.
     
  16. ToTTenTranz

    Legend Veteran

    Joined:
    Jul 7, 2008
    Messages:
    12,752
    Likes Received:
    7,766
    Obviously this isn't coming to Lovelace, but with PCIe 6.0 bringing 128GB/s duplex (256GB/s raw) and DDR5 hitting ~100GB/s in a 2-module configuration, I wonder if we'll start seeing DRAM-less midrange GPUs that only bring large amounts of stacked LLC.
     
    T2098 likes this.
  17. Jawed

    Legend

    Joined:
    Oct 2, 2004
    Messages:
    11,484
    Likes Received:
    1,844
    Location:
    London
    What is "AD"?

    What is "GA"?
     
  18. ToTTenTranz

    Legend Veteran

    Joined:
    Jul 7, 2008
    Messages:
    12,752
    Likes Received:
    7,766
    I think it's just a suggested prefix that uses "Ada" as acronym.
     
  19. Picao84

    Veteran Regular

    Joined:
    Feb 15, 2010
    Messages:
    2,022
    Likes Received:
    1,072
    Wouldn't that make the GPU performance too much dependent on the rest of the system? Sounds like a PR nightmare.
     
    Lightman likes this.
  20. ToTTenTranz

    Legend Veteran

    Joined:
    Jul 7, 2008
    Messages:
    12,752
    Likes Received:
    7,766
    Is it that much different to being able to pair a RTX3090 with a 2-core/2-thread Celeron using the motherboard's chipset-driven PCIe 3.0 x4 slot that is usually full sized?
    Or pairing a 5700G APU with just one module of DDR4-1600?
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...