NVidia Ada Speculation, Rumours and Discussion

Jawed · Jul 10, 2021

Well, I never did find a speculation thread for NVidia's desktop GPU after Ampere, so here goes:

https://twitter.com/i/web/status/1412049686945427456

I have no idea about Lovelace versus Hopper. So I'm wondering if Hopper is solely a data centre GPU and so Lovelace is consumer/prosumer?

Rootax · Jul 10, 2021

I'm curious about the bandwitdh situation... I don't doubt they can increase raw power, but memory bandwitdh for gamers ?

A nVinfinity cache ? Wider bus (too costly imo ?) ? Or "just" new innovations on compression, culling, tiling, data path,... ?

techuse · Jul 10, 2021

I’ll take the under on double 3090 performance.

Dictator · Jul 10, 2021

techuse said:
I’ll take the under on double 3090 performance.

We could see another RT core Iteration.. . Who knows! But Yeah, That sounds realistic

pjbliverpool · Jul 10, 2021

I hope it doesn't take that long to come out. There's speculation in the RDNA3 thread of RDNA 3 landing by end of Q1 next year.

HLJ · Jul 10, 2021

Dictator said:
We could see another RT core Iteration.. . Who knows! But Yeah, That sounds realistic

That would be the 3rd iteration then, one could hope they would start going for L3 RT.

Dangerman · Jul 10, 2021

Jawed said:
I have no idea about Lovelace versus Hopper. So I'm wondering if Hopper is solely a data centre GPU and so Lovelace is consumer/prosumer?

RedGamingTech said that he heard that Lovelace isn't consumer, overall the near future could just be "Ampere Next" for next year and then "Ampere Next Next".

Rootax said:
I'm curious about the bandwitdh situation... I don't doubt they can increase raw power, but memory bandwitdh for gamers ?

A nVinfinity cache ? Wider bus (too costly imo ?) ? Or "just" new innovations on compression, culling, tiling, data path,... ?

I think a AD102/GA202 could have a 384-bit bus with a 192 MB cache, not sure if GDDR6X (hopefully not after the hot and hungry SKUs it's with) or GDDR7 (doable if Computex the earliest) at 24Gbps, would be able to feed the 2-2.3x rumors we have about the top RTX 40 SKUs. Overall I think the RTX 40 lineup could look like this:

AD102/GA202 SKUs

RTX Titan II:

o 144 SMs.
o 384 Bit Bus.
o 48 GB of GDDR*
o 2000-2500 USD

RTX 4090:

o 140 SMs.
o 384 Bit Bus.
o 24 GB of GDDR
o 1200 USD

RTX 4080:

o 116 SMs.
o 384 Bit Bus.
o 20 GB of GDDR (slower than 4090/Titan 2)**
o 800 USD

AD104/GA204 SKUs

RTX 4070:

o 82 SMs.
o 257 Bit Bus.
o 16 GB of GDDR
o 600 USD

RTX 4060:

o 64 SMs.
o 224 Bit Bus.
o 14 GB of GDDR (slower than 4070)**
o 450 USD

AD106/GA206 SKUs

RTX 4050:

o 46 SMs.
o 192 Bit Bus.
o 12 GB of GDDR
o 350 USD

RTX 4040:

o 34 SMs.
o 160 Bit Bus.
o 10 GB of GDDR (slower than 4050)**
o 250 USD

*RTX Titan II would be using double sided memory ala 3090, but I think Nvidia will want to bring back the Titan as the prosumer they can jack up the cost to whatever they want.

**4080/4060/4040 would have slower memory as it makes it easier for Nvidia to segment the line up in terms of performance and space SKUs. No Tis as Nvidia originally wanted for RTX 30 but AMD forced their hand.

TopSpoiler · Jul 10, 2021

MCM or monolithic?

Dangerman · Jul 10, 2021

TopSpoiler said:
MCM or monolithic?

Almost everyone says monolithic, big question is if say a RTX 4090 can hit 2.2x over a 3090 and so on.

Clukos · Jul 10, 2021

Assuming TSMC 5nm I can see this being twice as fast as high-end Ampere. Clock boost and better perf / watt would go a long way even without the architectural improvements.

xpea · Jul 10, 2021

Clukos said:
Assuming TSMC 5nm I can see this being twice as fast as high-end Ampere. Clock boost and better perf / watt would go a long way even without the architectural improvements.

Assume SEC 4LPP which is very close to TSMC 5P (much more competitive than previous round SEC 8nm vs TSMC 7nm)

Dangerman · Jul 10, 2021

xpea said:
Assume SEC 4LPP which is very close to TSMC 5P (much more competitive than previous round SEC 8nm vs TSMC 7nm)

But some rumors say TSMC N5 is the choice for RTX 40 since SEC's processes aren't yielding well.

Davros · Jul 10, 2021

Are Nvidia seriously going to name a chip after a porn star ?
Edit:
Since someone will ask

CarstenS · Jul 10, 2021

https://en.wikipedia.org/wiki/Ada_Lovelace

Regarded as one of the first if not the first computer programmers. Daugther of poet Lord Byron. They both died untimely at age 36.

Frenetic Pony · Jul 10, 2021

The three options Nvidia has for bandwidth are: HBM (expensive). Go back to GDDR6, which is now as fast as GDDR6x, and implement a 512bit bus. And go with a big LLC like AMD has done.

None of them are ideal of course; a 512bit bus has its own costs; as does a big LLC as we can see from RDNA2 having a deficit in deferred titles at high resolutions (wonder what effect a visibility buffer has on that). Regardless, for performance increases they're also heavily constrained by power. Even jumping from Samsung 8nm to TSMC 5nm there isn't enough power savings to double performance, so unless there's a major architecture shift simultaneously that notion is out.

As a guess, lower end of expectations is a switch to a big LLC and a 25% performance improvement (averaged) with a switch to GDDR6 to save money. But better is always possible.

Deleted member 13524 · Jul 11, 2021

Obviously this isn't coming to Lovelace, but with PCIe 6.0 bringing 128GB/s duplex (256GB/s raw) and DDR5 hitting ~100GB/s in a 2-module configuration, I wonder if we'll start seeing DRAM-less midrange GPUs that only bring large amounts of stacked LLC.

Jawed · Jul 11, 2021

Dangerman said:
RedGamingTech said that he heard that Lovelace isn't consumer, overall the near future could just be "Ampere Next" for next year and then "Ampere Next Next".

I think a AD102/GA202 [...]

What is "AD"?

What is "GA"?

Deleted member 13524 · Jul 11, 2021

Jawed said:
What is "AD"?

What is "GA"?

I think it's just a suggested prefix that uses "Ada" as acronym.

Picao84 · Jul 11, 2021

ToTTenTranz said:
Obviously this isn't coming to Lovelace, but with PCIe 6.0 bringing 128GB/s duplex (256GB/s raw) and DDR5 hitting ~100GB/s in a 2-module configuration, I wonder if we'll start seeing DRAM-less midrange GPUs that only bring large amounts of stacked LLC.

Wouldn't that make the GPU performance too much dependent on the rest of the system? Sounds like a PR nightmare.

Deleted member 13524 · Jul 11, 2021

Picao84 said:
Wouldn't that make the GPU performance too much dependent on the rest of the system? Sounds like a PR nightmare.

Is it that much different to being able to pair a RTX3090 with a 2-core/2-thread Celeron using the motherboard's chipset-driven PCIe 3.0 x4 slot that is usually full sized?
Or pairing a 5700G APU with just one module of DDR4-1600?

NVidia Ada Speculation, Rumours and Discussion

Jawed

Rootax

techuse

Dictator

pjbliverpool

B3D Scallywag

HLJ

Dangerman

TopSpoiler

Dangerman

Clukos

Bloodborne 2 when?

xpea

Dangerman

Davros

CarstenS

Moderator

Frenetic Pony

Deleted member 13524

Guest

Jawed

Deleted member 13524

Guest

Picao84

Deleted member 13524

Guest

Similar threads