nelg said:
Uttar said:
The low-end will be based on the R300 for sure.
That is music to my ears.
Bah, wait 3-4 more months and you'll get the NV42/NV43, low-end parts fully replacing the NV34, with support for Shaders 3.0. and much more acceptable performance than the NV34.
0.13u LowK at TSMC for one, I suspect. Maybe 0.11u SOI at IBM for the other, or maybe TSMC is going to support 0.11u too even though they're rushing 0.09u - strange choice that'd be though!
*entering speculation mode*
NV35 = 130M
NV36 = slightly more than 80M
That means 50M transistors saved - on what, though?
Transistor costs in the NV3x AFAIK:
Full FP32 unit: 3M/component ( -> 12M for Vec4 -> 48M on the NV30/ )
VS Unit: 9M ( 3 in both the NV35 and NV36 ).
So, 24M transistors are saved from reducing the number of full PS units; BUT there also are mini-units! I've got no idea about how much they cost, but I'd suspect them plus ROPs, ... to cost at least 8M per "pipeline".
So we got 40M saved out of 50M required. The rest most likely comes from cache size reductions, mostly in the LMA. I've confirmed through a few small testings that the number of registers/quad in the NV36 is the same as in the NV35.
Roughly, the NV34 is the NV31 (which is nearly identical to the NV36 beside a lot more stuff is bugged and it's FX12), is:
80M - ( 2x10M for VS ) = 60M
60M - 5M for LMA = 50M
50M - 10M for smaller caches (less registers,...) and some of the LMA caches simply gone = 45M
To get to the 130M transistor count of the NV35, we do:
3x9 = 27M (VS)
3x4x4 = 48M (PS full)
8x4 = 32M (rest of pixel pipeline)
10+5 = 15M ( LMA )
27+48+32+15 = 122M
Add 8M for 2D, Triangle Setup, Rasterization, and so on to get to 130M.
Remember this is all speculation guys.
Thus, let us imagine a similar scenario for the NV40/NV41/NV42!
Let's imagine the NV40 has 5 VS units (no idea really) and 8 pipelines.
Each VS unit would cost 10M.
LMA would cost 20M.
Misc. would cost 10M.
This means 50+20+10 = 80M for non-pixel stuff, and 95M for pixel pipelines (this includes EVERYTHING pixel-related: ROPs, FPUs, pseudo-TMUs, ...)
So, a 3VS & 4 pipeline NV41 would be:
30+15+10+50 = 105M
Hmm, a mid-end part at 100M? Sounds logical to me considering the production will start in late H1 or early H2 2004 on 0.11u!
Then, the NV42/NV43 would be 1VS & 4 pipeline, with each pipeline having some retrieved 25% units/cache, and with less transistors for LMA/Misc/... thus:
10+10+10+35 = 65M
Reasonable target IMO, considering it's done on 0.13u and after some time will maybe even be done at 0.11u - although considering those transistor figures, it's possible the NV34 will survive afterwards, just like the GF4MX did (could very well be the end of the GF4 MX now though, thanks god! Or at least, only the current TNT2 M64 buyers will still consider a GF4 MX(420, 64-bit), eh).
However, considering those transistor figures, I'd say the NV42/NV43 will be MUCH more viable DX9 solutions than the NV34 ever will be. Unless NVIDIA screws up and does a 2x2/4x1, meh, then it would suck, yes....
Assuming the cost per-transistor for performance increased by 20% due to new features but it balances out to only 10% due to increased efficiency compared to the NV3x... And considering a nearly 20% clock speed disadvantage, too, compared to the NV36, we could say it's about 25% slower PS-wise.
Not bad at all, for a $99 part!
*stops useless speculation*
Uttar