Welcome, Unregistered.

If this is your first visit, be sure to check out the FAQ by clicking the link above. You may have to register before you can post: click the register link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below.

Reply
Old 25-Jul-2010, 18:50   #4001
rpg.314
Senior Member
 
Join Date: Jul 2008
Location: /
Posts: 4,070
Send a message via Skype™ to rpg.314
Default

Quote:
Originally Posted by NathansFortune View Post
It wouldn't make sense for Nvidia to stick A2/3 and get a neutered chip when a silicon respin would fix a lot of their problems and increase yields...
A silicon respin could certainly improve upon many performance and efficiency metrics. However, a bigger question, at least for me, is whether there is a reasonable probability of this happening, assuming Fermi 2/Fermi's shrink is due this winter.
__________________
The views presented here are my own and not my employer's.
Quote:
Originally Posted by Alexko View Post
So in a nutshell, model [BLANK] will have [BLANK], up to [BLANK], and even [BLANK] for a power consumption of just [BLANK]. Impressive.
rpg.314 is offline   Reply With Quote
Old 25-Jul-2010, 19:05   #4002
Alexko
Senior Member
 
Join Date: Aug 2009
Posts: 2,023
Send a message via MSN to Alexko
Default

There can't be any shrink this winter, since TSMC's 40nm process is the smallest available.
Alexko is offline   Reply With Quote
Old 25-Jul-2010, 19:26   #4003
NathansFortune
Member
 
Join Date: Mar 2009
Posts: 539
Default

Quote:
Originally Posted by rpg.314 View Post
A silicon respin could certainly improve upon many performance and efficiency metrics. However, a bigger question, at least for me, is whether there is a reasonable probability of this happening, assuming Fermi 2/Fermi's shrink is due this winter.
TSMC and Global Foundries have stated 28nm isn't going to be ready until H2 2011. That means Nvidia need to get more mileage out of their current designs.
NathansFortune is offline   Reply With Quote
Old 25-Jul-2010, 19:44   #4004
DegustatoR
Senior Member
 
Join Date: Mar 2002
Location: msk.ru/spb.ru
Posts: 1,311
Default

Quote:
Originally Posted by Silent_Buddha View Post
Unless they were ditching GF100 entirely as ATI did with R520/R600. And instead there's a new modified chip coming out for the fall.
GF100B (or whatever they'll call it in the end) is what's coming in the Fall. There may be a more or less new (still Fermi-based at it's core) 40G GF100B replacement down the road but its fate will depend on a lot of factors, and I won't be surprised if they'll wait for 28HP for their next top-end GPU.
DegustatoR is offline   Reply With Quote
Old 25-Jul-2010, 19:48   #4005
rpg.314
Senior Member
 
Join Date: Jul 2008
Location: /
Posts: 4,070
Send a message via Skype™ to rpg.314
Default

Quote:
Originally Posted by NathansFortune View Post
TSMC and Global Foundries have stated 28nm isn't going to be ready until H2 2011. That means Nvidia need to get more mileage out of their current designs.
Must have missed it. Do you have a recent link?

Also, it means at best, we can expect a hybrid part this year from AMD.
__________________
The views presented here are my own and not my employer's.
Quote:
Originally Posted by Alexko View Post
So in a nutshell, model [BLANK] will have [BLANK], up to [BLANK], and even [BLANK] for a power consumption of just [BLANK]. Impressive.
rpg.314 is offline   Reply With Quote
Old 25-Jul-2010, 20:00   #4006
NathansFortune
Member
 
Join Date: Mar 2009
Posts: 539
Default

Quote:
Originally Posted by rpg.314 View Post
Must have missed it. Do you have a recent link?

Also, it means at best, we can expect a hybrid part this year from AMD.
I can't find the link, but it was from the TSMC Fab 15 article somewhere. The CEO said 40nm was their concern right now and 28nm is delayed until later in 2011, probably H2. Global Foundries have a similar outlook as 32nm for AMD and ARM is their primary concern.
NathansFortune is offline   Reply With Quote
Old 25-Jul-2010, 20:48   #4007
Blazkowicz
Senior Member
 
Join Date: Dec 2004
Location: Toulouse
Posts: 4,142
Default

How interesting, as People's Republic of China domestic CPU industry (loongson processors) stated they aim for 32nm at end of 2011 .
Blazkowicz is online now   Reply With Quote
Old 25-Jul-2010, 21:11   #4008
Alexko
Senior Member
 
Join Date: Aug 2009
Posts: 2,023
Send a message via MSN to Alexko
Default

Quote:
Originally Posted by NathansFortune View Post
I can't find the link, but it was from the TSMC Fab 15 article somewhere. The CEO said 40nm was their concern right now and 28nm is delayed until later in 2011, probably H2. Global Foundries have a similar outlook as 32nm for AMD and ARM is their primary concern.
As far as I'm aware, this is GlobalFoundries' latest public roadmap:



And I haven't heard of any changes to the 28nm schedule since then.
Alexko is offline   Reply With Quote
Old 26-Jul-2010, 01:16   #4009
aaronspink
Senior Member
 
Join Date: Jun 2003
Posts: 2,570
Default

Quote:
Originally Posted by Blazkowicz View Post
How interesting, as People's Republic of China domestic CPU industry (loongson processors) stated they aim for 32nm at end of 2011 .
They've stated a lot of things over time and they contract out the fabrication to non Chinese companies.
__________________
Aaron Spink
speaking for myself inc.
aaronspink is offline   Reply With Quote
Old 31-Jul-2010, 17:21   #4010
Man from Atlantis
Member
 
Join Date: Jul 2010
Location: Istanbul
Posts: 727
Default

GF104, GF100 Core Architecture Comparison
http://news.mydrivers.com/Img/20100730/02501268.jpg
GF104 SM architecture (part of the speculation)
http://news.mydrivers.com/Img/20100730/02503995.jpg
GF100 SM architecture
http://news.mydrivers.com/Img/20100730/02504021.jpg
NVIDIA graphics core in recent years, the evolution diagram
http://news.mydrivers.com/Img/20100730/02521912.jpg
G80, GT200, GF100, GF104 contrast the core memory and multithreading
http://news.mydrivers.com/Img/20100730/02521937.jpg
Man from Atlantis is offline   Reply With Quote
Old 02-Aug-2010, 04:08   #4011
trinibwoy
Meh
 
Join Date: Mar 2004
Location: New York
Posts: 9,809
Default

Did Nvidia beef up GF104's texture units? Was just browsing Damien's english review and it seems FP16 and RGB9E5 are now full speed as opposed to half speed on GF100.

__________________
What the deuce!?
trinibwoy is offline   Reply With Quote
Old 02-Aug-2010, 09:39   #4012
Alexko
Senior Member
 
Join Date: Aug 2009
Posts: 2,023
Send a message via MSN to Alexko
Default

Damien's reviews usually deserve a bit more than a quick browsing…

Quote:
Moreover, the texturing units have been improved to filter FP16 textures (as well as FP11, FP10 and RGB9E5) at full speed.
http://www.behardware.com/articles/7...e-gtx-460.html
Alexko is offline   Reply With Quote
Old 03-Aug-2010, 05:05   #4013
trinibwoy
Meh
 
Join Date: Mar 2004
Location: New York
Posts: 9,809
Default

Of course, thanks. Saw it on my second read through Wonder why they bothered.
__________________
What the deuce!?
trinibwoy is offline   Reply With Quote
Old 03-Aug-2010, 07:23   #4014
Chalnoth
 
Join Date: May 2002
Location: New York, NY
Posts: 12,678
Default

Quote:
Originally Posted by trinibwoy View Post
Of course, thanks. Saw it on my second read through Wonder why they bothered.
My first guess would be that it was something that was intended for the GF100 all along, but there was a bug in the hardware implementation that forced them to implement these modes with reduced performance.

As for why they would have wanted to go this route in the first place, well, that would make sense if they feel that these modes will become more and more common as time goes forward, and if the added hardware cost was minimal.
Chalnoth is offline   Reply With Quote
Old 03-Aug-2010, 13:54   #4015
mczak
Senior Member
 
Join Date: Oct 2002
Posts: 2,437
Default

Maybe the full-speed fp16 was just a later addition which didn't make it for GF100.
That said, it would imho make more sense for GF100 than GF104, since GF100 has lower tex:alu ratio (and also higher memory bandwidth / tex). Unless you think it doesn't matter for GF100 since it looks more useful for non-gaming usages anyway..
mczak is offline   Reply With Quote
Old 03-Aug-2010, 15:00   #4016
ShaidarHaran
hardware monkey
 
Join Date: Mar 2007
Posts: 3,905
Default

Interesting that the fp formats have seen performance increases from GF100->GF104, but the int formats have seen performance decreases. Also, there appears to be a hard cap @ 33.3 GTexels/s for 3 of the formats. Any thoughts as to what might be causing this? Is it a lack of cache or cache bandwidth? Some other architectural limitation? I don't think it's a lack of VRAM or VRAM bandwidth since GF104 out-performs GT200b in 2 of the 3 formats.
ShaidarHaran is offline   Reply With Quote
Old 03-Aug-2010, 16:38   #4017
TKK
Member
 
Join Date: Jan 2010
Posts: 140
Default

Quote:
Originally Posted by ShaidarHaran View Post
I don't think it's a lack of VRAM or VRAM bandwidth since GF104 out-performs GT200b in 2 of the 3 formats.
Also, if it was the case there should be a difference between the two GTX 460 variants, which isn't the case.
TKK is offline   Reply With Quote
Old 03-Aug-2010, 17:57   #4018
Gipsel
Member
 
Join Date: Jan 2010
Location: Hamburg, Germany
Posts: 987
Default

Quote:
Originally Posted by ShaidarHaran View Post
Also, there appears to be a hard cap @ 33.3 GTexels/s for 3 of the formats. Any thoughts as to what might be causing this? Is it a lack of cache or cache bandwidth? Some other architectural limitation? I don't think it's a lack of VRAM or VRAM bandwidth since GF104 out-performs GT200b in 2 of the 3 formats.
It's the theoretical max throughput of the 56 TMUs * 0.675 GHz = 37.8 GTexel/s. Obviously the efficiency (88%) is slightly lower than on AMD GPUs (~98% or so) for this simple tasks.
Gipsel is offline   Reply With Quote
Old 03-Aug-2010, 19:34   #4019
mczak
Senior Member
 
Join Date: Oct 2002
Posts: 2,437
Default

Quote:
Originally Posted by Gipsel View Post
It's the theoretical max throughput of the 56 TMUs * 0.675 GHz = 37.8 GTexel/s. Obviously the efficiency (88%) is slightly lower than on AMD GPUs (~98% or so) for this simple tasks.
I think the more interesting comparison is GTX470/480 - 60 TMUs *0.7 GHz = 42 GTexels/s and it is achieving 41.4 GTexels/s (for int8 only though) - 99%. So for some odd reason GF104 can achieve less of the peak potential of the tmus.
mczak is offline   Reply With Quote
Old 03-Aug-2010, 20:20   #4020
CarstenS
Senior Member
 
Join Date: May 2002
Location: Germany
Posts: 2,842
Send a message via ICQ to CarstenS
Default

I'm showing (almost) the same here. 33.8 GTex is the maximum i can get out of a stock GF104 with bilinear filtering. With trilinear it's a more expected 18.9 GTex/s. Together with the point sampling result of - again - 33.8 GTex/s I'm guessing, it's maybe interpolation or adress bound.

An HD5830 is literally miles away at 43.6 and 22.4 GTex/s.
__________________
English is not my native tongue. Before flaming please consider the possiblity that I did not mean to say what you might have read from my posts.
Work| Recreation
Warning! This posting may contain unhealthy doses of gross humor, sarcastic remarks and exaggeration!
CarstenS is offline   Reply With Quote

Reply

Tags
delay, fermi, geforce, gf100

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT +1. The time now is 19:52.


Powered by vBulletin® Version 3.8.6
Copyright ©2000 - 2013, Jelsoft Enterprises Ltd.