#2701 | |
|
Epsilon plus three
Join Date: Feb 2002
Location: Chania
Posts: 7,768
|
Quote:
__________________
People are more violently opposed to fur than leather; because it's easier to harass rich ladies than motorcycle gangs. |
|
|
|
|
|
#2702 |
|
Senior Member
|
I, on the other hand, believe that CPU-style caches don't scale. LRB's rendering pipeline is ample proof of that. We'll need scratch pad memories, just like the Cell and GPUs of today. However, the one thing I'd change over Cell is to allow vector scatter/gather from global memory as well, and not just async DMAs.
Cell programmers might be banging their heads against walls, stones, etc. But GPU programmers have got on pretty fine with CUDA over the last 2.5 years. |
|
|
|
|
#2703 | |
|
Nutella Nutellae
Join Date: Feb 2002
Location: San Francisco
Posts: 4,297
|
Quote:
edit: sooner or later nvidia & ati will add proper coherent r/w caches to their architectures, it's just a matter of time.
__________________
[twitter] More samples, we need more samples! [Dean Calver] The opinions expressed herein are my own personal opinions and do not represent my employer's view in any way |
|
|
|
|
|
#2704 | |
|
Member
Join Date: Sep 2002
Posts: 559
|
Quote:
Engine clock is having a larger impact here. Note that engine speed regulates more than just ALU speed, it also controls ROP performance, vertex rates, etc. -FUDie
__________________
Ph.D. - Piled Higher and Deeper |
|
|
|
|
|
#2705 | ||
|
Senior Member
|
Quote:
Quote:
|
||
|
|
|
|
#2706 |
|
Nutella Nutellae
Join Date: Feb 2002
Location: San Francisco
Posts: 4,297
|
With naive/simple hw implementations.
__________________
[twitter] More samples, we need more samples! [Dean Calver] The opinions expressed herein are my own personal opinions and do not represent my employer's view in any way |
|
|
|
|
#2707 |
|
Senior Member
|
Maybe it is possible to reduce the O(p^2) to something lower, but I am still waiting for something that uses the r/w coherency of caches on an O(50) core chip with high performance.
|
|
|
|
|
#2708 |
|
Tiled
Join Date: Oct 2003
Location: Kings Langley, UK
Posts: 2,675
|
Those for HD 5870 are done, and were done before I started work on GF100 (thanks Alex!). We'll publish on it soon.
__________________
A major redesign of the core ALU pineapple boomerang fortress. |
|
|
|
|
#2709 |
|
Darlek ******
Join Date: Jun 2004
Posts: 9,498
|
GF100? Where did this come from? I know about G300, but GF100???
edit: and GT212, what the bloody hell is that?
__________________
Guardian of the Most holy Two Terabytes of Gaming Goodness™ |
|
|
|
|
#2710 |
|
Anas platyrhynchos
Join Date: Jul 2004
Location: Finland
Posts: 4,373
|
Go back to post nr. 2548 and read forward.
__________________
http://www.youtube.com/watch?v=hpz9USr1RHg&feature=fvw |
|
|
|
|
#2711 | |
|
Senior Member
Join Date: Apr 2007
Posts: 1,393
|
Quote:
|
|
|
|
|
|
#2712 | |
|
Meh
Join Date: Mar 2004
Location: New York
Posts: 9,809
|
Quote:
__________________
What the deuce!? |
|
|
|
|
|
#2713 | |
|
Regular
|
Quote:
|
|
|
|
|
|
#2714 | |
|
Epsilon plus three
Join Date: Feb 2002
Location: Chania
Posts: 7,768
|
Quote:
Since you're asking questions, I hope some can now understand the reason for the intentionally false information in supposed roadmaps. They just "named" the D12U something like GTX280 1.5GB.
__________________
People are more violently opposed to fur than leather; because it's easier to harass rich ladies than motorcycle gangs. |
|
|
|
|
|
#2715 |
|
Senior Member
|
Isn't there supposed to be 32 KB of shared mem per block in DX11?
|
|
|
|
|
#2716 |
|
Senior Member
Join Date: Mar 2002
Location: msk.ru/spb.ru
Posts: 1,311
|
|
|
|
|
|
#2717 | |||
|
Meh
Join Date: Mar 2004
Location: New York
Posts: 9,809
|
Quote:
Quote:
Quote:
__________________
What the deuce!? |
|||
|
|
|
|
#2718 |
|
Meh
Join Date: Mar 2004
Location: New York
Posts: 9,809
|
Heh, where did you see 48? Theo didn't mention it
Ah, I see what you did thar! 1024/16-16=48
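For anyone following along, the arithmetic in that line can be checked directly. This is only a sketch of one possible reading: it assumes 1024 is a total array size in KB split evenly over 16 units, and that 16 KB of each unit's share is counted as L1. None of that is confirmed in the thread, and the variable names are made up for illustration.

```python
# Hypothetical reading of "1024/16-16=48"; these labels are assumptions, not confirmed specs.
TOTAL_ARRAY_KB = 1024   # assumed: total on-chip array, in KB
UNITS = 16              # assumed: number of units sharing that array
L1_KB_PER_UNIT = 16     # assumed: part of each unit's share counted as L1

per_unit_kb = TOTAL_ARRAY_KB // UNITS          # each unit's even share of the array
remaining_kb = per_unit_kb - L1_KB_PER_UNIT    # what is left once the assumed L1 part is removed

print(per_unit_kb, remaining_kb)  # 64 48
```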
__________________
What the deuce!? |
|
|
|
|
#2719 |
|
Tiled
Join Date: Oct 2003
Location: Kings Langley, UK
Posts: 2,675
|
There isn't 16KB of L1 per SM.
__________________
A major redesign of the core ALU pineapple boomerang fortress. |
|
|
|
|
#2720 |
|
Senior Member
Join Date: Mar 2002
Location: msk.ru/spb.ru
Posts: 1,311
|
|
|
|
|
|
#2721 |
|
Senior Member
|
Or is it 32KB per 16-wide SM (two in a cluster) for a grand total of 512 SPs in 16 clusters and 1024KB array?!
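The totals in that question are at least self-consistent, as a quick check shows. This is a sketch of the hypothetical layout being asked about, not a confirmed configuration; the constant names are invented for illustration.

```python
# Hypothetical layout from the question above; not a confirmed configuration.
CLUSTERS = 16          # clusters on the chip
SMS_PER_CLUSTER = 2    # "two in a cluster"
SPS_PER_SM = 16        # "16-wide SM"
KB_PER_SM = 32         # "32KB per ... SM"

total_sms = CLUSTERS * SMS_PER_CLUSTER     # SMs across the whole chip
total_sps = total_sms * SPS_PER_SM         # SPs across the whole chip
total_array_kb = total_sms * KB_PER_SM     # combined per-SM memory, in KB

print(total_sms, total_sps, total_array_kb)  # 32 512 1024
```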
__________________
Apple: China -- Brutal leadership done right.
Google: United States -- Somewhat democratic. Microsoft: Russia -- Big and bloated. Linux: EU -- Diverse and broke. |
|
|
|
|
#2722 | |
|
Meh
Join Date: Mar 2004
Location: New York
Posts: 9,809
|
Quote:
__________________
What the deuce!? |
|
|
|
|
|
#2723 |
|
Tiled
Join Date: Oct 2003
Location: Kings Langley, UK
Posts: 2,675
|
It really has changed. I can't say (well I could) if Theo's right or not, but GF100 is not terribly GT200-like in places. All will be revealed later today anyway, not long to go now.
__________________
A major redesign of the core ALU pineapple boomerang fortress. |
|
|
|
|
#2724 |
|
Meh
Join Date: Mar 2004
Location: New York
Posts: 9,809
|
http://www.fudzilla.com/content/view/15741/1/
Yep, so Fuad says as well. JHH will give us the business during his keynote at 1pm EST. Delays aside, it's good to know we'll have something new to dissect over the next few months.
__________________
What the deuce!? |
|
|
|
|
#2725 | |
|
Member
Join Date: May 2007
Location: London
Posts: 235
|
Quote:
My present next year |
|
|
|
| Tags |
| nvidia, speculation |