Welcome, Unregistered.

If this is your first visit, be sure to check out the FAQ by clicking the link above. You may have to register before you can post: click the register link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below.

Reply
Old 04-Jul-2010, 03:08   #901
Silent_Buddha
Regular
 
Join Date: Mar 2007
Posts: 8,961
Default

So by implication the next round of desktop ATI cards should hit in or before October. It would be a bit odd if the mobility parts launched before desktop parts.

Unless those are just rebranding, in which case I'll be very disappointed by ATI. Same goes for Nvidia if 425 isn't a Fermi derivative.

Regards,
SB
Silent_Buddha is offline   Reply With Quote
Old 04-Jul-2010, 14:20   #902
TKK
Member
 
Join Date: Jan 2010
Posts: 140
Default

Quote:
Originally Posted by Silent_Buddha View Post
Unless those are just rebranding, in which case I'll be very disappointed by ATI. Same goes for Nvidia if 425 isn't a Fermi derivative.
I have a hunch that you'll be very disappointed

And no, I'm not implying that I know anything, but there was that supposedly leaked AMD roadmap that showed a Vancouver mobile refresh that is still UVD2 (+Eyefinity), whereas all SI-based chips seem to be UVD3.

I also read a rumor that AMD is/was working on respins/new steppings of their 5xxx mobile chips to improve yields or power consumption (or both).

So my guess is that Vancouver is just a refresh of the Evergreen-based mobile lineup to reduce manufacturing cost and/or improve performance-per-watt, and for marketing reasons it will introduce Eyefinity to mobile space and use the 6xxx naming scheme, to have "new" products in time for back-to-school season.

SI-based "real" 6xxx mobile chips probably won't come before Q1/Q2 2011. That's what the leaked roadmap claimed, and mobile chips usually appear a few months after their desktop equivalents, and since SI desktop models seem to be scheduled for Q4/2010 - Q1/2011 time-frame...
TKK is offline   Reply With Quote
Old 04-Jul-2010, 15:26   #903
neliz
MSI Man
 
Join Date: Mar 2005
Location: In the know
Posts: 4,885
Send a message via ICQ to neliz Send a message via MSN to neliz
Default

TKK, you're talking a lot of sense.
__________________
I miss you CJ, 1976 - 2010
neliz is offline   Reply With Quote
Old 04-Jul-2010, 19:03   #904
GZ007
Member
 
Join Date: Jan 2010
Posts: 416
Default

A radeon 5850 for a much more realistic price would be more than any new 6xxx card .
The price/performance is same(or worse ?) than the 4k cards. The new faster cards just cost much more than the older cards. At least for people who dont care for eyefinity (over 90% consumers) and other "features". (including DX11, which seems to be the dx with the most benchmarks and techdemos )


If the 6k cards will be on the same leaky 40nm TSMC than the 5k cards, than dont expect revolution. If they increase die size or buss width the new cards will cost more. The price/performance stays probably same.

Last edited by GZ007; 04-Jul-2010 at 19:24.
GZ007 is offline   Reply With Quote
Old 04-Jul-2010, 21:32   #905
CRoland
Member
 
Join Date: Jan 2010
Posts: 114
Default

Quote:
Originally Posted by GZ007 View Post
A radeon 5850 for a much more realistic price would be more than any new 6xxx card .
The price/performance is same(or worse ?) than the 4k cards. The new faster cards just cost much more than the older cards. At least for people who dont care for eyefinity (over 90% consumers) and other "features". (including DX11, which seems to be the dx with the most benchmarks and techdemos )
There's of course power consumption and thus noise, which makes me want to switch my 4850 to 5750 or even lower when the prices come down.

Last edited by CRoland; 04-Jul-2010 at 21:33. Reason: typo
CRoland is offline   Reply With Quote
Old 04-Jul-2010, 22:07   #906
homerdog
hardly a Senior Member
 
Join Date: Jul 2008
Location: still camping with a mauler
Posts: 3,637
Default

Quote:
Originally Posted by GZ007 View Post
If the 6k cards will be on the same leaky 40nm TSMC than the 5k cards, than dont expect revolution. If they increase die size or buss width the new cards will cost more. The price/performance stays probably same.
Obviously we wouldn't expect any price/performance revolutions if they stay on the same node. They could perhaps dramatically increase tessellation performance, but this hardly matters for most games.
homerdog is offline   Reply With Quote
Old 04-Jul-2010, 22:16   #907
no-X
Senior Member
 
Join Date: May 2005
Posts: 2,038
Default

They can increase efficiency. Evergreen architecture scales quite poorly with the number of SIMDs (simply - HD5850 OCed to HD5870 level performs the same despite 160 SPs and 8 TMUs difference).

I expect higher efficiency (per-flop performance), higher tesselation+geometry performance and better texture filtering.
__________________
Sorry for my English. But I hope it's better than your Czech
no-X is offline   Reply With Quote
Old 04-Jul-2010, 23:34   #908
Psycho
Member
 
Join Date: Jun 2008
Location: Copenhagen
Posts: 554
Default

That "effiency" and tesselation performance is only really an issue on 58x0, which has sort of out-grown the design. So I don't see much reason to update the lower end (until a new node).
Psycho is offline   Reply With Quote
Old 05-Jul-2010, 07:57   #909
no-X
Senior Member
 
Join Date: May 2005
Posts: 2,038
Default

That's right for tesselation, but scaling isn't great even on HD5700 (but I must admit it's much better than on HD5800):

http://en.inpai.com.cn/doc/enshowcont.asp?id=7688

at the average:
400 -> 640 SPs (+60%) = +7,8% gaming performance
720 -> 800 SPs (+11%) = +6% gaming performance
1440 -> 1600 SPs (+11%) = +1,8% gaming performace

the test isn't optimal, HD5800 may be CPU limited in some cases, but the results are very similar even for those games, which are GPU bottlenecked...
__________________
Sorry for my English. But I hope it's better than your Czech
no-X is offline   Reply With Quote
Old 05-Jul-2010, 09:35   #910
Alexko
Senior Member
 
Join Date: Aug 2009
Posts: 2,023
Send a message via MSN to Alexko
Default

There must be disabled ROPs or something like that on the Juniper-based HD 5670.
Alexko is offline   Reply With Quote
Old 05-Jul-2010, 14:30   #911
mczak
Senior Member
 
Join Date: Oct 2002
Posts: 2,435
Default

Quote:
Originally Posted by no-X View Post
400 -> 640 SPs (+60%) = +7,8% gaming performance
720 -> 800 SPs (+11%) = +6% gaming performance
1440 -> 1600 SPs (+11%) = +1,8% gaming performace

the test isn't optimal, HD5800 may be CPU limited in some cases, but the results are very similar even for those games, which are GPU bottlenecked...
I think a 6% increase for 11% more SPs all else being the same is quite ok for a "balanced" architecture. Clearly though, the 1.8% of the HD58xx for the same 11% increase are not.
(And yes both the HD5830 and Juniper HD5670 don't scale normally but that has nothing to do with bad scaling of the architecture but everything with the strange effects of disabling half the rops with these chips.)
mczak is offline   Reply With Quote
Old 05-Jul-2010, 14:56   #912
Dave Baumann
Gamerscore Wh...
 
Join Date: Jan 2002
Posts: 12,948
Default

Given the highest resolution tested there is 1080P thats not really saying anything conclusive.
__________________
Expand. Accelerate. Dominate.
Tweet Tweet!
Dave Baumann is offline   Reply With Quote
Old 05-Jul-2010, 17:55   #913
no-X
Senior Member
 
Join Date: May 2005
Posts: 2,038
Default



These situations aren't CPU limited, evidently. HD5800 barely hits 60FPS, so higher resolution would reduce framerates to hardly playable levels.

+11% of SPs and TMUs translates to 1-3% performance advantage.

Another example - at more demanding settings:



HD5970 and 2x HD5850 - both of them have the same core and memory clocks. HD5970 has an advantage of 320 SPs and 16 TMUs (+11%). But real performance is only 1-3% better, again.
__________________
Sorry for my English. But I hope it's better than your Czech
no-X is offline   Reply With Quote
Old 05-Jul-2010, 18:03   #914
Picao84
Member
 
Join Date: Feb 2010
Posts: 293
Default

Quote:
Originally Posted by no-X View Post
That's right for tesselation, but scaling isn't great even on HD5700 (but I must admit it's much better than on HD5800):

http://en.inpai.com.cn/doc/enshowcont.asp?id=7688

at the average:
400 -> 640 SPs (+60%) = +7,8% gaming performance
720 -> 800 SPs (+11%) = +6% gaming performance
1440 -> 1600 SPs (+11%) = +1,8% gaming performace

the test isn't optimal, HD5800 may be CPU limited in some cases, but the results are very similar even for those games, which are GPU bottlenecked...
I did refer this some time ago, and I was almost cruxified at the time. I speculated that this still R600 based architecture was hiting its limits, with more and more diminishing returns for each SP.
Picao84 is offline   Reply With Quote
Old 05-Jul-2010, 18:55   #915
DavidGraham
Member
 
Join Date: Dec 2009
Posts: 582
Default

Quote:
Originally Posted by Picao84 View Post
I did refer this some time ago, and I was almost cruxified at the time. I speculated that this still R600 based architecture was hiting its limits, with more and more diminishing returns for each SP.
We could be saying the same about Nvidia SPs , just look at GTX 285 and GTX470, the latter is actually barely 20% faster , despite having 208(80%) more SPs .

Now I know there are many differences between the two , like bandwidth , Texture Mapping , clocks , but still ..
DavidGraham is offline   Reply With Quote
Old 05-Jul-2010, 18:55   #916
Dave Baumann
Gamerscore Wh...
 
Join Date: Jan 2002
Posts: 12,948
Default

Quote:
Originally Posted by no-X View Post
These situations aren't CPU limited, evidently. HD5800 barely hits 60FPS, so higher resolution would reduce framerates to hardly playable levels.
I can't see those graphs, but when talking about architecture scaling the playability is a different thing. However, the higher the performance the graphics card the more the CPU is always going to be a factor. I have the full suite of data for Cypress 18 vs 20 SIMD's because we analyzed this and there are many cases where the difference is much greater than seen here - there are also specific cases where is scales to its peak (shader and compute tests).

Quote:
HD5970 and 2x HD5850 - both of them have the same core and memory clocks. HD5970 has an advantage of 320 SPs and 16 TMUs (+11%). But real performance is only 1-3% better, again.
Thats not an apples to apples comparison. One is passing two GPU's worth of data through one PCI Express link while the other is passing two GPU's worth of data through two links.
__________________
Expand. Accelerate. Dominate.
Tweet Tweet!
Dave Baumann is offline   Reply With Quote
Old 05-Jul-2010, 20:30   #917
GZ007
Member
 
Join Date: Jan 2010
Posts: 416
Default

Quote:
Originally Posted by CRoland View Post
There's of course power consumption and thus noise, which makes me want to switch my 4850 to 5750 or even lower when the prices come down.
Yep, but strange is the 57xx cards are just 128bit , have smaller die area, less power consumption and yet they sell more or same like the old 48xx cards. (maybe just the tsmc 40nm combination with nvidias lineup )
Lets hope new cards wont cost 800 dolars for similar 4800-->5800 performance increase in the future.
GZ007 is offline   Reply With Quote
Old 05-Jul-2010, 21:01   #918
Picao84
Member
 
Join Date: Feb 2010
Posts: 293
Default

Quote:
Originally Posted by GZ007 View Post
Yep, but strange is the 57xx cards are just 128bit , have smaller die area, less power consumption and yet they sell more or same like the old 48xx cards. (maybe just the tsmc 40nm combination with nvidias lineup )
Lets hope new cards wont cost 800 dolars for similar 4800-->5800 performance increase in the future.
And you should know you are thinking wrong. Costs of production are barely relevant in a "free market". And product actual value depends on competition. ATI is just playing by "free markets" rules there. Without a capable competition from nVIDIA or other player, price is high of course.
Picao84 is offline   Reply With Quote
Old 05-Jul-2010, 21:29   #919
no-X
Senior Member
 
Join Date: May 2005
Posts: 2,038
Default

Quote:
Originally Posted by Dave Baumann View Post
I have the full suite of data for Cypress 18 vs 20 SIMD's because we analyzed this and there are many cases where the difference is much greater than seen here
I'm trying to find weak points which could be improved to extrapolate possible character of "R9xx" GPUs.

You know the weak points and you know which of them will be solved by R9xx... That makes any discussion quite difficult

Quote:
Originally Posted by Dave Baumann View Post
there are also specific cases where is scales to its peak (shader and compute tests).
Yes. SPs can be fully utilized, but only by syntetic tests, not in games. This implies there's likely something unoptimal in GPU's front end. Maybe LDS conflicts, maybe somethink else...
__________________
Sorry for my English. But I hope it's better than your Czech
no-X is offline   Reply With Quote
Old 05-Jul-2010, 22:45   #920
AlexV
Heteroscedasticitate
 
Join Date: Mar 2005
Posts: 2,354
Default

Quote:
Originally Posted by no-X View Post
Yes. SPs can be fully utilized, but only by syntetic tests, not in games. This implies there's likely something unoptimal in GPU's front end. Maybe LDS conflicts, maybe somethink else...
This is a rather significant stretch. Merely looking at some traces from modern and semi-modern games outlines the fact that A LOT of stuff goes on to render a single frame, and there are a number of potential sticking points that extra ALUs in particular or hardware in general can't help with. There are quite a few things that can be done wrong and are done wrong. Examples: oh wow, i'm going wild with state switching (this will hurt), my vertices are described by an 49 bit(hypothetical) structure(great, i'm messing up the vertex cache). I've also decided that i need [maxvertexcount(lol)] for my geometry shader (this will hurt too). To top it off, I use a lot of clip/discard, branching like I was on a CPU, and draw the skybox first. And those are only a few examples. There are more nefarious things like stalling due to improper use of Map/Unmap etc., etc.

More ALUs allow you to do more math. Great, but that doesn't help with either of the above examples. There are other architectural traits that aren't neccessarily upscaled between the SKUs you're comparing (for example, equal export rate from differing stages of the pipe, equal setup rate, equal Hier-Z rate etc.). What I'm getting at is that ultimately, these are reasonably complex machines running reasonably complex code, and a complex system should not be approached deterministically.
__________________
Donald Knuth: Science is what we understand well enough to explain to a computer. Art is everything else we do.
AlexV is offline   Reply With Quote
Old 06-Jul-2010, 04:39   #921
Dave Baumann
Gamerscore Wh...
 
Join Date: Jan 2002
Posts: 12,948
Default

Quote:
Originally Posted by no-X View Post
Yes. SPs can be fully utilized, but only by syntetic tests, not in games.
Well, they are being used in games, as thats where the increase is coming from.

And just looking at the graphs you linked, now that I can see them, we have a 52% increase in performance from (full) Cypress to Juniper on Crysis and a 77% increase on Just Cause; to me that indicates that Cyrsis is fairly CPU bound. Taking a closer look at the Just Cause numbers shows that Juniper SIMD scaling is at <2% but Cypress is actually at 3% - given the only element that doesn't scale by 2x between Cypress and Juniper is the front end geometry, these results would indicate that Just Cause is not being particularly bound by that.
__________________
Expand. Accelerate. Dominate.
Tweet Tweet!
Dave Baumann is offline   Reply With Quote
Old 06-Jul-2010, 08:12   #922
no-X
Senior Member
 
Join Date: May 2005
Posts: 2,038
Default

So the question is simple. Why gaming performance of Cypress doesn't scale well with the number of SIMDs, while it scales well with frequency?

HD5850 (1440SPs/72TMUs) clocked 3% higher than HD5870 (1600SPs/80TMUs - typo, thx to neliz) would outperform it. That's quite strange, isn't it?
__________________
Sorry for my English. But I hope it's better than your Czech

Last edited by no-X; 06-Jul-2010 at 09:15.
no-X is offline   Reply With Quote
Old 06-Jul-2010, 08:32   #923
neliz
MSI Man
 
Join Date: Mar 2005
Location: In the know
Posts: 4,885
Send a message via ICQ to neliz Send a message via MSN to neliz
Default

Quote:
Originally Posted by no-X View Post
So the question is simple. Why gaming performance of Cypress doesn't scale well with the number of SIMDs, while it scales well with frequency?

HD5850 (1440SPs/72TMUs) clocked 3% higher than HD5870 (1600SPs/72TMUs) would outperform it. That's quite strange, isn't it?
5870's don't have 72TMUs?!

On the other hand, with SIMD's disabled, your L2 cache remains intact. I'm not sure if that's a bottleneck there, but in some cases it might be better to have more cache available per SIMD? That and your Geometry/Vertex/Z assembler performance is also increased since that's also equal between 5850/5870. So, if the app is limited on anything but SIMD, you'd benefit of increased clocks.

(Dave has that data.. spill it! )
__________________
I miss you CJ, 1976 - 2010

Last edited by neliz; 06-Jul-2010 at 08:40.
neliz is offline   Reply With Quote
Old 06-Jul-2010, 09:04   #924
CarstenS
Senior Member
 
Join Date: May 2002
Location: Germany
Posts: 2,842
Send a message via ICQ to CarstenS
Default

Cache is tied to the MCs and a lesser extent ROPs - not SIMDs, isn't it?
__________________
English is not my native tongue. Before flaming please consider the possiblity that I did not mean to say what you might have read from my posts.
Work| Recreation
Warning! This posting may contain unhealthy doses of gross humor, sarcastic remarks and exaggeration!
CarstenS is offline   Reply With Quote
Old 06-Jul-2010, 09:21   #925
no-X
Senior Member
 
Join Date: May 2005
Posts: 2,038
Default

neliz: Fixed, thanks

CarstenS: Yes. I think neliz's point was that disabling SIMDs doesn't disable L2 (which isn't part of SIMDs) and because of that, HD5850 has more L2 per SIMD (ratio).
__________________
Sorry for my English. But I hope it's better than your Czech
no-X is offline   Reply With Quote

Reply

Tags
Барт, Кайман

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT +1. The time now is 18:03.


Powered by vBulletin® Version 3.8.6
Copyright ©2000 - 2013, Jelsoft Enterprises Ltd.