Welcome, Unregistered.

If this is your first visit, be sure to check out the FAQ by clicking the link above. You may have to register before you can post: click the register link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below.

Reply
Old 09-May-2011, 08:19   #101
CarstenS
Senior Member
 
Join Date: May 2002
Location: Germany
Posts: 2,964
Send a message via ICQ to CarstenS
Default

That's precisely what I meant. No DP for example, crash in INT, but at least a score in Image Processing, which (the latter) wasn't the case with HD 5870 and older drivers.

edit: To be more precise: The combination of the rather oldish and not recently updated benchmark and AMDs (and partly Nvidias also) OpenCL drivers does not give a good indication of the performance one should be expecting from Cayman.
__________________
English is not my native tongue. Before flaming please consider the possiblity that I did not mean to say what you might have read from my posts.
Work| Recreation
Warning! This posting may contain unhealthy doses of gross humor, sarcastic remarks and exaggeration!

Last edited by CarstenS; 09-May-2011 at 18:09.
CarstenS is offline   Reply With Quote
Old 16-May-2011, 16:51   #102
trinibwoy
Meh
 
Join Date: Mar 2004
Location: New York
Posts: 9,947
Default

The histogram test shows the local atomics version running much faster than the global atomics on Fermi. I thought we had concluded that local atomics were still making a round trip through L2 and therefore shouldnt be any faster?
__________________
What the deuce!?
trinibwoy is offline   Reply With Quote
Old 16-May-2011, 22:50   #103
pcchen
Moderator
 
Join Date: Feb 2002
Location: Taiwan
Posts: 2,485
Default

Quote:
Originally Posted by trinibwoy View Post
The histogram test shows the local atomics version running much faster than the global atomics on Fermi. I thought we had concluded that local atomics were still making a round trip through L2 and therefore shouldnt be any faster?
Are you sure about this? Local atomics is in shared memory, so by definition it's in L1 cache, not L2 cache. Remember that local atomics only work within a block, so it shouldn't have to touch L2 cache.
pcchen is offline   Reply With Quote
Old 17-May-2011, 14:08   #104
trinibwoy
Meh
 
Join Date: Mar 2004
Location: New York
Posts: 9,947
Default

Quote:
Originally Posted by pcchen View Post
Are you sure about this? Local atomics is in shared memory, so by definition it's in L1 cache, not L2 cache. Remember that local atomics only work within a block, so it shouldn't have to touch L2 cache.
No I'm not sure as I haven't done the test myself but B3D's analysis found local and global atomics to be the same speed (actually locals were slower). Haven't seen any other evidence to dispute this besides the GPCBenchmark test.

http://www.beyond3d.com/content/reviews/55/14
__________________
What the deuce!?
trinibwoy is offline   Reply With Quote
Old 17-May-2011, 21:08   #105
pcchen
Moderator
 
Join Date: Feb 2002
Location: Taiwan
Posts: 2,485
Default

Quote:
Originally Posted by trinibwoy View Post
No I'm not sure as I haven't done the test myself but B3D's analysis found local and global atomics to be the same speed (actually locals were slower). Haven't seen any other evidence to dispute this besides the GPCBenchmark test.

http://www.beyond3d.com/content/reviews/55/14
Well, I don't know but my previous experiments on histogram also shows local atomics is faster than global atomics (local atomics version can do histogram at around 10GB/s on my GTX 460), but it's in CUDA. Of course, this still can't be completely ruled out as I didn't do any "maximum speed" test on atomics.
pcchen is offline   Reply With Quote
Old 18-Mar-2012, 19:47   #106
Man from Atlantis
Member
 
Join Date: Jul 2010
Location: Istanbul
Posts: 728
Default

i wonder GCN numbers
__________________
SiS 6326 > Ti 4200 > 9800XT > 9800GT > GTX 460
Celeron 366 > Celeron 1700 > Athlon XP 2500+ > E6300 > Q9650
Man from Atlantis is offline   Reply With Quote
Old 18-Mar-2012, 22:38   #107
CarstenS
Senior Member
 
Join Date: May 2002
Location: Germany
Posts: 2,964
Send a message via ICQ to CarstenS
Default

Quote:
Originally Posted by Man from Atlantis View Post
i wonder GCN numbers
Lemme just quick-quote myself:
Quote:
Originally Posted by CarstenS View Post
edit: To be more precise: The combination of the rather oldish and not recently updated benchmark and AMDs (and partly Nvidias also) OpenCL drivers does not give a good indication of the performance one should be expecting from Cayman.
This is also true for Tahiti. Some of the performance numbers in OpenCL Bench make sense, others just don't. Plus, the results vary wildly in some subtests from run to run.
__________________
English is not my native tongue. Before flaming please consider the possiblity that I did not mean to say what you might have read from my posts.
Work| Recreation
Warning! This posting may contain unhealthy doses of gross humor, sarcastic remarks and exaggeration!
CarstenS is offline   Reply With Quote
Old 23-Mar-2012, 17:33   #108
Man from Atlantis
Member
 
Join Date: Jul 2010
Location: Istanbul
Posts: 728
Default

Quote:
Originally Posted by CarstenS View Post
Lemme just quick-quote myself:

This is also true for Tahiti. Some of the performance numbers in OpenCL Bench make sense, others just don't. Plus, the results vary wildly in some subtests from run to run.
Thanks i wont ask for Kepler then
__________________
SiS 6326 > Ti 4200 > 9800XT > 9800GT > GTX 460
Celeron 366 > Celeron 1700 > Athlon XP 2500+ > E6300 > Q9650
Man from Atlantis is offline   Reply With Quote

Reply

Tags
opencl, stream sdk 2.01

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT +1. The time now is 06:46.


Powered by vBulletin® Version 3.8.6
Copyright ©2000 - 2014, Jelsoft Enterprises Ltd.