If this is your first visit, be sure to check out the FAQ by clicking the link above. You may have to register before you can post: click the register link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below.
![]() |
|
|
#1 | ||
|
Regular
Join Date: Jun 2005
Posts: 15,103
|
Ok... I was searching for PS3 DLNA information and came across the article. Decided to post it here after a quick read.
Here's DeanoC hard at work on his blog... Quote:
P.S. Free beer from me next time you guys (or any of the pushing-the-envelope guys) stop by Bay Area. EDIT: Holy Sh*t ! Why didn't anyone highlight this before ? It will make a huge difference. Quote:
|
||
|
|
|
|
|
#2 |
|
Member
Join Date: Apr 2006
Posts: 215
|
|
|
|
|
|
|
#3 |
|
Member
Join Date: Oct 2005
Posts: 273
|
|
|
|
|
|
|
#4 |
|
Member
Join Date: Apr 2006
Posts: 215
|
Yeah, a snooping protocol actually makes a lot of sense for the EIB (high bandwidth for the coherence traffic, broadcasting to all SPUs).
|
|
|
|
|
|
#5 | |
|
Unruly Member
Join Date: Jul 2004
Location: Bunkyo-ku
Posts: 4,694
|
Quote:
http://forum.beyond3d.com/showpost.p...89&postcount=4 |
|
|
|
|
|
|
#6 |
|
Member
Join Date: Oct 2005
Location: Cambridge, UK
Posts: 236
|
I'm not sure it'll make a huge difference... infact, I'm interested as to why you think it would! Even without keeping this data in the 4 entry cache, it's my understanding that full LS to LS DMAs stay on the EIB.. they don't go via main memory.
So bearing that in mind, I'm not sure why Deano is describing a system where data goes out from LS, to main memory, and back to LS. As that simply doesn't happen in the case of LS->LS DMA. And surely the utilisation of the SPU cache in this way pretty much requires that in order to run at full speed the other SPUs you're communicating with are not evicting cache contents by performing other DMAs? So your system needs to be pretty static in terms of DMA usage to reap the full benefit of what is described. Cheers, Dean
__________________
Opinions I share here are my own, and should not be incorrectly interpreted as the views of SCEE, SCE, or Sony Corporation. |
|
|
|
|
|
#7 | |
|
Member
Join Date: Apr 2006
Posts: 215
|
Quote:
|
|
|
|
|
|
|
#8 | |
|
Member
Join Date: Oct 2005
Location: Cambridge, UK
Posts: 236
|
Quote:
Probably wouldn't affect things too much though, to be honest.. Dean
__________________
Opinions I share here are my own, and should not be incorrectly interpreted as the views of SCEE, SCE, or Sony Corporation. |
|
|
|
|
|
|
#9 | |
|
Senior Member
Join Date: Feb 2003
Location: Cambridge, UK
Posts: 1,396
|
Quote:
The ACU cache gives you a place to leave the data effectively on the ring bus for a while without knowing any details of the destination. Its partly LRU and AFAICT doesn't get evicted via normal DMA get, tho put does. Its also a high speed ring bus op, faster than normal ring bus movement. So its should always be better or the same as normal get. Its not perfect but it does appear to be better than the alternatives 'most' of the time. Which is true of all caches really.
__________________
Spouter of random rubbish @ http://blog.deanoc.com |
|
|
|
|
|
|
#10 | |||
|
Regular
Join Date: Jun 2005
Posts: 15,103
|
Quote:
Quote:
What is the time saved between an atomic cache write/read (cache hit) versus a LS atomic store/read (cache miss) for multiple SPUs ? Quote:
EDIT: Ah ! DeanoC replied with more juicy details. |
|||
|
|
|
|
|
#11 | |
|
Naughty Boy!
Join Date: Jan 2007
Posts: 136
|
Quote:
An interesting read This atomic cache implementation has been used in parallel processors for a while but this is the first time its been used in a consumer product. |
|
|
|
|
|
|
#12 |
|
Member
Join Date: Feb 2002
Posts: 532
|
Nice find and an interesting read, thanks!
|
|
|
|
|
|
#13 |
|
ea_spouse is H4WT!
Join Date: Feb 2002
Location: 53:4F:4E:59
Posts: 1,566
|
Well not much of a "find" considering Deano links to his blog in his sig... :P
__________________
"The sooner someone gets sued by Intel for violation, the sooner the patent can be revoked from orbit for gratuitous and wanton disregard for prior art and obviousness." ~TomF |
|
|
|
|
|
#14 |
|
meandering Velosoph
Join Date: Apr 2002
Location: Vienna
Posts: 3,677
|
Since this has been posted, in the Heavenly Sword thread already and there have been some really nice responses (thanks DeanA, DeanoC, etc.), I've decided to copy the posts over here, since some might miss those.
__________________
"Anybody can be a glutton, but only a true cyclist is a bottomless pit." - Ken Kifer (R.I.P.) "I think you'll find the improved video is a part of Sony's integration of the cutting edge Placebo technology. They've integrated it into all firmwares and this fabulous system provides all sorts of minor upgrades at very little developer cost. Great stuff!" - Shifty Geezer |
|
|
|
|
|
#15 |
|
Member
Join Date: Mar 2002
Location: California
Posts: 884
|
WOW!I learned something new today
__________________
"the anonymity of the internet gives people the ability to grow e-balls..." |
|
|
|
|
|
#16 |
|
Senior Member
Join Date: Apr 2002
Location: San Francisco, CA
Posts: 1,224
|
This reminds me, any fellow game developers in the Bay Area may want to check out the SF game dev meetup. It used to be at Thirsty Bear, but now it's at the Metreon. The next meeting should be around mid-June (check out the site for details). There are lots of local developers in attendance. It's an informal get together, so please, no solicitors (i.e. people trying to sell middleware) or journalists.
|
|
|
|
|
|
#17 | |
|
Member
Join Date: Feb 2002
Posts: 532
|
Quote:
|
|
|
|
|
|
|
#18 |
|
Junior Member
Join Date: Dec 2006
Posts: 55
|
This ACU - does it get used for successive NON128-byte aligned DMAs.. or just atomic ops.
e.g. lets say you're streaming through a list of 96 byte objects*, - do the crossover cachelines get buffered instead of adding main-memory accesses for the overlap.. Up until now ive' been thinking in terms of manually buffering this sort of data with larger 128byte aligned loads (i.e. to get multiple missaligned objects in togther back to back) |
|
|
|
|
|
#19 | |
|
Senior Member
Join Date: Feb 2003
Location: Cambridge, UK
Posts: 1,396
|
Quote:
The atomic ops have to be 128 byte aligned as well, when the SPU does a <128 byte atomic it grabs the whole line and the masks out the bit you want. So I suspect its not going to be very helpful in the 96 byte case.
__________________
Spouter of random rubbish @ http://blog.deanoc.com |
|
|
|
|
![]() |
| Bookmarks |
| Thread Tools | |
| Display Modes | |
|
|