rendezvous said:I knew i shouldn't trust MPR.
What's MPR? ...microsoft PR? Actually I may have done MS a disservice not the other way around.
rendezvous said:I knew i shouldn't trust MPR.
BlueTsunami said:It sucks. I know. I also hate that I can read through your post...some or alot may not be right (as DeanoC stated, alot) and not be corrected.
rendezvous said:I knew i shouldn't trust MPR.
DeanoC said:No to be fair, the bits from MPR are largely right...
But really you don't know what version of Cell is being used, what RSX is and how Cell <-> RSX works. How likely are you to be right?
Trying to estimate things like bandwidth, with the information you have currently is futile.
scificube said:Cell must have it’s own memory controller to access the XDR ram it uses for main ram. RSX must have it’s own memory controller to access it’s pool of GDDR3. What is interesting is that RSX also has access to the XDR ram in the system. This allows RSX to access up 512 MB of Ram minus whatever Cell consumes which makes perfect sense. (no different than what Xenon consumes of the GDDR3 in X360) RSX has 22.4GB/s bandwidth to its pool of GDDR3. RSX can write 20GB/s to Cell and read 15GB/s from Cell. Cell does not have access to the GDDR3 in the system. What is most significant is that via Cell’s memory controller RSX had an additional 25.6GB/s read/write bandwidth with the XDR ram in the system. This provides RSX with 48GB/s to read/write from memory in the system on top of the read/write bandwidth between it and Cell.
Let's hope Kutaragi is rightTitanio said:edit - Here's a Kutaragi quote re. Cell accessing GDDR3:
"CELL and RSX have close relationship and both can access the main memory and the VRAM transparently. CELL can access the VRAM just like the main memory, and RSX can use the main memory as a frame buffer. They are just separated for the main usage, and do not really have distinction."
overclocked said:I dont understand how you can calculate the bandwidth for RSX to 48GB by adding the VRAM+XDRAM togheter.
As i se it as the bandwidth of VRAM is underwhelming you could "lock" a certain portion of the XDR memory and read from there as your framebuffer but as i understand you cant get more then the write(15GB) that the flexIO supports and you would still need to let the Cpu have this BW shared also(if my understanding of the bus i right of course) .
So in a game where the dev would need around 30GB of bandwidth you still have around 5-7 GB for Cell "info" left to be sent to the RSX. Thats how i have understand the hardware atleast, if some knows more well then im wrong.
scificube said:I seriously want to scrap this thread and never show my face again. I'm disgusted with myself.
nAo said:Let's hope Kutaragi is right
DeanoC said:No to be fair, the bits from MPR are largely right...
But really you don't know what version of Cell is being used, what RSX is and how Cell <-> RSX works. How likely are you to be right?
Trying to estimate things like bandwidth, with the information you have currently is futile.
scificube said:I seriously want to scrap this thread and never show my face again. I'm disgusted with myself.
scificube said:Just checked Ign (again) and they have it 20GB/s read from Cell and 15GB/s write to Cell for RSX.
To be clear on what this really means though, TTBOMK XeCPU can have 3 threads actually executing at any one time and Cell can have 8. The hardware threads on XeCPU and PPE are an optimization for context switches on stalls or task-switching.one said:Cell can handle 9 HW threads and Xenon can handle 6 HW threads
Thanks, I posted without reading most of scificube wrote so it seems my reckless post was out of context. Anyway superscalar PPE has some hardware resources to support 2-way fine-grained MT if not full execution resource for true SMT, therefore they can't be "logical" or software threads, as described in MPR article rendezvous probably indicated (sorry if it's a different article)Shifty Geezer said:In terms of program threads in execution at a given clock cycle then, it's 3 for XeCPU, 8 for Cell.
But I'm still wondering about what Crytek guy said about the difference between PPE threading and Xbox 360 CPU threading.The Cell Power core has hardware fine-grain multithreading. The multithreading design supports fine-grained multithreading with round-robin thread scheduling. If both threads are active, the processor will fetch an instruction from each thread in turn. When one thread cannot issue a new instruction or is not active, the other active thread will be allowed to issue an instruction every cycle. Threading does add some burden to the die size (around 7% in this case), as there must be duplicated register files, program counters, and parallel instruction buffers (before the decode stage).
Titanio said:Geez scifi, relax, it's been a good thread. I've liked it anyway
edit - about cell<->rsx, doesn't rsx write to cell at 15GB/s and read from it at 20GB/s? That's enough to allow it to consume XDR's bandwidth entirely if it wanted (XDR is 12.8GB/s read, 12.8GB/s write), and still leave some cell<->rsx bandwidth for cell to read and write directly from chip to chip (7.2GB/s to RSX and 2.2GB/s to read from it) , but of course, you'd have no xdr bandwidth left over for cell So assuming no Cell bandwidth usage, you could say RSX had 48GB/s to use. In reality it will be "up to" that figure depending on CPU usage and how much cpu<->gpu bandwidth you require for things other than direct memory transactions....?
Gholbine said:When you guys say that current PC GPUs have ~40GB/s of bandwidth, what does that mean? Bandwidth to what? I was under the impression that the CPU<=>GPU bandwidth in the next-generation consoles was streets ahead of current PCs.
Wow, I ought to read that article. This explains Deano's comments on PPE being used as a dual 1.6 GHz core, as if you have both threads active then they're interleaved. Have I missed the same sort of details on XeCPU's cores as well? I checked the Ars explanation that only seemed to confirm the second thread remains inactive unless the first becomes inactive, but I'm not absolutely sure that's the case.one said:...therefore they can't be "logical" or software threads, as described in MPR article rendezvous probably indicated...
If both threads are active, the processor will fetch an instruction from each thread in turn.
overclocked said:Are you sure about the 12,8GB read/write for XDR cause on the diagram you showed the bandwidth to/from RSX is on two arrows while the XDR inteface shows 25,6GB both ways..