If this is your first visit, be sure to check out the FAQ by clicking the link above. You may have to register before you can post: click the register link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below.
![]() |
|
|
#1 | ||
|
Member
Join Date: May 2003
Posts: 295
|
This new processor by Clearspeed seems to show the viability of a CELL-type chip.
http://www.newscientist.com/news/news.jsp?id=ns99994274 Quote:
I just discovered these articles, so I haven't done much background reading yet. But for me, this erases any doubt that 1-Tflop CELL is indeed possible. (EDIT: This means I think 1-Tflop cell is possible.) As for the price, keep in mind that this is a low-volume specialized chip made by a small company - most of the price is R&D, not production. For CELL, that will obviously not be the case. As I said in this post I believe that a architecture that recognizes and exploits the massive data parallelism in computer graphics will exhibit much better cost-effective performance. You can see Clearspeed's emphasis on parallel-data processing in their previous products. http://www.eetimes.com/semi/news/OEG20010611S0119 Quote:
__________________
With the return of school, I have regained the need to procrastinate, and hence, the need to come to Beyond3d... It's too bad all the posts I want to reply to are locked. |
||
|
|
|
|
|
#2 |
|
Senior Member
|
Interesting, the 25 GFLOPS must include DP FP as that would make it more impressive: the EE+GS chip consumes 8 Watts, needs only 86 mm^2 of area and the CPU delivers 6.2 GFLOPS.
Not bad considering the surface area for the chip. The EE portion uses about half the area so if we accepted a chip around 215 mm^2 we could fit 4 EE's and 1 GS using 90 nm technology: this would mean around 24.8 GFLOPS of performance. Of course these guys had to start from a different position and are probably using some different trick. With the limited production they are having $16k for a single chip is not unreasonable: the mighty Intel is selling Pentium 4 EE chips at $800 ( this is not entirely fair as Intel is making mad profits on this probably ) and those are in quite higher volume than this chips which is being sold in limited quantities to Research labs and Universities. These guys are also trying to get NRE costs back as well while selling these chips. PlayStation 3 will have CELL chips produced in high volumes and they will be pushing the target 65 nm manufacturing process ( relatively large die like the original GS which was 279 mm^2 of surface area ) quite a bit. The enphasys on highly parallel processing is a good assumption for multi-media ( 3D graphics included ) and scientific processing which is one of the goals of CELL: these are the biggest areas in which such a massive performance is needed. If a CELL CPU averages 10-30 GOPS and 10-30 GFLOPS running a port of Microsoft Word or the Mozilla web browser I would not necessarily be very saddened as that is more than what I need to run those applications and in multi-media applications we would see CELL shine ( maturity of compilers permitting ). |
|
|
|
|
|
#3 | |
|
Senior Member
|
Quote:
|
|
|
|
|
|
|
#4 | |
|
Join Date: Apr 2002
Posts: 2,158
|
Quote:
Just a word of advice. I first heard of Pixelfusion back in 1999 I think when they were trying to make a 250nm IC for graphics based on the same concept. Their design failed hard (ergo ClearSpeed) and was basically destroyed by the likes of Sony and nVidia with their respective ICs. If their architecture is anything like before in their use of functional constructs, then I applaud you for actually staking a position on this. It's like looking at the P10 and then blatently stating that the preformance levels seen in the NV3x and R3x00 are impossible. But, hey, to each his own. |
|
|
|
|
|
|
#5 |
|
Senior Member
|
To be honest, 250 nm was not the best technology to introduce that kind of vision to the market.
|
|
|
|
|
|
#6 | ||
|
Join Date: Apr 2002
Posts: 2,158
|
Quote:
Also, for my buddy Megadrive who likes the big numbers - this is the reason I told you transistor count is irelevent: Quote:
|
||
|
|
|
|
|
#7 | ||
|
Member
Join Date: May 2003
Posts: 295
|
Quote:
Are we on the same page here? Spelling it out clearly: I claim that the incredible power consumption/performance of THIS chip, which takes the same approach to computing as CELL, is a good indicator that CELL will have the same kind of power/performance. I'm not saying that this chip is a competitor to CELL, I'm saying that this chip shows that the CELL approach is a viable one. (Oh, and what do you mean by functional constructs? The processing units?)
__________________
With the return of school, I have regained the need to procrastinate, and hence, the need to come to Beyond3d... It's too bad all the posts I want to reply to are locked. |
||
|
|
|
|
|
#8 | |
|
Join Date: Apr 2002
Posts: 2,158
|
Quote:
I thought you ment, like, "erases any doubt you (as a skeptic) held that this could, somehow, be possible - so it's thus impossible" Argh, forget this.. I'm confusing myself again. |
|
|
|
|
|
|
#9 | ||
|
Member
Join Date: May 2003
Posts: 295
|
Quote:
__________________
With the return of school, I have regained the need to procrastinate, and hence, the need to come to Beyond3d... It's too bad all the posts I want to reply to are locked. |
||
|
|
|
|
|
#10 |
|
Senior Member
Join Date: Feb 2003
Location: United States
Posts: 1,974
|
I thought you meant the same "Seeing this I now know Cell will not hit 1tflops" when I saw your post earlier. I didn't comment, the sentence was confusing as hell.
__________________
WHEREAS, SCE and Toshiba have entered into a joint development agreement (the "[*] Agreement") with[*] to develop a broadband microprocessor (designated as the "Broadband Engine") for a[*] product; |
|
|
|
|
|
#11 | ||
|
Join Date: Apr 2002
Posts: 2,158
|
Quote:
Quote:
http://www.clearspeed.com/downloads/overview_cs301.pdf |
||
|
|
|
|
|
#12 | ||
|
Senior Member
|
Quote:
|
||
|
|
|
|
|
#13 | |
|
Join Date: Apr 2002
Posts: 2,158
|
Quote:
|
|
|
|
|
|
|
#14 | |
|
Senior Member
|
Quote:
|
|
|
|
|
|
|
#15 | |
|
Join Date: Apr 2002
Posts: 2,158
|
Quote:
Cell (eg. Broadband Engine) is truely this things bigger, steroid using, brother. Twice as many FPU/FXU's arranged in better constucts and hierarchies with more resources (be it Register, SRAM, eDRAM) avalable to it. And this isn't even looking at the differences in logic design, which STI ranks among the best. Much better design than this - although, in all fairness, this isn't targetted at high-end 3D anymore. |
|
|
|
|
|
|
#16 |
|
Senior Member
|
Vince, that processor is only running at 200 MHz and is realized in 130 nm technology.
At 65 nm you could fit twice the logic in the same die area or more and if you think about a 1-2 GHz closk-speed... 2x * 5 or 10 = ( rough estimate ) 10-20x the performance. Thinking about increasing the chip's surface area ( adding more logic ) and bringing the clock up further, we can end with a 40x increase of performance ( still following the same design ). 40 * 25.6 = 1 TFLOPS 25.6 GFLOPS at 200 MHz means 128 FP ops per cycle. Each PE has a dual issue FPU ( peak of 2 FP ops/cycle: 1 LOAD and 1 STORE: it does not have a bus supporting true MADD instructions for the FPU as we only have 2x32 bits busses going from the Register file to the Execution units, unless R1 = R2 * R3 + <constant> is good enough for you or maybe the diagram is slightly inaccurate ) and we have 64 PEs in this chip. 64 PEs * 1 FPU/PE * 0.2 GHz * 2 FP op/cycle = 25.6 GFLOPS/s An APU rated at 32 GFLOPS at 4 GHz does 8 operations per clock ( havin four mixed FP/FX Units it is understandable as a peak value for FP and Integer Vector MADD instructions ). 8 APUs do 64 ops/cycle and 4 PEs ( with 8 APUs each ) would do 256 FP ops/cycle. The CELL chip would basically only need twice the FP/FX Units, more e-DRAM and SRAM ( accepting also a bigger surface area for the chip ) and a frequency boost to achieve 1 TFLOPS: looking at the companies involved in CELL, looking at the technologies ( manufacturing processes, etc... ) they are working on and the R&D budget they have, etc... I think they can pull 1 TFLOPS of peak performance for PlayStation 3 ( might be most obtained from the CELL based CPU and part from the GPU or it might be all from the CPU, we will see ). I do not like the fact that the control portion of the chip has to manage so many independent sub-processors: I would prefer to have each PE with its own PU to manage the PE's workload. |
|
|
|
|
|
#17 | ||
|
Senior Member
|
Quote:
I like learning and discussing about technology, but not being used... Unless that meant "used as a play-tester of games for PSP and PlayStation 3 in SCE's labs" |
||
|
|
|
|
|
#18 | |
|
Join Date: Apr 2002
Posts: 2,158
|
Quote:
Hey, have you ever seen a micrograph of the the original EmotionEngine? |
|
|
|
|
|
|
#19 | |
|
Senior Member
Join Date: Feb 2003
Location: United States
Posts: 1,974
|
Quote:
Atleast on a purely marketing point of view. I can imagine people like Chap running around come a year going, "only 1ghz clock? xbox2 is 3ghz... Although I am truely doubting a 1Ghz clock..
__________________
WHEREAS, SCE and Toshiba have entered into a joint development agreement (the "[*] Agreement") with[*] to develop a broadband microprocessor (designated as the "Broadband Engine") for a[*] product; |
|
|
|
|
|
|
#20 | ||
|
Senior Member
|
Quote:
|
||
|
|
|
|
|
#21 | |||||
|
Naughty Boy!
Join Date: Jan 2003
Posts: 391
|
Quote:
Quote:
Quote:
Quote:
Quote:
|
|||||
|
|
|
|
|
#22 |
|
Senior Member
|
I think that for the APU's execution units and Registers 4 GHz is possible: not all the sections of the chip need to run at that speed though
|
|
|
|
|
|
#23 | |
|
Senior Member
|
Quote:
|
|
|
|
|
|
|
#24 | |||
|
Join Date: Apr 2002
Posts: 2,158
|
Quote:
Quote:
Which are the reason's I stated earlier that it's different than Cell. Quote:
|
|||
|
|
|
|
|
#25 | |
|
Naughty Boy!
Join Date: Jan 2003
Posts: 391
|
You left out this block of article.
Quote:
|
|
|
|
|
![]() |
| Thread Tools | |
| Display Modes | |
|
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Ken Kutaragi Interview | Mikage | Console Technology | 63 | 27-May-2005 20:24 |
| Pentium 4 based Cell processor | version | Console Technology | 10 | 27-Feb-2005 03:32 |
| Kutaragi Ken: "WAN Cell Computing" | ChryZ | Console Technology | 75 | 14-Sep-2003 23:26 |
| Kutaragi Ken: "Relation Between Cell & PS3" | ChryZ | Console Technology | 11 | 11-Sep-2003 08:41 |
| Microsoft to own every GPU? | Cyborg | 3D Architectures & Chips | 26 | 14-Jul-2002 11:15 |