Overclocking increases performance without the need for increasing the cache size, end of discussion.
Panajev2001a said:You hit an obvious wall there as soon enough the disparity between processor's clock-rate and memory's clock rate will be so high that each cache miss will be a too strong hit on your performance to make overclocking not worth your time and the heat generated (if the chip manages to clock that high that is ).
Sometimes's linear increases in clock-rate do produce a less than linear increment in performance that shrinks as the clock-rate grows.
Ah, well, I sort of was talking about the bigass FPGA arrays that are capable of simulating large processors with tens of millions of transistors (and these do run very slowly). I should have realized there were other types of FPGAs too, heh.nondescript said:No, not true...I've personally made FPGA-based logic clocked at over 100MHz, and the new generations can go much higher.
PC-Engine said:I totally agree with you. Yes eventually you'll hit a wall. :smile:
You have no ability to read whatsoever, do you? I said pretty clearly that the chances of a drop are pretty high. Oh, I'm sorry... that was in the middle of a paragraph, so it must have completely lost you, what with having to read sentences one after another. Don't ever confuse the theoretical gain with real-world gain. Expecting more is the most effective way to be disappointed.Will aggregate/total/overall performance increase??? YES or NO??
.
.
Sure it could just like I could win the lottery. Aggregate/total/overall performance is more likely to INCREASE by adding a 4th PPE
You have no clue what you're talking about. Have you increased the number of resources vying for cache? You've sped up the core logic and sped up the cache, and depending on how you're approaching it, you've probably also sped up memory, and every other bus in your machine... wow and that's faster... I would never have dreamt it. You are aware, of course, that PC processors are conservatively clocked as they are and that a family of PC CPUs is more than likely intended to scale much farther than the speed you've bought it at... After all, if someone manufactured cache amounts specific to each and every speed grade, that costs too much to produce.Overclocking increases performance without the need for increasing the cache size, end of discussion.
PC-Engine said:The obvious disadvantage is the fact you're using up precious die area to add redundancy instead of more processing power.
scificube said:pc-Engine...that A64 post was in response to me...and how did I appear to be hung up on large cache sizes being a requirement for good performance...I mean that is exactly the opposite of the intent of my post.
I recognized larger caches do help but when it comes to real performance gains it's about more execution elements etc in a processor.
Just some other comments...
The redundancy in the Cell is not a waste. It is a saving for the PS3. Now chips with defects that take out only one SPE can still be used. Those Cells with no defects can be used for other purposes or placed into the PS3. If die size is a set size (by fab restrictions or just as a logical argument) and you can either increase or decrease the number of cores on that die...the more cores you use combined with how lax your requirements are for a useful chip will net you more usable chips than using larger cores. If you increase die size and keep the number of cores set per die well...the larger your die the more likely defects are to occur meaning as die size goes up the number of usable chips goes down.
This only makes logical sense.
This is why going to a smaller fab process saves one money no?
I don't think many engineers would agree that adding more larger cores to a chip is a way to increase the number of useful chips one can obtain per wafer. You are reducing the number of die per wafer and at the same time increasing the probability of defects per core. Sounds bad to me.
PC-Engine said:First of all I wasn't responding to you. Second I was agreeing with A64.
Adding a 4th core will decrease the yields relative to a 3 core yes. The question is how much does it decrease the yields and is it an acceptable tradeoff in terms of gaining more performance from the result of having 4 cores instead of 3. Obviously MS went with the 3 core because it's smaller and cheaper than a 4 core and is enough for their needs. Finally redundancy CAN be a waste. It just depends on how much die area the redundant blocks take up. For example does it make sense to have 50% of your die area being redundant? 40%? 30%? 20%? Where do you draw the line? Higher redundancy can make your chip bigger than it needs to be. In fact the bigger the chip the more defects you have so you even need more redundancy. It becomes a domino effect.
Given what has been seen thus far, I'm inclined to believe MS when they say that 3 cores is a "sweet spot," although not for the same reasons they speak of.
You have no clue what you're talking about. Have you increased the number of resources vying for cache? You've sped up the core logic and sped up the cache, and depending on how you're approaching it, you've probably also sped up memory, and every other bus in your machine... wow and that's faster... I would never have dreamt it. You are aware, of course, that PC processors are conservatively clocked as they are and that a family of PC CPUs is more than likely intended to scale much farther than the speed you've bought it at... After all, if someone manufactured cache amounts specific to each and every speed grade, that costs too much to produce.
If anything, it seems that as the # of cores goes up, so does yield (in terms of the PS3 since they are specced for 7).
PC-Engine said:That's because they downgraded it to 7. Originally they wanted 8.
PC-Engine said:MS can down grade their CPU to 2 cores and claim better yields too, but what's the point?
PC-Engine said:What I see is you assuming the redundant SPE automatically solving the yield issue making it equal to XCPU yields or better.
Ty said:If anything, it seems that as the # of cores goes up, so does yield (in terms of the PS3 since they are specced for 7).
PC-Engine said:What if the actual yields even with the redundant SPE is worse than XCPU even with 4 cores? Heck we don't even know the die size of a hypothetical 4 core XCPU.
The more cores on a die, the higher the chances of a core being bad. 8 is two times 4. If you tak out the SPEs and replaced them with 3 PPEs you would get better yields and would not need a 5th PPE for redundancy.
I've never heard that it has such an architectural feature built in. It has a tight crossbar instead of the ring bus in Cell so I assume they couldn't.PC-Engine said:MS can down grade their CPU to 2 cores and claim better yields too, but what's the point?
Nope, not at all. In fact I took pains to clearly point this out. I'm not claiming yields will be better for CELL or worse. I'm just saying that having more cores (with some acting as redundant) improves yields not hurts it.
It's not the # of cores that impact the defect rate. So having a redundant core for CELL is good for their yields. It doesn't imply that CELL will be better or worse than the XCPU in terms of yields.
one said:I've never heard that it has such an architectural feature built in. It has a tight crossbar instead of the ring bus in Cell so I assume they couldn't.
AFAIK it's not a true ring in cell, rather a bidirectional set of two buses.one said:I've never heard that it has such an architectural feature built in. It has a tight crossbar instead of the ring bus in Cell so I assume they couldn't.
I didn't say it equals missed cache cycles... I said that each cache miss costs more CPU cycles when clock speed has changed (assuming memory latency is still the same in absolute time) -- the cache miss rate doesn't really change when you overclock. 100 ns of latency at 2 GHz is 200 cycles, but at 3.2 GHz, it's 320 cycles. That's 120 more cycles that could have been time that you're working. If you overclock a CPU using only the multiplier, you'll see that as opposed to changing memory bus clock in addition.My statement was replying to your higher clock equal missed cache cycles statement.
PC-Engine said:It doesn't exactly improve yields in an absolute sense. Sure adding more redundancy will keep your chip "alive", but if you need a bigger die to do it then it becomes hipocritical because a bigger die has a higher chance of defects not to mention higher cost. It might just be cheaper to have a slighly smaller die with no redundancy.
PC-Engine said:Of course it's related to the number of cores since more cores take up more space and more space equal bigger die.
Nope. Which is why I don't argue with engineers. Wait, didn't you tell me you were an engineer?PC-Engine said:It seems elementary engineering is not your forte.