"MathBox", also called "extended math unit" is a unit inside the Gen4 intel igps (i965, g35, g45, Ironlake) performing all those complicated arithmetic instructions - rcp, rsq, sin, cos, pow, log,... It'll perform pretty much the same things as nvidias SFU (though this one will also do mul and attribute interpolation) or AMD's T unit (though this one also can do almost the same things as their other 4 units). However, the way this is achieved is quite different - in contrast to amds or nvidias solution the mathbox is a completely separate (shared by several shader units) unit which can't access the register file of the shader units, instead it's invoked by passing messages to it (similar to how the sampler is invoked actually, though sampler is shared by all units not just by some).Pardon my ignorance: what is a MathBox?
I must have missed that... linky?mczak, on that diagram there are total of 12 EUs which are split into 3 rows. Each rows say "3X Mathbox".
Yes talking about Ironlake. There's some pieces in the opensource linux driver, in particular there's a BRW_CLIPMODE_KERNEL_CLIP bit which older chips didn't have, and I assume this does the clipping in fixed function hardware.Now where have you got that info about the hw clip unit from? You are talking about Ironlake yes? Does that mean that the G45 doesn't have hw clip?
No, why would they? That's after clipping anyway.Even back in G965 they talked about Early Z. Were they using all software for that then?
Hmm, I guess this means the original also had the GPU overclocked? It looks like it increased the reference clock (from 133 to 200Mhz) maybe the multiplier for GPU stays the same which would mean it's running at 800Mhz instead of 533Mhz (that is, ~10% above the levels of a core i3).No, its cause you OVERCLOCKED the damn thing!
http://translate.googleusercontent....gle.ca&usg=ALkJrhi8p3118OsDcPcv_f6EqBiHoeisQQ
It is actually 1/2 the performance almost spot on.1/3 fast as the original!
Where did you get that 466Mhz? If it's running at that it would explain almost all the difference, if the original indeed ran at 800Mhz.Although there's a possibility of running only at 466MHz and the driver is slowing down a bit(around 20%), that won't make up for 3x.
Well the review using the core i5 661 is using ddr3-1333 ram instead of ddr3-1600. The bandwidth saving features of G45 are rather weak in comparison to other chips (read: except early-z nonexistent), so that could possibly explain some of the difference, as I'd expect it to be somewhat bandwidth limited (despite it has a huge bandwidth advantage to i785/g210/4350 when using dual channel ram). Also the 4350 they tested seems to be a slow one, in other reviews it got about 10% more 3dmark06 score so I don't know what's up with that. Add it up and not much is missing.Still can't see what's missing that allows it to outperform the 210 and 4350 even with overclock. The driver that allows CPU assist should be responsible for ~20% from the same site. The i5 661 should then increase its lead to 40-50% over 785G, which is not enough to overtake the discrete parts. Now where's the rest 30-40% then?
It just doesn't make sense. Display outputs too slow or what .Maybe the conclusion by Zhao isn't flawed and there is something that makes H57 faster than H55 for IGPs.
I don't know if the driver still has the same limits. And it doesn't really matter, the point was that you can dismiss the result at that resolution as bogus, even if it's for another reason.You should know what the G45/Ironlake is able to allocate for DVMT. It's 1795MB with 4GB memory and 780MB with 2GB. I don't think it should run out even with 780MB.
Is the IGP not directly connected to the IMC? Which means 25.6GB/s with DDR3-1600, even shared with CPU a lot more than the 6.4GB/s on G210/HD4350 with DDR2-800@64-Bit.
Well the review using the core i5 661 is using ddr3-1333 ram instead of ddr3-1600. The bandwidth saving features of G45 are rather weak in comparison to other chips (read: except early-z nonexistent),
*snip*
I think there were limitations on the memory clock depending on the FSB, so that could be true.Now on the Celeron, it was like capped or something. Dual channel didn't bring anything more than 10%. I've read somewhere putting a Celeron in slows memory down to 533MHz. Whatever.
Indeed. Dunno why it wouldn't do anything.DC should have done something though, like on the Core 2, which did WAY more than 10%.
I can't follow you there that doesn't really make a lot of sense. The IGP living in the chipset always has a direct connection to memory.Is there a possibility that the IGP is artificially handicapped on the lower value chips? That definitely seems true. What if most of the memory access is done through the FSB rather than a direct connection? It would guarantee Celerons to be slower. I'm guessing with claims of being "DMA" maybe few operations can do direct access, but not most of them.
Oh Hierarchical Z that's nice (in fact early z doesn't save any bandwidth, just shader cycles). Compared to the competition still not a lot (still missing z/color buffer compression) but it's a start.Ironlake has Hierarchial Z, which I assume is an advancement from Early Z, and there's a possible fixed function clip unit(even the presentations say much faster clip).
It wasn't really 1/5 all the time. Don't forget the G45 could already achieve a score of over 1000 in 3dmark06 in ideal conditions - that's only roughly a factor 2 increase there for Ironlake.What's the real cause that went from sometimes being 1/5 of 785G with G45 to being lot faster?
Saving shader cycles can save bandwidthOh Hierarchical Z that's nice (in fact early z doesn't save any bandwidth, just shader cycles). Compared to the competition still not a lot (still missing z/color buffer compression) but it's a start.
Oh yes right for the texture lookups. Still, I don't really consider that a bandwidth saving feature, since you should save more in execution resources than bandwidth.Saving shader cycles can save bandwidth
http://en.ocworkbench.com/tech/h55-with-pentium-g6950-performance-detailed-against-g45-g41/From the chart, we can see that the H55 definitely runs better than the predecessors. When compared to GF9300, it is still slightly behind.