What are the differences between the single core Mali-400 MP and a Mali 200?
Is Mali400 a single or a dual core config? I had the impression its the latter.
What are the differences between the single core Mali-400 MP and a Mali 200?
If we are to believe these documentsIs Mali400 a single or a dual core config? I had the impression its the latter.
Cheers.Here's a presentation that ARM did on Mali at the ARM DEVCON 2008
Surely they meant 555?Hmmm...comparing Mali 400MP to 520/530/535 ?? whats that about
550 never existed ?
Tsk tsk, marketing ftw? PowerVR's numbers include a 2.5x multiplier because of the claimed higher efficiency of TBDRs when there is overdraw. That number is a tad excessive, but it is fair to say that if you designed a game for both IMRs and TBDRs then you could save on a Z-Pass which does increase your effective bandwidth.Mali 400MP dual core fill rate 550Mpix 30M tri, SGX545 does 1000Bpix and 40M tri ?
Yes, it's interesting that ST-Micro seems to be their only Mali-400MP licensee at this point though! At least Mali 200 has a lot more momentum than I thought...Launch dates for various Mali-ed products are on slide 13.
Slide 7 there doesn't match http://www.arm.com/miscPDFs/21863.pdf on performance & area though (at least for M200).Here's a presentation that ARM did on Mali at the ARM DEVCON 2008
Lead licensee in ARM lingo usually means flagship or "driving customer" rather than "the only one", although I haven't seen any other announcements for M400 so far. Could it have anything to do with the "high-end 3-D graphics accelerator" mentioned here?Yes, it's interesting that ST-Micro seems to be their only Mali-400MP licensee at this point though!
Slide 7 there doesn't match http://www.arm.com/miscPDFs/21863.pdf on performance & area though (at least for M200).
If you want to compare MP vs. MP (with 2 cores at a time) and always on what each of the two has officially announced:Mali 400MP "beats 6 SGX cores 520/530/535/540/545/550"
Hmmm...comparing Mali 400MP to 520/530/535 ?? whats that about
550 never existed ?
Looking at that presentation, I can't help but notice the 300MP/s @ 200MHz number. Err, what? 1.5 TMUs or ROPs, really? Gives me a hunch it might be inflated in the same way TBDR numbers are (although in this case I can't figure out what the reasoning might be!) and so the 2-3mm2 was really for a half-pixel TMU configuration, while the only Mali200 config that still exists today is a full-pixel TMU. I could be horribly wrong, of course. Of course, in this context it is also interesting to look at ARM's claimed scaling numbers from 90GP to 65LP in the presentation tangey posted - they are pretty awful (1.5->1mm² for Mali55!) and can also help explain the numbers a bit.5mm2 vs. 4.1mm2 at 65nm and it doesn't have a ~ as with M400 estimated die sizes. Past estimates were a lot more optimistic:
http://www.hardocp.com/article.html?art=ODAyLDEsLGhlbnRodXNpYXN0
Looking at that presentation, I can't help but notice the 300MP/s @ 200MHz number. Err, what? 1.5 TMUs or ROPs, really? Gives me a hunch it might be inflated in the same way TBDR numbers are (although in this case I can't figure out what the reasoning might be!) and so the 2-3mm2 was really for a half-pixel TMU configuration, while the only Mali200 config that still exists today is a full-pixel TMU.
BTW: I found this little gem in an ARM presentation. A die partitioning for the Mali55! There's a fair bit of SRAM, but the biggest block is obviously the 'Texture Mapper', followed by the 'Tri setup master', and then a bunch of smaller ones that are much harder to read but which include a 'Framebuffer/Blenders' block and a 'MMU AMB' one.
http://www.jp.arm.com/event/pdf/forum2007/t3-1.pdf - Page 11
Well, it's the fillrate that struck me as not plausibly being a raw number. The GFlops number, I'd find believable at 3mm2 if you have a large batch size and it's FP16. However it was my understanding that their PS is FP24 and the VS is FP32... So I'll admit it certainly wasn't believable as a raw number either.That presentation was before ARM bought Falanx and it fairly sounds like an effective fillrate. It's the fillrate that struck you as weird? Try 20GFLOPs@200MHz from a mere 3mm2@90nm core. Damn creative math if you ask me.
WRT bandwidth, I think many of their comparisons make perfect sense relative to a basic kind of tiler that doesn't really exist in the industry, although it'd probably be nearest what ATI did in the OpenGL ES 1.x generation (but not quite that either).Instead of trying to convince with their presentations that they consume X less bandwidth than the competition (which I have severe doubts it's even true)
Well it helps that I saw similarly colored diagrams in much larger format s for other ARM cores in the past, so my brain instantly realized the similarity...Jebus I never would had noticed without you pointing me at it.
Well, it's the fillrate that struck me as not plausibly being a raw number.
Anyway, most of the bandwidth claims in the industry are even less credible than most of the spam e-mails I get in my mailbox. Probably at the same level as home fitness equipment marketing...
Heh, the most terrifying thing is it did have pixel shaders; in a few ways they seemed in fact more advanced than the GF3's... Yet it didn't have true Early-Z; depth testing saved power through clock gating and preventing memory accesses, but it never improved shading performance one iota. Ugh... It's a pretty weird and not always very logical architecture.Or that the 1st generation GoForce had pixel shaders....
Yeah, it is definitely interesting. The basic rendering strategy is certainly much more interesting than Tegra's. I don't know how exciting/smart the low-level details are for either since that's basically unknown for everybody in the handheld world, but I can honestly say I'd love to know in both cases...Don't get me wrong I really like Falanx and I find Mali as an architecture very insteresting, in fact more interesting than Tegra. I just don't see the reason for that type of marketing; you lose more than you gain after all at least IMHLO.
Heh, the most terrifying thing is it did have pixel shaders; in a few ways they seemed in fact more advanced than the GF3's... Yet it didn't have true Early-Z; depth testing saved power through clock gating and preventing memory accesses, but it never improved shading performance one iota. Ugh... It's a pretty weird and not always very logical architecture.
Here's the relevant patent: http://v3.espacenet.com/publicationDetails/description?CC=EP&NR=1759380A2&KC=A2&FT=D&date=20070307&DB=EPODOC&locale=en_E
What is much more laughably primitive in the original GoForce is the transform engine, which just reuses the setup engine to do very basic transforms in HW instead of on the CPU. Honestly, I'm not sure why they even bothered... heh. And of course, the whole 'let's keep the framebuffer/textures in on-chip SRAM!' idea was insane. The original GoForce probably was awful at hiding memory latency therefore; I wonder how/if that evolved in the 4800/5500 when they started being dependent on external memory for textures.
Yeah, it is definitely interesting. The basic rendering strategy is certainly much more interesting than Tegra's. I don't know how exciting/smart the low-level details are for either since that's basically unknown for everybody in the handheld world, but I can honestly say I'd love to know in both cases...
So what else do you want? Don't you remember how incredibly basic DX8 Pixel Shading was?I just skimmed through it, but it rather sounds like a generic scalar ALU, which (unless I've missed something) I wouldn't necessarily conclude that its capable of pixel shading.
Yup, it's pretty depressing seeing how every single handheld chip/IP I've ever looked at, I've *always* overestimated its 3D performance by at least 2x until I had the real info. At least in Mali's case, I can claim it's not my fault...The fact that die estimates were way off in the past, aren't really annoying me with Mali. What annoys me is that estimated performance and featureset of that 2005 presentation are quite on a different level than the final result.
Sigh, I really am a retard. See, Neil Trevett (president of Khronos/chair of OpenGL ES) was at the NV stand and I didn't realize it until it was too late, so I didn't ask him any questions except stuff obviously related to NV/Tegra. Bah! Heck, I even realized he was probably at the stand, but didn't realize that was him....I've no idea if NV made any modifications to their CSAA algorithm for OpenVG; if not it might give some nasty side-effects with VG content.
So what else do you want? Don't you remember how incredibly basic DX8 Pixel Shading was?
Sigh, I really am a retard. See, Neil Trevett (president of Khronos/chair of OpenGL ES) was at the NV stand and I didn't realize it until it was too late, so I didn't ask him any questions except stuff obviously related to NV/Tegra. Bah! Heck, I even realized he was probably at the stand, but didn't realize that was him....
That's a dual A9-based SoC.There's no indication of the number of cores in the U8500, so I assume it's just one. No indications of MHz for either the A9 or the Mali400 in there, but two interesting tidbits there and on the new ST-Ericsson page about the U8500: it sports a 1080p H.264 *High Profile* camcorder but, unlike OMAP4, only 32-bit LPDDR2. Also has two camera ISPs: 18MP for the primary, 5MP for the secondary. Nice, I wonder how much a phone like that would cost in 1H11... Probably more than anyone sane would ever pay but heh
Err, gosh, I realize now my sentence was very ambiguous: I didn't mean the number of cores for the A9. I meant the number of cores for the Mali 400!That's a dual A9-based SoC.