Yes that's quite nice. And the PCLMULQDQ too has (like ordinary simd muls on jaguar) ultra-low latency and throughput of 1! AESKEYGENASSIST would be like 10 times faster than on haswell according to Agner's data (not sure why intel has published so vastly different figures for that one). But even without considering this one, the encryption instructions on jaguar actually look better than on haswell even, and of course much better than on silvermont.Jaguar's numbers are really surprising
AESENC/DEC 3 cycles latency (5 according to Agner Fog) 1 cycle throughput. Double instruction.
AESIMC/KEYGENASSIT 2 cycles latency 1 cycle throughput. Single instruction
The new Atom's performance continues to match or exceed the AMD A4-5000's, as well.
As far as I can tell from these benches, Jaguar would be quite a match for this chip in performance if it would support turbo, IPC indeed seems a bit higher on Jaguar (based on the assumption that most likely silvermont is running closer to 2.4Ghz rather than 1.4Ghz in most of the tests). Granted it still wouldn't be a match in perf/w but at least if AMD would have implemented some turbo (say 1.5Ghz up to 2Ghz) it wouldn't look like it drew much more power while still being slower...http://techreport.com/review/25329/intel-atom-z3000-bay-trail-soc-revealed
Jaguar never stood a chance against Silvermont. Silvermont is a slam dunk win for Intel.
Sorry for the late reply. We moved mountains to make it happen.EDIT
As a side note it is clear that Intel did not want ARM to get their hand on that micro server market. They managed to ship something 1 years ahead of the competition (ARM based on ARM V8).
Been working with Baytrail chips for the most of the year and was actually sufficiently impressed that I went out and bought an ASUS Transformer Book T100 last week - my first ever tablet and the first time I've bought a laptop in about 7 years!
Spent the weekend and this week doing some testing with it, mostly in games, and have posted up some videos here: http://www.youtube.com/user/BrueComputing
Graphics performance is somewhere in-line with the 1st-gen Core i3/i5 stuff, which isn't too bad considering this is a 2W SDP part. Done some testing against Temash chips and they do edge it in graphics benchmarks, but are 8W TDP parts.
In the T100 it's the first Windows tablet I've felt it's worth owning in terms of price/performance/usefulness.
OpenSSL 1.0.1e 11 Feb 2013
built on: Mon Feb 11 16:33:34 2013
options:bn(64,32) rc4(8x,mmx) des(idx,cisc,2,long) aes(partial) idea(int) blowfish(idx)
compiler: cl -D_USE_32BIT_TIME_T /MD /Ox /O2 /Ob2 -DOPENSSL_THREADS -DDSO_WIN32 -D_USE_32BIT_TIME_T -W3 -Gs0 -GF -Gy -nologo -DOPENSSL_SYSNAME_WIN32 -DWIN32_LEAN_AND_MEAN -DL_ENDIAN -D_CRT_SECURE_NO_DEPRECATE -DOPENSSL_BN_ASM_PART_WORDS -DOPENSSL_IA32_SSE2 -DOPENSSL_BN_ASM_MONT -DOPENSSL_BN_ASM_GF2m -DSHA1_ASM -DSHA256_ASM -DSHA512_ASM -DMD5_ASM -DRMD160_ASM -DAES_ASM -DVPAES_ASM -DWHIRLPOOL_ASM -DGHASH_ASM -DOPENSSL_USE_APPLINK -I. -DOPENSSL_NO_RC5 -DOPENSSL_NO_MD2 -DOPENSSL_NO_KRB5 -DOPENSSL_NO_JPAKE -DOPENSSL_NO_STATIC_ENGINE
The 'numbers' are in 1000s of bytes per second processed.
type 16 bytes 64 bytes 256 bytes 1024 bytes 8192 bytes
aes-128-cbc 164791.16k 249379.82k 294641.11k 311612.47k 317188.81k
OpenSSL 1.0.1e 11 Feb 2013
built on: Mon Feb 11 16:33:34 2013
options:bn(64,32) rc4(8x,mmx) des(idx,cisc,2,long) aes(partial) idea(int) blowfish(idx)
compiler: cl -D_USE_32BIT_TIME_T /MD /Ox /O2 /Ob2 -DOPENSSL_THREADS -DDSO_WIN32 -D_USE_32BIT_TIME_T -W3 -Gs0 -GF -Gy -nologo -DOPENSSL_SYSNAME_WIN32 -DWIN32_LEAN_AND_MEAN -DL_ENDIAN -D_CRT_SECURE_NO_DEPRECATE -DOPENSSL_BN_ASM_PART_WORDS -DOPENSSL_IA32_SSE2 -DOPENSSL_BN_ASM_MONT -DOPENSSL_BN_ASM_GF2m -DSHA1_ASM -DSHA256_ASM -DSHA512_ASM -DMD5_ASM -DRMD160_ASM -DAES_ASM -DVPAES_ASM -DWHIRLPOOL_ASM -DGHASH_ASM -DOPENSSL_USE_APPLINK -I. -DOPENSSL_NO_RC5 -DOPENSSL_NO_MD2 -DOPENSSL_NO_KRB5 -DOPENSSL_NO_JPAKE -DOPENSSL_NO_STATIC_ENGINE
The 'numbers' are in 1000s of bytes per second processed.
type 16 bytes 64 bytes 256 bytes 1024 bytes 8192 bytes
aes-128 cbc 32189.81k 36132.41k 37676.07k 72772.13k 74997.63k
Had a play with a Venue 11 Pro today and compared it to the T100 in 3DMark to see how the Z3740 stacks up vs. Z3770.
The Z3740 actually comes out 11% faster in graphics performance, Z3770 29% faster in the CPU Physics test.
I'd wager that it is down to how the power usage is balanced on the SoC - the upper power and GPU clock limits are the same for both chips but the higher base clock on the Z3770 is presumably leaving less power headroom for the GPU to utilise...
I always consider it like a ~5W chip or so (just get away with that "scenario design power" please...).The Venue 8 Pro BIOS has a stunning load of settings if you access it through the Win8 advanced boot, UEFI settings menu. Lots of IGP settings are available including a graphics boost option that is disabled by default.
I haven't bothered with benchmarks but have played some games. I have found it to run some quite well. Guild Wars and Defense Grid for example. SupCom2 is too slow though. It has interesting performance characteristics. Low bandwidth and fillrate but the ALU capability isn't bad for what it is. I find it totally amazing for a 2W SOC.
The Venue 8 Pro BIOS has a stunning load of settings if you access it through the Win8 advanced boot, UEFI settings menu. Lots of IGP settings are available including a graphics boost option that is disabled by default.