In the last interview with Ken Kutaragi by Hiroshige Goto, Kutaragi revealed that Cell supports bi-endian to emulate Emotion Engine/MIPS, but in the next article by Goto he clarifies that, according to Kutaragi, Cell itself is not bi-endian so you can't switch back and forth different endiannesses on run-time like PPC, but it has special load/store instructions implemented in hardware that can translate little-endian data into big-endian.
The abstract of Goto's article itself, which focuses on binary translation and endianness, is like this:
+ Though bi-endian was considered in the early stage of the development, eventually Cell supports only the big-endian mode to simplify the structure of ALUs. Instead, it has special load/store instructions implemented in hardware that can translate little-endian data into big-endian. With this feature, you don't have to have a little-endian OS code and codes of both endiannesses can co-exist.
+ Perhaps Waternoose CPU may have this feature or a similar mechanism too. Allard said it doesn't support bi-endian because it doesn't run general-purpose code but suggested it can do fast endian translations.
+ Apple uses Transitive technology to translate PPC into x86, but performance can be suffered because of different endiannesses, unlike 680x0 and PPC. It can be solved if Intel implements endian-translation instructions into its new CPUs for Apple, and it's very likely.
The abstract of Goto's article itself, which focuses on binary translation and endianness, is like this:
+ Though bi-endian was considered in the early stage of the development, eventually Cell supports only the big-endian mode to simplify the structure of ALUs. Instead, it has special load/store instructions implemented in hardware that can translate little-endian data into big-endian. With this feature, you don't have to have a little-endian OS code and codes of both endiannesses can co-exist.
+ Perhaps Waternoose CPU may have this feature or a similar mechanism too. Allard said it doesn't support bi-endian because it doesn't run general-purpose code but suggested it can do fast endian translations.
+ Apple uses Transitive technology to translate PPC into x86, but performance can be suffered because of different endiannesses, unlike 680x0 and PPC. It can be solved if Intel implements endian-translation instructions into its new CPUs for Apple, and it's very likely.