I've never really delved into Cyrix history.Quite a short memories here?
Cyrix found some good use of "L0" cache for their 6x86 architecture -- the 256-byte line for instructions:
Another example where someone stuck some fiddly SRAM in front of the L1 and called it an L0.Note the L1 cache was unified, e.g. code & data in one place.
Some would call it quasi-L2 implementation, but since the 256-byte "extension" to the instruction queue can't be really qualified as L1 by any means, it's dubbed L0+L1 design.