I assume the improvement was meant in comparison to the current consoles. Next to desktop parts from what I've seen these are pretty standard Zen2's with no boost, maybe reduded cache and possibly less optimal memory access.
In terms of clocks it's a little slower than a Ryzen 3700x without boost.
Offcourse, they strip these things that have not so much impact, if any, on gaming performance, but they also needed to reduce clock speeds and cache sizes. They did the best they could, it's probably the perfect fit for these consoles. They won't be as fast as full fat desktop CPU's, first you cant expect that, and second it probably wasn't needed either. If heat, die size and cost wouldn't be a problem, they could have slapped in a 3950X or something for full 12c/24t at 4ghz or higher.
It's hopefully not all bad!
Assuming next gen consoles are based on something like the 4xxx series APUs (Renoir), there's some good news in terms of latencies.
https://www.anandtech.com/show/1570...k-business-with-the-ryzen-9-4900hs-a-review/2
Inter CCX cache accesses are faster for the monolithic APUs than the chiplet designs. For the chiplet based desktop processors, inter-CCX access goes off-chip even if the other CCX is on the same physical chiplet, as it's done via IF routed through the big "hub" IO chip containing the memory controller.
So Renoir takes about 1/4 off the inter CCX latency. Perhaps in terms of games, this could make up somewhat in terms of IPC for having less L3. I suspect the huge L3 on Ryzen 3xxx series desktops is due to a common chiplet with server targetted stuff, and that for purely desktop and gaming purposes it's not possibly optimal use of the die area (not all workloads receive the same benefits from cache scaling).
Anandtech (Ian Cutress writing) also had this to say about the smaller L3 in Renoir:
For Renoir, AMD decided to minimize the amount of L3 cache to 1 MB per core, compared to 4 MB per core on the desktop Ryzen variants and 4 MB per core for Threadripper and EPYC. The reduction in the size of the cache does three things: (a) makes the die smaller and easier to manufacture, (b) makes the die use less power when turned on, but (c) causes more cache misses and accesses to main memory, causing a slight performance per clock decrease.
With (c), normally doubling (2x) the size of the cache gives a square root of 2 decrease in cache misses. Therefore going down from 4 MB on the other designs to 1 MB on these designs should imply that there will be twice as many cache misses from L3, and thus twice as many memory accesses. However, because AMD uses a non-inclusive cache policy on the L3 that accepts L2 cache evictions only, there’s actually less scope here for performance loss.
It would also be interesting to know how main memory latency in consoles compared to Renoir and Matisse, particularly under heavy load.
If infinity fabric speed is tied to the memory clock as in Matisse, then 14 Gbps might be quite close to the 3733 mhz "sweet spot" that AMD talked about for that setup...?