if XeSS can do a fantastic job of 540p to 1080p in upscaling and for lighter games 720p -> 1440p
XSS is going to be a good spot for its price point.
There's some specific benefits for XSS.
The lower the native resolution the harder it is to get a good output one.
Even though TAAU with UE5 being a good example of a really good implementation, going from <=720p to 1080p probably won't look native.
ML-U I expect to generally give a better final image.
Other inherent benefits that are forgotten, are things like the higher relative fillrate at lower resolution that it may struggle with otherwise
The primary implementation uses XMX and the DP4A is the general fallback. I haven't found any actual timings either, just the same general representative image showing relative quality and timing.
When I said general, I meant non Intel ARC specific implementation.
Dp4a is more than just simply INT8 support.
As in just using INT16 it more than likely will be more than twice as slow
I'm curious how much difference you can generally expect from dp4a compared to RPM for inference.
And XeSS dp4a implementation compared to XMX, bit more detailed than their bar graph, although useful
XeSS will work by default on XS consoles for any game that uses it.
Be interesting to see it used, although ARC GPUs aren't anywhere to be seen at the moment.