Technical investigation into PS4 and XB1 audio solutions *spawn

Discussion in 'Console Technology' started by Brad Grenz, Jun 2, 2013.

  1. Davros

    Legend

    Joined:
    Jun 7, 2004
    Messages:
    17,884
    Likes Received:
    5,334
    well yes
     
  2. bkilian

    Veteran

    Joined:
    Apr 22, 2006
    Messages:
    1,539
    Likes Received:
    3
    I don't have those benchmarks. I have not had access to the hardware in almost a year. Suffice it to say that the vector cores are at least 3-wide SIMD, they have custom instructions especially for accelerating MEC and speech pipelines, they have a quad MAC, and they're running at high clocks.

    BDTI is a company that does performance benchmarks for DSPs using 12 functions: Real Block FIR, Two-Biquad IIR, Viterbi Decoder, Single-Sample FIR, Vector Dot Product, Control, Complex Block FIR, Vector Add, 256-Point FFT, LMS Adaptive FIR, Vector Maximum and Bit Unpack.

    The ADSP 21369 scores 2050 on their benchmark. The Tensilica core used by MS scores >6000 on the same benchmark, at half the clock rate MS will be running. (Higher is better). The X1 audio block has two of them.

    The fixed function stuff is harder to quantify. The numbers you've seen are about as much as you'll get. Because it's fixed function, it doesn't matter what order of polyphase SRC or how many bands the EQ has. You get what you get. This makes it a lot less useful than a general purpose DSP, but you get a ton of functionality at low cost and low power.
     
  3. Relab

    Newcomer

    Joined:
    Jul 7, 2013
    Messages:
    53
    Likes Received:
    0
    I can find the 32bit float benchmark number for the ADSP-21369, but I can't find the 32bit float benchmark for the Tensilica chip - I would appreciate a link. I could only find a simulated 16bit fixed-point benchmark number, which is obviously not the correct one.

    Would game developers have full access to Tensilica DSP?
     
  4. XpiderMX

    Veteran

    Joined:
    Mar 14, 2012
    Messages:
    1,768
    Likes Received:
    0
  5. bkilian

    Veteran

    Joined:
    Apr 22, 2006
    Messages:
    1,539
    Likes Received:
    3
    Yeah, hard to do, because the tensilica cores are configurable. This one is configured similarly, although there will still be differences. the 16/32 bit difference may also explain why their result is about 2x what I expected. I do know that the MS vector cores have full 32bit float vector engines, because that's what the speech pipeline uses.

    As far as I know, game developers do not have access to the 4 DSP cores. They are all system managed. They have access to codec algorithms running on the cores, and full access to the fixed function hardware. Much to the audio team's chagrin, the speech team bogarted the two vector cores. I know there was some internal pressure to force the speech team to give up some of their CPU so that developers could use it, but I have no idea if anything ever materialized from that.
     
  6. Relab

    Newcomer

    Joined:
    Jul 7, 2013
    Messages:
    53
    Likes Received:
    0
    Yeah - the two benchmarks are not very comparable, since the ADSP benchmark is based on a verified test on actual hardware in 32bit float. The Tensilica is a simulated benchmark not done on any hardware and based on 16bit integer. And further more the Tensilica test had 11 customized instructions specifically designed for this simulated benchmark to improve performance compared to other 16bit fixed-point DSP's.
     
  7. bkilian

    Veteran

    Joined:
    Apr 22, 2006
    Messages:
    1,539
    Likes Received:
    3
    Well, yes, that's the point. The MS cores have a bunch of customized instructions specifically to accelerate audio workloads in general, and the MS speech pipeline in particular. The scalar core has customized instructions to accelerate codec functions. Like I've said multiple times before, this is not a general purpose DSP to be used in high end mixing stations. It's a game console audio engine with a focus primarily on reducing power consumption and offloading audio workloads from the CPU. There's an entire core dedicated just to managing the other cores, doing housekeeping on the audio graph, moving memory around, and keeping the fixed function hardware well utilized, so the CPU doesn't have to (something no desktop audio card supports, as far as I know)
     
  8. Cyan

    Cyan orange
    Legend

    Joined:
    Apr 24, 2007
    Messages:
    9,734
    Likes Received:
    3,460
    It seems capable because bkillian pointed out before that Shape is an order of magnitude more capable than the best Creative X-Fi (this was long before the recent discussion on the capabilities of the audio block), so a version for Shape would be EAX50 or more, to be precise.

    If the GPU of the Xbox One is from the Sea Islands family, then Shape would be from the Solaris Islands :lol2:, if that even exist, which doesn't afaik.
     
  9. Davros

    Legend

    Joined:
    Jun 7, 2004
    Messages:
    17,884
    Likes Received:
    5,334
    Is bilikan talking more capable as a whole or more capable of environment modeling ?
    If my memory isnt failing me shape doesnt do reverb I do know it doesnt wave trace so any enviromental modeling will either be faked/pre computed or done somewhere else
    It also doesnt do hrtf
    It also strikes me a strange that if shape is so powerfull why did he state that doing reverb was expensive ?

    ps:
    anyone with an asus xonar be prepared to run some tests ?
     
  10. onQ

    onQ
    Veteran

    Joined:
    Mar 4, 2010
    Messages:
    1,540
    Likes Received:
    56
    :shock: :grin: That was beautiful, can't wait to see/hear what the devs do with all this audio hardware.
     
  11. bkilian

    Veteran

    Joined:
    Apr 22, 2006
    Messages:
    1,539
    Likes Received:
    3
    Unfortunately, Devs only have access to a small part of it. Most of it is reserved for Kinect processing. As a bonus though, it means devs don't have to ask the question "do I have the resources to spare for adding Kinect?" like they did in the last console. Kinect is free(*). _Not_ using it is leaving processing power on the table. I hope this encourages them to be more liberal in their kinect integration this time around.

    (*) For certain values of "Kinect". I believe there are some features that devs can hook in to that require memory/processing on their part. Speech is not one of them.
     
  12. Jay

    Jay
    Veteran

    Joined:
    Aug 3, 2013
    Messages:
    4,033
    Likes Received:
    3,428
    What's the chances now that XB1 has good audio that MS may create a DX eax type api? (for both PC and XB1)
    Would there be enough _spare_ processing capacity to process it on the audio block?
    Either way I think it would be a progressive move.
     
  13. onQ

    onQ
    Veteran

    Joined:
    Mar 4, 2010
    Messages:
    1,540
    Likes Received:
    56
    Even then I think it's going to be something special when the right devs start to thinking up ideas.

    I want to see where voice control/recognition can go to in games & with extra hardware to make it easier for the devs & the console I'm sure we will see some nice things.

    I think the next step is to actually make a A.I co-processor for a console so video games can really put NUI to good use & let the A.I respond back naturally in every game. but I guess that could also be done by using the Cloud & having a really large database of A.I interactions.
     
  14. DaveNagy

    Newcomer

    Joined:
    Jan 18, 2013
    Messages:
    51
    Likes Received:
    0
    I've got a question for bkilian, if he knows...

    Can the fancy speech processing and recognition "tricks" that the hardware does in conjunction with the Kinect also be done from the headset inputs? Is is physically possible to do that, if MS chose to allow it?

    I ask, because on the 360 you can't do most of the voice stuff unless you have a Kinect hooked up. I assumed that was for business reasons, not technical ones, but I could be wrong.

    Since MS has come up with an alternate way to encourage Kinect usage this coming gen, I'm hoping that headset mics can be used for more than just game-chat this time around.

    And heck, since I've got you here, do you have any Kinect-mounting recommendations for best audio quality? MS has been pretty vague. Does the Kinect like to be set on a audio-reflective surface? (like a boundary mic) Or does it prefer to be "floating", up and away from such reflectors? Similarly, is it better to be close to, or far from, the wall behind the TV? Is it recommended to decouple the Kinect from any vibration sources? (Like my TV which has fans and color-wheels and such, before we even start talking about the big-ass speakers that sit to either side.)

    I know a lot of blood, sweat and tears went into making the Kinect(2) relatively immune to crappy room acoustics, but I figure there's no reason not to make life as easy as possible for it.

    I'm thinking... cover the wall with egg cartons, and then suspend the Kinect from bungie cords in front of that. Just gotta clear it with the wife! :wink:
     
  15. bkilian

    Veteran

    Joined:
    Apr 22, 2006
    Messages:
    1,539
    Likes Received:
    3
    You want it as far away from your center speaker as you can get, and not in an enclosed area. The rest is normal open mic type stuff. The less echo you can get in your room, the better it'll work.

    And yes, you can do speech reco using the headset mic. It requires you to retrain the speech database because the audio pipeline will be different, which is one of the reasons it wasn't done in the 360. I don't know the plans for the X1.
     
  16. Brad Grenz

    Brad Grenz Philosopher & Poet
    Veteran

    Joined:
    Mar 3, 2005
    Messages:
    2,531
    Likes Received:
    2
    Location:
    Oregon
    I assumed the voice quality on the 360's standard headset mic was too low for it to effectively be used for voice recognition, but it occurs to me I don't how the audio fidelity impacts that kind of thing.
     
  17. Davros

    Legend

    Joined:
    Jun 7, 2004
    Messages:
    17,884
    Likes Received:
    5,334
    Ive setup a few pc's for voice operation in every case the user had a low quality microphone and it worked quite well despite the fact no one completed the training
     
  18. Rangers

    Legend

    Joined:
    Aug 4, 2006
    Messages:
    12,791
    Likes Received:
    1,596
  19. Arwin

    Arwin Now Officially a Top 10 Poster
    Moderator Legend

    Joined:
    May 17, 2006
    Messages:
    18,762
    Likes Received:
    2,639
    Location:
    Maastricht, The Netherlands
    Yep, pretty basic. But probably still more than enough, considering what Garageband can do on the iPad.
     
  20. MrFox

    MrFox Deludedly Fantastic
    Legend

    Joined:
    Jan 7, 2012
    Messages:
    6,488
    Likes Received:
    5,996
    This is just a repeat of previous vgleaks rumors, what's new in there?

    The slide looks so fake, for an official presentation it should have mentioned fine grain buzzword compute things.
     
Loading...

Share This Page

  • About Us

    Beyond3D has been around for over a decade and prides itself on being the best place on the web for in-depth, technically-driven discussion and analysis of 3D graphics hardware. If you love pixels and transistors, you've come to the right place!

    Beyond3D is proudly published by GPU Tools Ltd.
Loading...