The non-standard game interfaces discussion thread (move, voice, vitality, etc.)

That's possible if it can track 4 or more people at a time. But then again, it doesn't work while your sitting down, so i'm unconvinced.



Perhaps, but the fact that they haven't yet managed to get it working while sitting down (let alone any of the other 'experiences') is serious cause for concern. If they don't have the basic stuff working yet, how long til we get the more complex stuff that was promised

Bleh, the working while sitting thing is still a rumor, so I'm unconvinced. There's one fact: we're 5 months from launch and nobody knows shit till then. How's that?

Tommy McClain
 
What do you mean ?

The speakers placed across the kinect body

Kinect_specs_610x405.jpg


They can be used to focus on one player tracking where the voice is coming from through out the room. Or they can be used normaly and pick up as many voices as you want. You can even have each mic pick a person and track for that voice only.
 
The speakers placed across the kinect body

You mean the mic (microphone) !
There is no speaker in the Kinect accessory.

They can be used to focus on one player tracking where the voice is coming from through out the room. Or they can be used normaly and pick up as many voices as you want. You can even have each mic pick a person and track for that voice only.

Yes, that's the 4 mic array we referred to above. But Kinect only supports 1 (human) speaker voice recognition according to Microsoft's store specs.

They have not demoed simultaneous/multiple speaker voice recognition, have they ? 1 speaker should be sufficient for device control (and of course SP games and turn-base MP games).
 
The speakers placed across the kinect body

Kinect_specs_610x405.jpg


They can be used to focus on one player tracking where the voice is coming from through out the room. Or they can be used normaly and pick up as many voices as you want. You can even have each mic pick a person and track for that voice only.

You are just making things up. You cannot use each mic to track separate voices. The point of having an array of mics is to have a direction for a sound source. Each mic receives every sound (with slightly different delay).
 
You mean the mic (microphone) !
There is no speaker in the Kinect accessory.



Yes, that's the 4 mic array we referred to above. But Kinect only supports 1 (human) speaker voice recognition according to Microsoft's store specs.

They have not demoed simultaneous/multiple speaker voice recognition, have they ? 1 speaker should be sufficient for device control (and of course SP games and turn-base MP games).

You guys are miss-reading what is there.

You are just making things up. You cannot use each mic to track separate voices. The point of having an array of mics is to have a direction for a sound source. Each mic receives every sound (with slightly different delay).

You can do anything you want with those mic's if you write proper software for it. And yes I know the point for the 4 mics.
 
I will say this though that although they may not be delivering on some the experiences they showed off last year for this year's launch, there's nothing that says that wasn't a vision that they planned for in future updates or games. Some people have no patience I swear. Again, last year was the announcement of a long term vision. This year was the announcement for what to expect at this year's launch. One doesn't preclude the other.

Tommy McClain
Sounds like Ken Kutaragi all over again... Promises, promises... ;)
 
How ?

It says, "Array of 4 microphones supporting single speaker voice recognition"



Would you be able to elaborate on the mechanics ?

What would you want me to elaborate on ? If mics can pick up sounds and the software is working to isolate a single voice and track it through those 4 mics you can easily replicate that with multiple voices. The only problem will become processing over head. In a game like scene it , there should be processing power to spare.


We will know in nov who is right tho I guess.
 
You can do anything you want with those mic's if you write proper software for it. And yes I know the point for the 4 mics.
Stop it already. Even if you can track multiple voices (which may be theoretically possible, but not practical at this point), you can not make each mic track separate voice with software. It's an hardware issue.
 
You mentioned that we have misread the specs, so I thought you know the details. If you don't have any info, then I'll trust Microsoft's specs.
 
Bleh, the working while sitting thing is still a rumor, so I'm unconvinced. There's one fact: we're 5 months from launch and nobody knows shit till then. How's that?

Tommy McClain

But, it's not a rumour:

http://kotaku.com/5565777/xbox-kinect-does-not-play-well-with-couch-potatoes

UPDATE: A Microsoft spokesperson told me after the publication of this article that the company is certain that Kinect gesture control will work for movies, ESPN and other "entertainment" features before the sensor is launched. As I originally reported, that is not an implemented feature yet. The spokesperson was not able to provide any update on the Kinect's tolerance of a person who sits while playing games.
 
What would you want me to elaborate on ? If mics can pick up sounds and the software is working to isolate a single voice and track it through those 4 mics you can easily replicate that with multiple voices.
Let's be clear here - what you said with each microphone tracking one person is decidedly incorrect. That makes it sound like each is a motorised, highy directional mic! All four microphones pick up all the sounds, but with slightly different timings, and using these four different flavours fo the same sound, you can process them to isolate a particular sound. I haven't done this myself, but I believe the basic idea is you can time-shift two samples so the voice you want exactly matches which will anplify that voice, while the alternative sounds that aren't a perfect match will muddle to become noise.

For this to work, you need the sounds you're picking up to be reasonably isolated. In a room full of reverb for example, the end result would be very muddy. And if two people are sat next to each other shouting answers, that's going to pose a problem. It appears my suspicions that the size of the device are for the spacing of the mics was correct, as a greater distance between mics means great sound separation. I expect PSEye's array is too narrow to be much use, hence the fact it's not doing any sound isolation in my experience. There are still going to be major limits though, and you cannot just "do anything you want" with them, unless someone has invented a brand-new audio detection engine as sophisticated as human hearing! Voice recognition is ropey at best, and with four voices cross-talking, it'll be struggling. hence it's not surprising that the specifications state 1 voice recognition. One at a time is good enough for a quiz, but we're a long ways still from Star Trek style talking to computers.
 
But, it's not a rumour:

http://kotaku.com/5565777/xbox-kinect-does-not-play-well-with-couch-potatoes
Quote:
UPDATE: A Microsoft spokesperson told me after the publication of this article that the company is certain that Kinect gesture control will work for movies, ESPN and other "entertainment" features before the sensor is launched. As I originally reported, that is not an implemented feature yet. The spokesperson was not able to provide any update on the Kinect's tolerance of a person who sits while playing games.
That only confirms the gesture controls for movies will work, period. That suggests it'll work only while standing. Whether Kinect works while sitting on a couch is still unconfirmed, according to that spokesperson.
 
That only confirms the gesture controls for movies will work, period.
It doesn't even confirm that. MS is certain that it will, but unless they are infallible, the best laid scheme o' mice an' men gang aft agley and it may not work as they intend.
 
Dunno why this thread got closed sp I've opened it up again pending some other mod saying, "how dare you!," and putting an end to it. ;)
 
Let's be clear here - what you said with each microphone tracking one person is decidedly incorrect. That makes it sound like each is a motorised, highy directional mic! All four microphones pick up all the sounds, but with slightly different timings, and using these four different flavours fo the same sound, you can process them to isolate a particular sound. I haven't done this myself, but I believe the basic idea is you can time-shift two samples so the voice you want exactly matches which will anplify that voice, while the alternative sounds that aren't a perfect match will muddle to become noise.
Perhaps I was wrong with each microphone tracking a voice. But software should be able to do it

For this to work, you need the sounds you're picking up to be reasonably isolated. In a room full of reverb for example, the end result would be very muddy. And if two people are sat next to each other shouting answers, that's going to pose a problem. It appears my suspicions that the size of the device are for the spacing of the mics was correct, as a greater distance between mics means great sound separation. I expect PSEye's array is too narrow to be much use, hence the fact it's not doing any sound isolation in my experience. There are still going to be major limits though, and you cannot just "do anything you want" with them, unless someone has invented a brand-new audio detection engine as sophisticated as human hearing! Voice recognition is ropey at best, and with four voices cross-talking, it'll be struggling. hence it's not surprising that the specifications state 1 voice recognition. One at a time is good enough for a quiz, but we're a long ways still from Star Trek style talking to computers.

I believe they will be able to track multiple voices in quiz show type games. For each team they can simply have the team pick a person to be team leader and only that person is needed to be followed. Mabye for quick games as a way to buzz in it might not work .But the 360 itself should be more than capable of picking out multiple voices from the audio stream of 4 diffrent mics
 
Back
Top