SKELETAL MODELING FOR WORLD SPACE OBJECT SOUNDS
First Claim
Patent Images
1. A method for providing three-dimensional audio, comprising:
- receiving a depth map imaging a scene from a depth camera;
recognizing a human subject present in the scene;
modeling the human subject with a virtual skeleton comprising a plurality of joints defined with a three-dimensional position;
determining, based on the virtual skeleton, a world space ear position of the human subject;
recognizing an object present in the scene;
determining a world space object position of the object;
recognizing audio input information encoding a sound;
determining one or more audio-output transformations based on the world space ear position, the one or more audio-output transformations configured to produce a three-dimensional audio output from the audio input information, the three-dimensional audio output configured such that at the world space ear position the sound appears to originate from the world space object position; and
providing the three-dimensional audio output to the human subject via an acoustic transducer array comprising one or more acoustic transducers.
2 Assignments
0 Petitions
Accused Products
Abstract
A method for providing three-dimensional audio includes determining a world space object position and a world space ear position of a human subject based on a modeled virtual skeleton. The method further includes providing three-dimensional audio output to the human subject via an acoustic transducer array including one or more acoustic transducers. The three-dimensional audio output is configured such that sounds appear to originate from the object.
-
Citations
20 Claims
-
1. A method for providing three-dimensional audio, comprising:
-
receiving a depth map imaging a scene from a depth camera; recognizing a human subject present in the scene; modeling the human subject with a virtual skeleton comprising a plurality of joints defined with a three-dimensional position; determining, based on the virtual skeleton, a world space ear position of the human subject; recognizing an object present in the scene; determining a world space object position of the object; recognizing audio input information encoding a sound; determining one or more audio-output transformations based on the world space ear position, the one or more audio-output transformations configured to produce a three-dimensional audio output from the audio input information, the three-dimensional audio output configured such that at the world space ear position the sound appears to originate from the world space object position; and providing the three-dimensional audio output to the human subject via an acoustic transducer array comprising one or more acoustic transducers. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A three-dimensional audio system, comprising:
-
a depth camera input to receive a depth map imaging a scene from one or more depth cameras; an audio input; an audio output to provide three-dimensional audio output information to an acoustic transducer array comprising one or more acoustic transducers; a logic subsystem; and a storage subsystem storing instructions that are executable by the logic subsystem to; receive the depth map; recognize a human subject present in the scene; model the human subject with a virtual skeleton comprising a plurality of joints defined with a three-dimensional position; determine, based on the virtual skeleton, a world space ear position of the human subject; recognize an object present in the scene; determine a world space object position of the object; receive the audio input information via the audio input; determine one or more audio-output transformations based on the world space ear position of the human subject, the one or more audio-output transformations configured to produce three-dimensional audio output information from the audio input information, the three-dimensional audio output information configured to effect the acoustic transducer array to provide a three-dimensional audio output such that at the world space ear position the sound appears to originate from the world space object position; and provide the three-dimensional audio output information to the acoustic transducer array such that the acoustic transducer array provides the three-dimensional audio output to the human subject. - View Dependent Claims (16, 17)
-
-
18. A method of providing three-dimensional audio, comprising:
-
receiving a depth map imaging a scene from a depth camera; recognizing a human subject present in the scene; modeling the human subject with a virtual skeleton comprising a plurality of joints defined with a three-dimensional position; determining, based on the virtual skeleton, a world space ear position of the human subject; determining a world space object position of an object present in the scene; recognizing audio input information encoding a sound; determining a head related transfer function (HRTF) for the human subject; determining a crosstalk cancellation transformation based on a spatial relationship between the world space ear position and a world space transducer position of the one or more acoustic transducers; producing a three-dimensional audio output from the audio input information, the HRTF, and the crosstalk cancellation transformation, the three-dimensional audio output configured such that at the world space ear position the sound appears to originate from the world space object position; and providing the three-dimensional audio output to the human subject via the one or more acoustic transducers. - View Dependent Claims (19, 20)
-
Specification