Method and apparatus for tracking listener's head position for virtual stereo acoustics
First Claim
Patent Images
1. A method of tracking a position of a head of a listener comprising a skin region, the method comprising:
- obtaining two images using two image pickup units;
determining the head of the listener in at least one of the two images by detecting an edge of the skin region of the head of the listener;
tracking a skin color of one of the images, thereby obtaining a 2-dimensional (2D) coordinate value of the position, wherein a Gaussian skin classifier is applied only to a region of interest (ROI) consisting of the head of the listener substantially bound by the detected edge of the skin region of the head in order to reduce computation required to achieve color tracking; and
obtaining a distance between the image pickup units and the listener using stereo area correlation.
3 Assignments
0 Petitions
Accused Products
Abstract
A method and apparatus for tracking a listener'"'"'s head position for virtual stereo acoustics. The method of tracking the head position of a listener includes: obtaining face images of the listener using two image pickup units; tracking the skin color of an image, thereby obtaining the two-dimensional (2D) coordinate value of the listener'"'"'s position; and obtaining the distance between the image pickup units and the listener using triangulation.
22 Citations
15 Claims
-
1. A method of tracking a position of a head of a listener comprising a skin region, the method comprising:
-
obtaining two images using two image pickup units; determining the head of the listener in at least one of the two images by detecting an edge of the skin region of the head of the listener; tracking a skin color of one of the images, thereby obtaining a 2-dimensional (2D) coordinate value of the position, wherein a Gaussian skin classifier is applied only to a region of interest (ROI) consisting of the head of the listener substantially bound by the detected edge of the skin region of the head in order to reduce computation required to achieve color tracking; and obtaining a distance between the image pickup units and the listener using stereo area correlation. - View Dependent Claims (2, 3, 4, 12)
-
-
5. An apparatus for tracking a position of a head of a listener comprising a skin region, the apparatus comprising:
-
a first image pickup unit capturing a first image of the face of the listener; a second image pickup unit capturing a second image of the face of the listener from a second angle of vision different from the first; a 2-dimensional (2D) coordinate value generation unit generating a 2D coordinate value of the position by tracking a skin color of the image, wherein the 2D coordinate value generation unit comprises a skin region detection unit determining the head of the listener in at least one of the two images by detecting an edge of the skin region of the head and detecting the skin region using a Gaussian skin classifier that is applied only to a region of interest (ROI) consisting of the head of the listener substantially bound by the detected edge of the skin region of the head in order to reduce computation required to achieve color tracking; and a distance calculation unit calculating a distance between the image pickup units and the listener using stereo area correlation. - View Dependent Claims (6, 7, 8)
-
-
9. An apparatus, comprising:
-
two image pickup units respectively capturing two images, wherein a head of a listener is determined from at least one of the two images by detecting an edge of a skin region of the head; a 2-dimensional coordinate value generation unit generating a 2D coordinate value of a position of the head by tracking a skin color region of one of the captured images, wherein a Gaussian skin classifier is applied only to a region of interest (ROI) consisting of the head of the listener substantially bound by the detected edge of the skin region of the head in order to reduce computation required to achieve color tracking; a distance calculation unit calculating a distance from the image pickup units to the listener using stereo area correlation of the two images; and a listener'"'"'s position calculation unit setting a location of a sweet spot in a multi-channel audio signal to coincide with the 2D coordinate position of the head generated by the 2-D coordinate value generation unit.
-
-
10. A method, comprising:
-
capturing two images from different perspectives via two image capturing units; determining a head of a listener in at least one of the two images by detecting an edge of a skin region of the head of the listener; determining a 2D coordinate position of the face by tracking a skin color region of one of the captured images, wherein a Gaussian skin classifier is applied only to a region of interest (ROI) consisting of the head of the listener substantially bound by the detected edge of the skin region of the head in order to reduce computation required to achieve color tracking; calculating a distance from the image capturing units to a head of the listener via triangulation based on the two images; and setting a location of a sweet spot in a multi-channel audio signal to coincide with the determined 2D coordinate position of the face. - View Dependent Claims (13)
-
-
11. A method of resetting a location of a sweet spot in a multi-channel audio signal, the method comprising:
-
determining a position of a head of a listener by capturing two images from different perspectives via two image capturing units and by detecting an edge of a skin region of the head of the listener; determining a 2D coordinate position of a face by tracking a skin color region of one of the captured images and calculating a distance from the image capturing units to the head of the listener via triangulation based on the two images, wherein a Gaussian skin classifier is applied only to a region of interest (ROI) consisting of the head of the listener substantially bound by the detected edge of the skin region of the head in order to reduce computation required to achieve color tracking; and resetting the location of the sweet spot in the multi-channel audio signal to coincide with the determined position of the head of the listener. - View Dependent Claims (14)
-
-
15. A method of resetting a location of a sweet spot in a multi-channel audio signal with respect to a location of a head of a listener, the method comprising:
-
tracking a head position of the listener by capturing two images from different perspectives via two image capturing units and by detecting an edge of a skin region of the head of the listener; generating a 2D coordinate value of a position of the face by tracking a skin color region of one of the two captured images, wherein a Gaussian skin classifier is applied only to a region of interest (ROI) consisting of the head of the listener substantially bound by the detected edge of the skin region of the head in order to reduce computation required to achieve color tracking; calculating a distance from the image pickup units to the listener using stereo area correlation of the two images; and setting the location of the sweet spot in the multi-channel audio signal to coincide with the generated 2D coordinate position of the face.
-
Specification