Methods and systems for classifying pixels as foreground using both short-range depth data and long-range depth data
First Claim
1. A method comprising:
- obtaining, from a three-dimensional (3-D) video camera, video data depicting at least a portion of a user;
obtaining, from the 3-D video camera operating in a short-range mode, short-range depth data associated with the video data;
obtaining, from the 3-D video camera operating in a long-range mode, long-range depth data associated with the video data;
detecting less than a threshold amount of motion in at least one of the obtained short-range depth data and the obtained video data during a motion-detection period;
classifying pixels of the video data as foreground based at least in part on a comparison between the short-range depth data and the long-range depth data; and
extracting a user-persona from the video data based at least in part on the pixels of the video data classified as foreground.
3 Assignments
0 Petitions
Accused Products
Abstract
Disclosed herein are methods and systems for classifying pixels as foreground using both short-range depth data and long-range depth data. One embodiment takes the form of a process that includes obtaining video data depicting at least a portion of a user. The process also includes obtaining short-range depth data associated with the video data. The process also includes obtaining long-range depth data associated with the video data. The video data, short-range depth data, and long-range depth data may be obtained via a single 3-D video camera. The process also includes classifying pixels of the video data as foreground based at least in part on both the short-range depth data and the long-range depth data. In some embodiments, classifying pixels of the video data as foreground comprises employing an alpha mask. The alpha mask may comprise binary foreground (hard) indicators. The alpha mask may comprise foreground-likelihood (soft) indicators.
-
Citations
20 Claims
-
1. A method comprising:
-
obtaining, from a three-dimensional (3-D) video camera, video data depicting at least a portion of a user; obtaining, from the 3-D video camera operating in a short-range mode, short-range depth data associated with the video data; obtaining, from the 3-D video camera operating in a long-range mode, long-range depth data associated with the video data; detecting less than a threshold amount of motion in at least one of the obtained short-range depth data and the obtained video data during a motion-detection period; classifying pixels of the video data as foreground based at least in part on a comparison between the short-range depth data and the long-range depth data; and extracting a user-persona from the video data based at least in part on the pixels of the video data classified as foreground. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A system comprising:
-
a communication interface; a processor; and data storage containing instructions executable by the processor for causing the system to carry out a set of functions, the set of functions including; obtaining, from a three-dimensional (3-D) video camera, video data depicting at least a portion of a user; obtaining, from the 3-D video camera operating in a short-range mode, short-range depth data associated with the video data; obtaining, from the 3-D video camera operating in a long-range mode, long-range depth data associated with the video data; detecting less than a threshold amount of motion in at least one of the obtained short-range depth data and the obtained video data during a motion-detection period; classifying pixels of the video data as foreground based at least in part on a comparison between the short-range depth data and the long-range depth data; and extracting a user-persona from the video data based at least in part on the pixels of the video data classified as foreground.
-
-
15. A method comprising:
-
obtaining, from a three-dimensional (3-D) video camera, video data depicting at least a portion of a user; obtaining, from the 3-D video camera operating in a short-range mode, short-range depth data associated with the video data; obtaining, from the 3-D video camera operating in a long-range mode, long-range depth data associated with the video data; classifying pixels of the video data as foreground based at least in part on a comparison between the short-range depth data and the long-range depth data; extracting a user-persona from the video data based at least in part on the pixels of the video data classified as foreground; identifying a short-range foreground region at least in part by using the short-range depth data; identifying a long-range foreground region at least in part by using the long-range depth data; and determining a user-hair region of the video data at least in part by using both the short-range foreground region and the long-range foreground region. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification