Intra-mode region-of-interest video object segmentation
First Claim
1. A method performed by an automatic region of interest (ROI) object segmentation apparatus, the method comprising:
- receiving, in the automatic ROI object segmentation apparatus, a video frame of a video sequence;
detecting, in the automatic ROI object segmentation apparatus, region of interest (ROI) features within the video frame independent of other video frames of the video sequence;
verifying the detected ROI features within the video frame based on a geometric relationship of the detected ROI features;
approximating, in the automatic ROI object segmentation apparatus, a ROI object shape within the video frame based on the detected ROI features and the geometric relationship of the detected ROI features,wherein approximating a ROI object shape comprises approximating the ROI object shape based on a geometric model defined by a location of the detected ROI features;
segmenting, in the automatic ROI object segmentation apparatus, the video frame into multiple candidate regions, wherein segmenting comprises classifying pixels within the video frame based on a similarity to neighboring pixels; and
selecting, in the automatic ROI object segmentation apparatus, one or more of the candidate regions located within the approximated ROI object shape of the video frame as an ROI object.
1 Assignment
0 Petitions
Accused Products
Abstract
The disclosure is directed to techniques for automatic segmentation of a region-of-interest (ROI) video object from a video sequence. ROI object segmentation enables selected ROI or “foreground” objects of a video sequence that may be of interest to a viewer to be extracted from non-ROI or “background” areas of the video sequence. Examples of a ROI object are a human face or a head and shoulder area of a human body. The disclosed techniques include a hybrid technique that combines ROI feature detection, region segmentation, and background subtraction. In this way, the disclosed techniques may provide accurate foreground object generation and low-complexity extraction of the foreground object from the video sequence. A ROI object segmentation system may implement the techniques described herein. In addition, ROI object segmentation may be useful in a wide range of multimedia applications that utilize video sequences, such as video telephony applications and video surveillance applications.
110 Citations
52 Claims
-
1. A method performed by an automatic region of interest (ROI) object segmentation apparatus, the method comprising:
-
receiving, in the automatic ROI object segmentation apparatus, a video frame of a video sequence; detecting, in the automatic ROI object segmentation apparatus, region of interest (ROI) features within the video frame independent of other video frames of the video sequence; verifying the detected ROI features within the video frame based on a geometric relationship of the detected ROI features; approximating, in the automatic ROI object segmentation apparatus, a ROI object shape within the video frame based on the detected ROI features and the geometric relationship of the detected ROI features, wherein approximating a ROI object shape comprises approximating the ROI object shape based on a geometric model defined by a location of the detected ROI features; segmenting, in the automatic ROI object segmentation apparatus, the video frame into multiple candidate regions, wherein segmenting comprises classifying pixels within the video frame based on a similarity to neighboring pixels; and selecting, in the automatic ROI object segmentation apparatus, one or more of the candidate regions located within the approximated ROI object shape of the video frame as an ROI object. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A non-transitory computer-readable medium having thereon instructions that cause a programmable processor to:
-
receive a video frame of a video sequence; detect region of interest (ROI) features within the video frame independent of other video frames of the video sequence; verify the detected ROI features within the video frame based on a geometric relationship of the detected ROI features; approximate a ROI object shape within the video frame based on the detected ROI features, wherein the ROI object shape is approximated based on a geometric model defined by a location of the detected ROI features and the geometric relationship of the detected ROI features; segment the video frame into multiple candidate wherein segmenting comprises classifying pixels within the video frame based on a similarity to neighboring pixels; and select one or more of the candidate regions located within the approximated ROI object shape of the video frame as an ROI object. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
-
-
27. An automatic region of interest (ROI) object segmentation system comprising:
-
a ROI feature detector that detects ROI features within a video frame of a video sequence independent of other video frames of the video sequence; a feature verification module that verifies the detected ROI features within the video frame based on a geometric relationship of the detected ROI features; an object shape approximation module that approximates a ROI object shape within the video frame based on the detected ROI features and the geometric relationship of the detected ROI features, wherein the ROI object shape is approximated based on a geometric model defined by a location of the detected ROI features; a region segmentation module that segments the video frame into multiple candidate regions, wherein segmentation by the region segmentation module comprises classifying each pixels within the video frame based on a similarity to neighboring pixels; and an object generation module that selects one or more of the candidate regions located within the approximated ROI object shape of the video frame as an ROI object. - View Dependent Claims (28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38)
-
-
39. A method performed by an automatic region of interest (ROI) object segmentation apparatus, the method comprising:
-
receiving, in the automatic ROI object segmentation apparatus, a video frame of a video sequence; detecting, in the automatic ROI object segmentation apparatus, region of interest (ROI) features within the video frame independent of other video frames of the video sequence, wherein the ROI features include a face mask including eye features and mouth features; combining, in the automatic ROI object segmentation apparatus, eye feature candidates and mouth feature candidates into eye-mouth triangles; verifying the eye-mouth triangles based on orientation of the eye-mouth triangles within the video frame; and approximating an ROI object shape within the video frame based on the detected ROI features, wherein the ROI object shape is approximated based on a geometric model defined by a location of the detected ROI features; and segmenting, in the automatic ROI object segmentation apparatus, the video frame into multiple candidate regions, wherein segmenting comprises classifying pixels within the video frame based on a similarity to neighboring pixels; and selecting, in the automatic ROI object segmentation apparatus, one or more of the candidate regions located within the approximated ROI object shape as an ROI object. - View Dependent Claims (40, 41, 42)
-
-
43. A non-transitory computer-readable medium having thereon instructions that cause a programmable processor to:
-
receive a video frame of a video sequence; detect region of interest (ROI) features within the video frame independent of other video frames of the video sequence, wherein the ROI features include a face mask including eye features and mouth features; combine eye feature candidates and mouth feature candidates into eye-mouth triangles; verify the eye-mouth triangles based on orientation of the eye-mouth triangles within the video frame; and approximate an ROI object shape within the video frame based on the detected ROI features, wherein the ROI object shape is approximated based on a geometric model defined by a location of the detected ROI features; and segment the video frame into multiple candidate regions, wherein segmenting comprises classifying pixels within the video frame based on a similarity to neighboring pixels; and select one or more of the candidate regions located within the approximated ROI object shape as an ROI object. - View Dependent Claims (44, 45, 46)
-
-
47. A system comprising:
-
a region of interest (ROI) feature detector that detects ROI features within a video frame of a video sequence independent of other video frames of the video sequence, wherein the ROI features include a face mask including eye features and mouth features; a feature verification module that combines eye feature candidates and mouth feature candidates into eye-mouth triangles, and verifies the eye-mouth triangles based on orientation of the eye-mouth triangles within the video frame; an object shape approximation module that approximates a ROI object shape within the video frame based on a geometric model defined by a location of the detected ROI features; a region segmentation module that segments the video frame into multiple candidate regions, wherein segmenting the frame into multiple candidate regions comprises classifying pixels within the video frame based on a similarity to neighboring pixels; and an object generation module that selects one or more of the candidate regions located within the approximated ROI object shape as an ROI object. - View Dependent Claims (48, 49, 50)
-
-
51. An apparatus for automatically segmenting a region of interest (ROI), the apparatus comprising:
-
means for receiving, in the automatic ROI object segmentation apparatus, a video frame of a video sequence; means for detecting, in the automatic ROI object segmentation apparatus, region of interest (ROI) features within the video frame independent of other video frames of the video sequence; means for verifying the detected ROI features within the video frame based on a geometric relationship of the detected ROI features; means for approximating, in the automatic ROI object segmentation apparatus, a ROI object shape within the video frame based on the detected ROI features and the geometric relationship of the detected ROI features, wherein approximating a ROI object shape comprises approximating the ROI object shape based on a geometric model defined by a location of the detected ROI features; means for segmenting, in the automatic ROI object segmentation apparatus, the video frame into multiple candidate regions, wherein segmenting comprises classifying pixels within the video frame based on a similarity to neighboring pixels; and means for selecting, in the automatic ROI object segmentation apparatus, one or more of the candidate regions located within the approximated ROI object shape of the video frame as an ROI object.
-
-
52. An apparatus for automatically segmenting a region of interest (ROI), the apparatus comprising:
-
means for receiving, in the automatic ROI object segmentation apparatus, a video frame of a video sequence; means for detecting, in the automatic ROI object segmentation apparatus, region of interest (ROI) features within the video frame independent of other video frames of the video sequence, wherein the ROI features include a face mask including eye features and mouth features; means for combining, in the automatic ROI object segmentation apparatus, eye feature candidates and mouth feature candidates into eye-mouth triangles; means for verifying the eye-mouth triangles based on orientation of the eye-mouth triangles within the video frame; and means for approximating an ROI object shape within the video frame based on the detected ROI features, wherein the ROI object shape is approximated based on a geometric model defined by a location of the detected ROI features; and means for segmenting, in the automatic ROI object segmentation apparatus, the video frame into multiple candidate regions, wherein segmenting comprises classifying pixels within the video frame based on a similarity to neighboring pixels; and means for selecting, in the automatic ROI object segmentation apparatus, one or more of the candidate regions located within the approximated ROI object shape as an ROI object.
-
Specification