Intra-mode region-of-interest video object segmentation
First Claim
1. A method comprising:
- receiving a video frame of a video sequence;
detecting region of interest (ROI) features within the video frame;
approximating a ROI object shape within the video frame based on the detected ROI features;
segmenting the video frame into multiple candidate regions based on the detected ROI features; and
selecting one or more of the candidate regions located within the ROI object shape of the video frame as an ROI object.
1 Assignment
0 Petitions
Accused Products
Abstract
The disclosure is directed to techniques for automatic segmentation of a region-of-interest (ROI) video object from a video sequence. ROI object segmentation enables selected ROI or “foreground” objects of a video sequence that may be of interest to a viewer to be extracted from non-ROI or “background” areas of the video sequence. Examples of a ROI object are a human face or a head and shoulder area of a human body. The disclosed techniques include a hybrid technique that combines ROI feature detection, region segmentation, and background subtraction. In this way, the disclosed techniques may provide accurate foreground object generation and low-complexity extraction of the foreground object from the video sequence. A ROI object segmentation system may implement the techniques described herein. In addition, ROI object segmentation may be useful in a wide range of multimedia applications that utilize video sequences, such as video telephony applications and video surveillance applications.
187 Citations
50 Claims
-
1. A method comprising:
-
receiving a video frame of a video sequence;
detecting region of interest (ROI) features within the video frame;
approximating a ROI object shape within the video frame based on the detected ROI features;
segmenting the video frame into multiple candidate regions based on the detected ROI features; and
selecting one or more of the candidate regions located within the ROI object shape of the video frame as an ROI object. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A computer-readable medium comprising instructions that cause a programmable processor to:
-
receive a video frame of a video sequence;
detect region of interest (ROI) features within the video frame;
approximate a ROI object shape within the video frame based on the detected ROI features;
segment the video frame into multiple candidate regions based on the detected ROI features; and
select one or more of the candidate regions located within the ROI object shape of the video frame as an ROI object. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
-
-
27. An automatic region of interest (ROI) object segmentation system comprising:
-
a ROI feature detector that detects ROI features within a video frame of a video sequence;
an object shape approximation module that approximates a ROI object shape within the video frame based on the detected ROI features;
a region segmentation module that segments the video frame into multiple candidate regions based on the detected ROI features; and
a object generation module that selects one or more of the candidate regions located within the ROI object shape of the video frame as an ROI object. - View Dependent Claims (28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38)
-
-
39. A method comprising:
-
receiving a video frame of a video sequence;
detecting region of interest (ROI) features within the video frame, wherein the ROI features include a face mask including eye features and mouth features;
segmenting the video frame into multiple candidate regions based on the detected ROI features;
combining eye feature candidates and mouth feature candidates into eye-mouth triangles;
verifying the eye-mouth triangles based on orientation of the eye-mouth triangles within the video frame; and
selecting one or more of the candidate regions as an ROI object based on the verification. - View Dependent Claims (40, 41, 42)
-
-
43. A computer-readable medium comprising instructions to cause a programmable processor to:
-
receive a video frame of a video sequence;
detect region of interest (ROI) features within the video frame, wherein the ROI features include a face mask including eye features and mouth features;
segment the video frame into multiple candidate regions based on the detected ROI features;
combine eye feature candidates and mouth feature candidates into eye-mouth triangles;
verify the eye-mouth triangles based on orientation of the eye-mouth triangles within the video frame; and
select one or more of the candidate regions as an ROI object based on the verification. - View Dependent Claims (44, 45, 46)
-
-
47. A system comprising:
-
a region of interest (ROI) feature detector that detects ROI features within a video frame of a video sequence, wherein the ROI features include a face mask including eye features and mouth features;
a region segmentation module that segments the video frame into multiple candidate regions based on the detected ROI features;
a feature verification module that combines eye feature candidates and mouth feature candidates into eye-mouth triangles, and verifies the eye-mouth triangles based on orientation of the eye-mouth triangles within the video frame; and
an object generation module that selects one or more of the candidate regions as an ROI object based on the verification. - View Dependent Claims (48, 49, 50)
-
Specification