Multi-mode region-of-interest video object segmentation
Abstract
The disclosure is directed to techniques for automatic segmentation of a region-of-interest (ROI) video object from a video sequence. ROI object segmentation enables selected ROI or “foreground” objects of a video sequence that may be of interest to a viewer to be extracted from non-ROI or “background” areas of the video sequence. Examples of an ROI object are a human face or a head and shoulder area of a human body. The disclosed techniques include a hybrid technique that combines ROI feature detection, region segmentation, and background subtraction. In this way, the disclosed techniques may provide accurate foreground object generation and low-complexity extraction of the foreground object from the video sequence. An ROI object segmentation system may implement the techniques described herein. In addition, ROI object segmentation may be useful in a wide range of multimedia applications that utilize video sequences, such as video telephony applications and video surveillance applications.
23 Claims
1. A method comprising:
receiving a video frame of a video sequence;
determining an amount of motion activity between the video frame and the different video frame of the video sequence;
applying one or more segmentation mode decision factors to the video frame to select a segmentation mode from at least a first segmentation mode and a second segmentation mode comprising selecting the first segmentation mode when the amount of motion activity is above a pre-determined level;
segmenting a region of interest (ROI) object from the video frame without reference to motion information for the video frame when the first segmentation mode is selected; and
segmenting an ROI object from the video frame based on motion information for the video frame and a different video frame of the video sequence when the second segmentation mode is selected.
Dependent claims: 2, 3, 4, 5, 6, 7, 8

9. A non-transitory computer-readable storage medium having stored thereon instructions that cause a programmable processor to:
receive a video frame of a video sequence;
determine an amount of motion activity between the video frame and the different video frame of the video sequence;
apply one or more segmentation mode decision factors to the video frame to select a segmentation mode from at least a first segmentation mode and a second segmentation mode comprising selecting the first segmentation mode when the amount of motion activity is above a pre-determined level;
segment a region of interest (ROI) object from the video frame without reference to motion information for the video frame when the first segmentation mode is selected; and
segment an ROI object from the video frame based on motion information for the video frame and a different video frame of the video sequence when the second segmentation mode is selected.
Dependent claims: 10, 11, 12, 13, 14, 15

16. A video encoding device including a processor configured to:
receive a video frame of a video sequence;
determine an amount of motion activity between the video frame and the different video frame of the video sequence;
apply one or more segmentation mode decision factors to the video frame to select a segmentation mode from at least a first segmentation mode and a second segmentation mode, wherein the processor is configured to select the first segmentation mode when the amount of similarity is below a pre-determined level;
segment a region of interest (ROI) object from the video frame without reference to motion information for the video frame when the first segmentation mode is selected; and
segment an ROI object from the video frame based on motion information for the video frame and a different video frame of the video sequence when the second segmentation mode is selected.
Dependent claims: 17, 18, 19, 20, 21, 22

23. A video coding device comprising:
means for receiving a video frame of a video sequence;
means for determining an amount of motion activity between the video frame and the different video frame of the video sequence;
means for applying one or more segmentation mode decision factors to the video frame to select a segmentation mode from at least a first segmentation mode and a second segmentation mode comprising selecting the first segmentation mode when the amount of motion activity is above a pre-determined level;
means for segmenting a region of interest (ROI) object from the video frame without reference to motion information for the video frame when the first segmentation mode is selected; and
means for segmenting an ROI object from the video frame based on motion information for the video frame and a different video frame of the video sequence when the second segmentation mode is selected.
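The mode decision recited in the independent claims can be illustrated with a minimal Python sketch. It is not the patented implementation and is only one way to read the claim language. Everything named here is an assumption made for illustration: the mean-absolute-difference motion measure, the threshold value of 20.0, the choice of the first mode when no other frame is available, and the two placeholder segmenters (`segment_without_motion`, `segment_with_motion`), which stand in for the ROI feature detection, region segmentation, and background subtraction mentioned in the abstract.

```python
from typing import List, Optional, Tuple

Frame = List[List[int]]  # grayscale frame: rows of pixel intensities
Mask = List[List[int]]   # 1 = ROI (foreground), 0 = non-ROI (background)


def motion_activity(frame: Frame, other: Frame) -> float:
    """Assumed motion measure: mean absolute pixel difference."""
    total, count = 0, 0
    for row_a, row_b in zip(frame, other):
        for a, b in zip(row_a, row_b):
            total += abs(a - b)
            count += 1
    return total / count if count else 0.0


def select_mode(frame: Frame, other: Optional[Frame],
                threshold: float = 20.0) -> str:
    """Pick the first mode when motion activity exceeds the threshold.

    Assumption: with no other frame to compare against, fall back to
    the first mode, since it needs no motion information.
    """
    if other is None:
        return "first"
    return "first" if motion_activity(frame, other) > threshold else "second"


def segment_without_motion(frame: Frame, level: int = 128) -> Mask:
    """Hypothetical placeholder: intensity threshold on one frame only."""
    return [[1 if p >= level else 0 for p in row] for row in frame]


def segment_with_motion(frame: Frame, other: Frame, delta: int = 1) -> Mask:
    """Hypothetical placeholder: mark pixels that changed between frames."""
    return [[1 if abs(a - b) >= delta else 0 for a, b in zip(ra, rb)]
            for ra, rb in zip(frame, other)]


def segment_roi(frame: Frame, other: Optional[Frame],
                threshold: float = 20.0) -> Tuple[str, Mask]:
    """Select a mode, then dispatch to the matching segmenter."""
    mode = select_mode(frame, other, threshold)
    if mode == "first" or other is None:
        return "first", segment_without_motion(frame)
    return "second", segment_with_motion(frame, other)


still = [[10, 10], [10, 10]]
moved = [[10, 10], [10, 12]]      # small change: low motion activity
jump = [[200, 200], [200, 200]]   # large change: high motion activity

print(segment_roi(moved, still))  # low activity uses the second mode
print(segment_roi(jump, still))   # high activity uses the first mode
```

In this sketch a large frame-to-frame difference routes the frame to the segmenter that ignores motion information, while a small difference routes it to the segmenter that compares the two frames, mirroring the two branches of claim 1.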
Specification