Multi-mode region-of-interest video object segmentation
First Claim
1. A method performed by a video coding device, the method comprising:
- receiving a video frame of a video sequence;
applying, in the video coding device, one or more segmentation mode decision factors to the video frame to select a segmentation mode from at least a first segmentation mode and a second segmentation mode, wherein each of the first and second segmentation modes comprise modes of segmenting region of interest (ROI) objects from the video frame and wherein applying the one or more segmentation mode decision factors to select the segmentation mode comprises determining a computational complexity of the video frame by determining a number of ROI features within the video frame and selecting the first segmentation mode when the computation complexity is above a pre-determined level;
segmenting, in the video coding device, an ROI object from the video frame without reference to motion information for the video frame when the first segmentation mode is selected; and
segmenting, in the video coding device, an ROI object from the video frame based on motion information for the video frame and a different video frame of the video sequence when the second segmentation mode is selected.
1 Assignment
0 Petitions
Accused Products
Abstract
The disclosure is directed to techniques for automatic segmentation of a region-of-interest (ROI) video object from a video sequence. ROI object segmentation enables selected ROI or “foreground” objects of a video sequence that may be of interest to a viewer to be extracted from non-ROI or “background” areas of the video sequence. Examples of a ROI object are a human face or a head and shoulder area of a human body. The disclosed techniques include a hybrid technique that combines ROI feature detection, region segmentation, and background subtraction. In this way, the disclosed techniques may provide accurate foreground object generation and low-complexity extraction of the foreground object from the video sequence. A ROI object segmentation system may implement the techniques described herein. In addition, ROI object segmentation may be useful in a wide range of multimedia applications that utilize video sequences, such as video telephony applications and video surveillance applications.
137 Citations
34 Claims
-
1. A method performed by a video coding device, the method comprising:
-
receiving a video frame of a video sequence; applying, in the video coding device, one or more segmentation mode decision factors to the video frame to select a segmentation mode from at least a first segmentation mode and a second segmentation mode, wherein each of the first and second segmentation modes comprise modes of segmenting region of interest (ROI) objects from the video frame and wherein applying the one or more segmentation mode decision factors to select the segmentation mode comprises determining a computational complexity of the video frame by determining a number of ROI features within the video frame and selecting the first segmentation mode when the computation complexity is above a pre-determined level; segmenting, in the video coding device, an ROI object from the video frame without reference to motion information for the video frame when the first segmentation mode is selected; and segmenting, in the video coding device, an ROI object from the video frame based on motion information for the video frame and a different video frame of the video sequence when the second segmentation mode is selected. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A non-transitory computer-readable medium having stored thereon instructions that when executed by a programmable processor cause the programmable processor to:
-
receive a video frame of a video sequence; apply one or more segmentation mode decision factors to the video frame to select a a segmentation mode from at least a first segmentation mode and a second segmentation mode, wherein each of the first and second segmentation modes comprise modes of segmenting region of interest (ROI) objects from the video frame and wherein to apply the one or more segmentation mode decision factors to select the segmentation mode, the instructions cause the processor to determine a computational complexity of the video frame by determining a number of ROI features within the video frame and selecting select first segmentation mode when the computation complexity is above a pre-determined level; segment an ROI object from the video frame without reference to motion information for the video frame when the first segmentation mode is selected; and segment an ROI object from the video frame based on motion information for the video frame and a different video frame of the video sequence when the second segmentation mode is selected. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17)
-
-
18. A video encoding device including a processor programmed to:
-
receive a video frame of a video sequence; apply one or more segmentation mode decision factors to the video frame to select a segmentation mode from at least a first segmentation mode and a second segmentation mode, wherein each of the first and second segmentation modes comprise modes of segmenting region of interest (ROI) objects from the video frame and wherein to apply the one or more segmentation mode decision factors to select the segmentation mode, the processor is configured to determine a computational complexity of the video frame by determining a number of ROI features within the video frame and selecting select first segmentation mode when the computation complexity is above a pre-determined level; segment an ROI object from the video frame without reference to motion information for the video frame when the first segmentation mode is selected; and segment an ROI object from the video frame based on motion information for the video frame and a different video frame of the video sequence when the second segmentation mode is selected. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25)
-
-
26. A video coding device comprising:
-
means for receiving a video frame of a video sequence; means for applying one or more segmentation mode decision factors to the video frame to select a segmentation mode from at least a first segmentation mode and a second segmentation mode, wherein each of the first and second segmentation modes comprise modes of segmenting region of interest (ROI) objects from the video frame and wherein said applying means comprises means for determining a computational complexity of the video frame by determining a number of ROI features within the video frame and means for selecting the first segmentation mode when the computation complexity is above a pre-determined level; means for segmenting an ROI object from the video frame without reference to motion information for the video frame when the first segmentation mode is selected; and means for segmenting an ROI object from the video frame based on motion information for the video frame and a different video frame of the video sequence when the second segmentation mode is selected. - View Dependent Claims (27, 28, 29, 30, 31, 32, 33, 34)
-
Specification