Identifying scene boundaries using group sparsity analysis
First Claim
1. A method for determining scene boundaries within a video sequence including a time sequence of video frames, each video frame including an array of image pixels having pixel values, comprising:
a) selecting a set of video frames from the video sequence, the set of video frames comprising a starting point and an ending point;
b) extracting a feature vector for each video frame in the set of video frames;
c) applying a group sparsity algorithm to represent the feature vector for a particular video frame as a group sparse combination of the feature vectors for the other video frames in the set of video frames, each feature vector for the other video frames in the group sparse combination having an associated weighting coefficient, wherein the weighting coefficients for feature vectors corresponding to other video frames that are most similar to the particular video frame are non-zero, and the weighting coefficients for feature vectors corresponding to other video frames that are most dissimilar from the particular video frame are zero;
d) analyzing the weighting coefficients to determine a video frame cluster of temporally-contiguous, similar video frames that includes the particular video frame;
e) repeating steps c)-d) for a plurality of particular video frames to provide a plurality of video frame clusters;
f) identifying one or more scene boundaries corresponding to scenes in the video sequence based on the locations of boundaries between the determined video frame clusters; and
g) storing an indication of the identified scene boundaries in a processor-accessible memory;
wherein selecting a set of video frames from the video sequence comprises temporally sub-sampling the video sequence to select a subset of video frames.
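The group sparsity step in c) — representing each frame's feature vector as a combination of the other frames' feature vectors in which whole groups of coefficients are driven to zero together — can be sketched as a group-lasso problem solved by proximal gradient descent with block soft-thresholding. This is an illustrative sketch under assumptions, not the patented implementation: the function name, the temporal grouping scheme, and the fixed regularization weight `lam` are all choices made here for illustration.

```python
import numpy as np

def group_sparse_code(x, D, groups, lam=0.1, n_iter=300):
    """Represent x as a group-sparse combination of the columns of D.

    x      : (d,) feature vector of the particular video frame
    D      : (d, n) feature vectors of the other frames, one per column
    groups : list of index arrays, one per temporal group of frames
    Solves  min_w 0.5*||x - D @ w||^2 + lam * sum_g ||w_g||_2
    by proximal gradient descent with block soft-thresholding, so the
    coefficients of dissimilar frame groups are set exactly to zero.
    """
    n = D.shape[1]
    w = np.zeros(n)
    # step size from the Lipschitz constant of the quadratic term
    eta = 1.0 / (np.linalg.norm(D, 2) ** 2 + 1e-12)
    for _ in range(n_iter):
        grad = D.T @ (D @ w - x)
        z = w - eta * grad
        # block soft-thresholding: shrink each group, zeroing it
        # entirely when its norm falls below the threshold
        for g in groups:
            norm = np.linalg.norm(z[g])
            z[g] = 0.0 if norm <= eta * lam else (1 - eta * lam / norm) * z[g]
        w = z
    return w
```

With an orthonormal dictionary the fixed point is the classic group-lasso shrinkage of the correlations, so a group of frames unrelated to `x` receives exactly zero weight, matching the claim's requirement that coefficients for the most dissimilar frames be zero.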
Abstract
A method for identifying a set of key video frames from a video sequence comprising extracting feature vectors for each video frame and applying a group sparsity algorithm to represent the feature vector for a particular video frame as a group sparse combination of the feature vectors for the other video frames. Weighting coefficients associated with the group sparse combination are analyzed to determine video frame clusters of temporally-contiguous, similar video frames. The video sequence is segmented into scenes by identifying scene boundaries based on the determined video frame clusters.
10 Claims
1. A method for determining scene boundaries within a video sequence including a time sequence of video frames, each video frame including an array of image pixels having pixel values, comprising:
a) selecting a set of video frames from the video sequence, the set of video frames comprising a starting point and an ending point;
b) extracting a feature vector for each video frame in the set of video frames;
c) applying a group sparsity algorithm to represent the feature vector for a particular video frame as a group sparse combination of the feature vectors for the other video frames in the set of video frames, each feature vector for the other video frames in the group sparse combination having an associated weighting coefficient, wherein the weighting coefficients for feature vectors corresponding to other video frames that are most similar to the particular video frame are non-zero, and the weighting coefficients for feature vectors corresponding to other video frames that are most dissimilar from the particular video frame are zero;
d) analyzing the weighting coefficients to determine a video frame cluster of temporally-contiguous, similar video frames that includes the particular video frame;
e) repeating steps c)-d) for a plurality of particular video frames to provide a plurality of video frame clusters;
f) identifying one or more scene boundaries corresponding to scenes in the video sequence based on the locations of boundaries between the determined video frame clusters; and
g) storing an indication of the identified scene boundaries in a processor-accessible memory;
wherein selecting a set of video frames from the video sequence comprises temporally sub-sampling the video sequence to select a subset of video frames. - View Dependent Claims (2, 3, 4)
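One hypothetical reading of steps d)-f) above: collect the weighting coefficients produced for all particular frames into a matrix, and place a scene boundary wherever temporally adjacent frames do not participate in each other's group-sparse representations. The matrix convention `W[i, j]`, the threshold, and the adjacency test are assumptions made for this sketch, not the claimed analysis procedure.

```python
import numpy as np

def clusters_to_boundaries(W, thresh=1e-6):
    """Derive scene boundaries from a matrix of weighting coefficients.

    W[i, j] is the coefficient of frame j in the group-sparse
    representation of frame i (the diagonal is ignored). Adjacent
    frames i and i+1 are taken to lie in the same temporally
    contiguous cluster when either represents the other with a
    coefficient above `thresh`; a scene boundary is recorded at
    every frame index where that link is absent.
    """
    n = W.shape[0]
    boundaries = []
    for i in range(n - 1):
        linked = abs(W[i, i + 1]) > thresh or abs(W[i + 1, i]) > thresh
        if not linked:
            boundaries.append(i + 1)  # a new scene starts at frame i+1
    return boundaries
```

For a block-diagonal coefficient matrix (frames only represent frames within their own scene), this recovers exactly the block edges as scene boundaries.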
5. A method for determining scene boundaries within a video sequence including a time sequence of video frames, each video frame including an array of image pixels having pixel values, comprising:
a) selecting a set of video frames from the video sequence, the set of video frames comprising a starting point and an ending point;
b) extracting a feature vector for each video frame in the set of video frames;
c) applying a group sparsity algorithm to represent the feature vector for a particular video frame as a group sparse combination of the feature vectors for the other video frames in the set of video frames, each feature vector for the other video frames in the group sparse combination having an associated weighting coefficient, wherein the weighting coefficients for feature vectors corresponding to other video frames that are most similar to the particular video frame are non-zero, and the weighting coefficients for feature vectors corresponding to other video frames that are most dissimilar from the particular video frame are zero;
d) analyzing the weighting coefficients to determine a video frame cluster of temporally-contiguous, similar video frames that includes the particular video frame;
e) repeating steps c)-d) for a plurality of particular video frames to provide a plurality of video frame clusters;
f) identifying one or more scene boundaries corresponding to scenes in the video sequence based on the locations of boundaries between the determined video frame clusters; and
g) storing an indication of the identified scene boundaries in a processor-accessible memory;
wherein the video sequence comprises a plurality of independently encoded video frames and a plurality of video frames that are encoded using inter-frame coding, and wherein selecting a set of video frames from the video sequence comprises selecting at least a subset of the plurality of independently encoded video frames. - View Dependent Claims (6)
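The frame-selection variant in this claim restricts the analysis set to independently encoded (intra-coded) frames, which can be decoded without reference to other frames. A minimal sketch, assuming the per-frame coding types ('I', 'P', 'B') have already been obtained from a demuxer such as ffprobe; the function name and the label convention are assumptions:

```python
def select_independent_frames(frame_types):
    """Select the subset of independently encoded video frames.

    frame_types : sequence of per-frame coding labels, where 'I'
                  marks an independently (intra) encoded frame and
                  'P'/'B' mark inter-frame coded frames.
    Returns the indices of the 'I' frames; these form the set of
    video frames for the scene-boundary analysis without decoding
    the inter-coded frames.
    """
    return [i for i, t in enumerate(frame_types) if t == 'I']
```

Because intra-coded frames typically begin each group of pictures, this selection also acts as a temporal sub-sampling of the sequence.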
7. A method for determining scene boundaries within a video sequence including a time sequence of video frames, each video frame including an array of image pixels having pixel values, comprising:
a) selecting a set of video frames from the video sequence, the set of video frames comprising a starting point and an ending point;
b) extracting a feature vector for each video frame in the set of video frames;
c) applying a group sparsity algorithm to represent the feature vector for a particular video frame as a group sparse combination of the feature vectors for the other video frames in the set of video frames, each feature vector for the other video frames in the group sparse combination having an associated weighting coefficient, wherein the weighting coefficients for feature vectors corresponding to other video frames that are most similar to the particular video frame are non-zero, and the weighting coefficients for feature vectors corresponding to other video frames that are most dissimilar from the particular video frame are zero;
d) analyzing the weighting coefficients to determine a video frame cluster of temporally-contiguous, similar video frames that includes the particular video frame;
e) repeating steps c)-d) for a plurality of particular video frames to provide a plurality of video frame clusters;
f) identifying one or more scene boundaries corresponding to scenes in the video sequence based on the locations of boundaries between the determined video frame clusters; and
g) storing an indication of the identified scene boundaries in a processor-accessible memory;
wherein extracting a feature vector for each video frame comprises extracting a color channel of each video frame. - View Dependent Claims (8, 9, 10)
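The feature-extraction variant in this claim builds the feature vector from a color channel of each frame. A minimal sketch, assuming an RGB pixel array and using a normalized histogram of one channel as the per-frame feature; the histogram size, the normalization, and the channel ordering are assumptions, not details from the claim:

```python
import numpy as np

def color_channel_feature(frame, channel=0, bins=32):
    """Extract a feature vector from one color channel of a frame.

    frame   : (H, W, 3) uint8 pixel array in RGB order
    channel : which color channel to use (0 = red under this ordering)
    Returns an L1-normalized histogram of the channel's pixel values,
    a compact fixed-length feature vector suitable as a column of the
    dictionary used in the group sparsity step.
    """
    values = frame[:, :, channel].ravel()
    hist, _ = np.histogram(values, bins=bins, range=(0, 256))
    return hist.astype(float) / max(hist.sum(), 1)
```

A histogram discards spatial layout but changes sharply across scene cuts, which is what the boundary analysis needs from its features.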
Specification