METHOD AND APPARATUS FOR AUTOMATICALLY SUMMARIZING VIDEO
Abstract
One embodiment of the present invention provides a system that automatically produces a summary of a video. During operation, the system partitions the video into scenes and then determines similarities between the scenes. Next, the system selects representative scenes from the video based on the determined similarities, and combines the selected scenes to produce the summary for the video.
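The selection and combination steps described in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: it assumes a scene-similarity matrix is already available, scores each scene by its summed similarity to the other scenes (the claims only require that the score increase with such similarity), and keeps the top k scenes. The function names, the parameter k, and the sample data are all hypothetical.

```python
def importance_scores(sim):
    """Score each scene by its summed similarity to every other scene (assumed rule)."""
    n = len(sim)
    return [sum(sim[i][j] for j in range(n) if j != i) for i in range(n)]


def select_scenes(scores, k):
    """Return the indices of the k highest-scoring scenes, in chronological order."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])


def combine(scenes, chosen):
    """Concatenate the frame indices of the chosen scenes to form the summary."""
    return [frame for i in chosen for frame in range(*scenes[i])]


# Hypothetical input: four scenes as (start, end) frame ranges, end exclusive,
# and a made-up symmetric scene-similarity matrix.
scenes = [(0, 2), (2, 4), (4, 5), (5, 8)]
sim = [
    [1.0, 0.1, 0.9, 0.2],
    [0.1, 1.0, 0.1, 0.3],
    [0.9, 0.1, 1.0, 0.2],
    [0.2, 0.3, 0.2, 1.0],
]

scores = importance_scores(sim)
chosen = select_scenes(scores, k=2)
summary_frames = combine(scenes, chosen)
print(chosen, summary_frames)
```

Note that a plain top-k ranking like this can pick scenes that closely resemble each other; the text in this section does not say how such redundancy is handled.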
20 Claims
1. A method for automatically producing a summary of a video, comprising:
partitioning a video into scenes using a frame-similarity matrix, each element in the frame-similarity matrix representing a distance between feature vectors of a corresponding pair of frames;
generating a scene-similarity matrix comprising a plurality of elements based on the frame-similarity matrix, each element of the scene-similarity matrix representing a measure of similarity between different scenes of the video;
determining, by a processor, an importance score for each scene based on the scene-similarity matrix, the importance score for a scene indicating a relative importance of the scene and wherein the importance score is increased responsive to the scene having a high similarity with other scenes in the video;
selecting representative scenes from the video based on the determined importance scores; and
combining selected scenes to produce the summary for the video.
Dependent claims: 2, 3, 4, 5, 6, 7
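The first two steps of claim 1 (building a frame-similarity matrix, partitioning into scenes, and deriving a scene-similarity matrix) can be sketched as below. Several choices here are assumptions made only for illustration and are not stated in the claim: Euclidean distance between feature vectors, a fixed threshold on the distance between consecutive frames to mark scene boundaries, and the mapping 1 / (1 + mean distance) to turn frame distances into a scene similarity. All names and sample values are hypothetical.

```python
import math


def frame_distance_matrix(features):
    """Pairwise Euclidean distances between per-frame feature vectors (assumed metric)."""
    return [[math.dist(a, b) for b in features] for a in features]


def partition_into_scenes(dist, threshold):
    """Start a new scene wherever consecutive frames are farther apart than threshold.

    Returns (start, end) frame index ranges with end exclusive.
    """
    n = len(dist)
    if n == 0:
        return []
    scenes, start = [], 0
    for i in range(1, n):
        if dist[i - 1][i] > threshold:
            scenes.append((start, i))
            start = i
    scenes.append((start, n))
    return scenes


def scene_similarity_matrix(dist, scenes):
    """Similarity of two scenes = 1 / (1 + mean frame distance across the pair)."""
    sim = []
    for a0, a1 in scenes:
        row = []
        for b0, b1 in scenes:
            total = sum(dist[i][j] for i in range(a0, a1) for j in range(b0, b1))
            mean = total / ((a1 - a0) * (b1 - b0))
            row.append(1.0 / (1.0 + mean))
        sim.append(row)
    return sim


# Hypothetical 2-D feature vectors for five frames: the last frame
# returns to the visual content of the first two.
features = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0), (0.0, 0.1)]

dist = frame_distance_matrix(features)
scenes = partition_into_scenes(dist, threshold=1.0)
sim = scene_similarity_matrix(dist, scenes)
print(scenes)
```

With this sample data the first and last scenes come out far more similar to each other than either is to the middle scene, which is the kind of structure an importance score based on scene similarity would then pick up.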
8. A non-transitory computer-readable storage medium storing instructions that, when executed by a computer, cause the computer to perform a method for automatically producing a summary of a video, the method comprising:
partitioning a video into scenes using a frame-similarity matrix, each element in the frame-similarity matrix representing a distance between feature vectors of a corresponding pair of frames;
generating a scene-similarity matrix comprising a plurality of elements based on the frame-similarity matrix, each element of the scene-similarity matrix representing a measure of similarity between different scenes of the video;
determining an importance score for each scene based on the scene-similarity matrix, the importance score for a scene indicating a relative importance of the scene and wherein the importance score is increased responsive to the scene having a high similarity with other scenes in the video;
selecting representative scenes from the video based on the determined importance scores; and
combining selected scenes to produce the summary for the video.
Dependent claims: 9, 10, 11, 12, 13, 14
15. An apparatus that automatically produces a summary of a video, comprising:
a non-transitory computer readable storage medium storing instructions executable to perform steps comprising:
partitioning a video into scenes using a frame-similarity matrix, each element in the frame-similarity matrix representing a distance between feature vectors of a corresponding pair of frames;
generating a scene-similarity matrix comprising a plurality of elements based on the frame-similarity matrix, each element of the scene-similarity matrix representing a measure of similarity between different scenes of the video;
determining an importance score for each scene based on the scene-similarity matrix, the importance score for a scene indicating a relative importance of the scene and wherein the importance score is increased responsive to the scene having a high similarity with other scenes in the video;
selecting representative scenes from the video based on the determined importance scores; and
combining selected scenes to produce the summary for the video; and
a processor configured to execute the instructions.
Dependent claims: 16, 17, 18, 19, 20
Specification