Brief and high-interest video summary generation
First Claim
1. A method of video summarization comprising:
- determining if a video contains a junk frame;
modifying a boundary of shots of the video based on the determining if a video contains a junk frame;
sampling a plurality of the shots of the video into a plurality of subshots;
clustering the plurality of subshots with a multiple step k-means clustering, comprising;
clustering the plurality of subshots;
determining a distance among the plurality of subshots as a weighted summation of a motion compensated matching error, a histogram difference in red, green, blue color space, and a histogram difference in hue, saturation, value color space;
determining a distance between a plurality of clusters as a maximum distance between pairs of subshots within the plurality of clusters;
merging the plurality of clusters if a determined distance between the plurality of clusters is smaller than a predetermined threshold; and
dividing a cluster in the plurality of clusters if a difference in a keyframe file size in the cluster exceeds a predetermined threshold; and
creating a video summary based on the clustering the plurality of subshots.
1 Assignment
0 Petitions
Accused Products
Abstract
A video is summarized by determining if a video contains one or more junk frames, modifying one or more boundaries of shots of the video based at least in part on the determination of if the video contains one or more junk frames, sampling a plurality of the shots of the video into a plurality of subshots, clustering the plurality of subshots with a multiple step k-means clustering, and creating a video summary based at least in part on the clustered plurality of subshots. The video is segmented into a plurality of shots and a keyframe from each of the plurality of shots is extracted. A video summary is created based on a determined importance of the subshots in a clustered plurality of subshots and a time budget. The created video summary is rendered by displaying playback rate information for the rendered video summary, displaying a currently playing subshot marker with the rendered video summary, and displaying an indication of similar content in the rendered video summary.
18 Citations
18 Claims
-
1. A method of video summarization comprising:
-
determining if a video contains a junk frame; modifying a boundary of shots of the video based on the determining if a video contains a junk frame; sampling a plurality of the shots of the video into a plurality of subshots; clustering the plurality of subshots with a multiple step k-means clustering, comprising; clustering the plurality of subshots; determining a distance among the plurality of subshots as a weighted summation of a motion compensated matching error, a histogram difference in red, green, blue color space, and a histogram difference in hue, saturation, value color space; determining a distance between a plurality of clusters as a maximum distance between pairs of subshots within the plurality of clusters; merging the plurality of clusters if a determined distance between the plurality of clusters is smaller than a predetermined threshold; and dividing a cluster in the plurality of clusters if a difference in a keyframe file size in the cluster exceeds a predetermined threshold; and creating a video summary based on the clustering the plurality of subshots. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method of video summarization comprising:
-
determining if a video contains a junk frame; modifying a boundary of shots of the video based on the determining if a video contains a junk frame; sampling a plurality of the shots of the video into a plurality of subshots; clustering the plurality of subshots with a multiple step k-means clustering; creating a video summary based on the clustering the plurality of subshots; rendering the video summary; displaying playback rate information for a rendered video summary; displaying a currently playing subshot marker with the rendered video summary; and displaying an indication of similar content in the rendered video summary.
-
-
10. A non-transitory computer readable medium storing computer program instructions for video summarization, which, when executed on a processor, cause the processor to perform a method comprising:
-
determining if a video contains a junk frame; modifying a boundary of shots of the video based on the determining if a video contains a junk frame; sampling a plurality of the shots of the video into a plurality of subshots; clustering the plurality of subshots with a multiple step k-means clustering, comprising; clustering the plurality of subshots; determining a distance among the plurality of subshots as a weighted summation of a motion compensated matching error, a histogram difference in red, green, blue color space, and a histogram difference in hue, saturation, value color space; determining a distance between a plurality of clusters as a maximum distance between pairs of subshots within the plurality of clusters; merging a plurality of clusters if a determined distance between the clusters is smaller than a predetermined threshold; and dividing a cluster in the plurality of clusters if a difference in a keyframe file size in the cluster exceeds a predetermined threshold; and creating a video summary based on the clustering the plurality of subshots. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification