Methods and systems for real-time storyboarding with a web page and graphical user interface for automatic video parsing and browsing
First Claim
1. A real-time storyboarding system, comprising:
- a frame capture device;
a command detector that detects if the input multimedia image data signal contains a command data that indicates key representative frames, the command data is embedded in a closed-caption portion of an input multimedia image data signal in addition to a closed-caption data to be displayed;
a command decoder that decodes the detected command data; and
an image significance determiner that determines in response to the command detector if there is at least one representative frame in the input multimedia image data signal, and that, if at least one representative frame is found, outputs the at least representative frame.
11 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods to enable real-time and near real-time storyboarding on the World Wide Web in addition to a graphical user interface for video parsing and browsing the of the storyboard. Specifically, storyboarding can be accomplished on the World Wide Web by parsing an input video into representative or key frames. These frames then can be posted to a web document, or the like, for subsequent viewing by a user. This allows a video to be distilled down to the essential frames thus eliminating storage and bandwidth problems as well as eliminating the need for a user to view the entirety of the video. Furthermore, the graphical user interface allows a user to visually interact with an input video signal to determine the key or representative frames, or to retrieve video segments associated with already determined key frames. Furthermore, the interface allows manipulation of these frames including, but not limited to, playing of the entire segment represented by that key or significant frame as well as actual determining of the cuts between significant segments.
125 Citations
22 Claims
-
1. A real-time storyboarding system, comprising:
-
a frame capture device;
a command detector that detects if the input multimedia image data signal contains a command data that indicates key representative frames, the command data is embedded in a closed-caption portion of an input multimedia image data signal in addition to a closed-caption data to be displayed;
a command decoder that decodes the detected command data; and
an image significance determiner that determines in response to the command detector if there is at least one representative frame in the input multimedia image data signal, and that, if at least one representative frame is found, outputs the at least representative frame. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
6. The system of claim 5, wherein, if the change in intensity between the at least two frames is greater than a predefined threshold, one of the current frame and the next frame is identified as a representative frame.
-
7. The system of claim 1, wherein the at least one representative frame is output to a device capable of displaying the at least one representative frame.
-
8. The system of claim 1, wherein the at least one representative frame is output to a printer.
-
9. The system of claim 1, wherein the at least one representative frame is output to a web document.
-
10. The system of claim 9, wherein the web document is updated once a threshold number of representative frames are determined.
-
11. The system of claim 1, wherein at least a portion of the input multimedia image data signal is analyzed before the at least one representative frame is output.
-
12. The system of claim 11, wherein the entire input multimedia image data signal is analyzed before the at least one representative frame is output.
-
13. A method for real-time storyboarding, comprising:
-
selecting at least a portion of an input multimedia image data signal;
determining whether a command data is present such that the command data is embedded in a closed-caption portion of the input multimedia image data signal in addition to a closed-caption data to be displayed;
the command data indicates key representative frames, wherein if command data is present, decoding the command data to determine at least one representative frame;
determining in response to the command data is present the at least one representative frame of the selected portion of the input multimedia image data signal;
adding the at least one representative frame to a document; and
outputting the document. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22)
determining whether command data is present comprises determining if speaker data identifying at least one representative frame is present; and
if speaker data identifying at least one representative frame is present, adding the at least one identified representative frame to the document.
-
-
15. The method of claim 13, wherein:
-
determining whether command data is present comprises determining if at least one special characters identifying at least one representative frame is present; and
if at least one special character identifying at least one representative frame is present, adding the at least one identified representative frame to the document.
-
-
16. The method of claim 13, wherein:
-
determining whether command data is present comprises determining if a number of at least one special character identifies at least one representative frame; and
if the number of the at least one special character identifies at least one representative frame, adding the at least one representative frame to the document.
-
-
17. The method of claim 13, wherein:
-
determining whether command data is present comprises determining if a time lapse in the selected portion of the input multimedia image data signal since a last representative frame is greater than a threshold;
if the time lapse is greater than the threshold, identifying at least one additional frame as a representative frame of the selected portion of the input multimedia image data signal; and
adding the at least one identified frame to the document.
-
-
18. The method of claim 13, further comprising:
-
determining a frame difference between a pair of frames; and
if the frame difference exceeds a threshold, identifying one of the pair of frames as a representative frame of the selected portion of the input multimedia image data signal.
-
-
19. The method of claim 18, wherein the threshold is an intensity threshold.
-
20. The method of claim 19, wherein the intensity E(ti) for a current frame occurring at a time t=ti, relative to a next frame occurring at time t=ti+1, is:
-
21. The method of claim 20, wherein if a change in intensity between the current frame and the next frame is greater than a predefined threshold, at least one of the frames is identified as a representative frame.
-
22. The method of claim 13, further comprising:
-
determining whether a threshold number of representative frames for the document have been identified; and
if the threshold is reached, selecting a new document; and
adding the at least one representative frame exceeding the threshold to the new document.
-
Specification