Browsing and retrieval of full broadcast-quality video
First Claim
Patent Images
1. A method comprising steps of:
- classifying a media collection comprising program content and television commercials as classified program content versus classified television commercials, wherein the television commercials are presented chronologically separate from the program content and in a same display screen as the program content;
identifying program segments within the classified program content based on synchronizing recognized speech in each identified segment with captioning to extract stories and to yield synchronized speech, wherein synchronizing recognized speech comprises;
recognizing speech in the media collection to yield recognized speech,generating word timestamps based on the recognized speech, andaligning the captioning with the recognized speech based on the word timestamps;
indexing the media collection to create an indexed library based on the program segments to yield an indexed media collection;
receiving at a server a search query to the indexed media collection from a user;
searching the indexed library to identify a set of candidate program segments based on the search query; and
presenting at a client device of the user the set of candidate program segments for the user to browse and select.
1 Assignment
0 Petitions
Accused Products
Abstract
A method includes steps of indexing a media collection, searching an indexed library and browsing a set of candidate program segments. The step of indexing a media collection creates the indexed library based on a content of the media collection. The step of searching the indexed library identifies the set of candidate program segments based on a search criteria. The step of browsing the set of candidate program segments selects a segment for viewing.
-
Citations
7 Claims
-
1. A method comprising steps of:
-
classifying a media collection comprising program content and television commercials as classified program content versus classified television commercials, wherein the television commercials are presented chronologically separate from the program content and in a same display screen as the program content; identifying program segments within the classified program content based on synchronizing recognized speech in each identified segment with captioning to extract stories and to yield synchronized speech, wherein synchronizing recognized speech comprises; recognizing speech in the media collection to yield recognized speech, generating word timestamps based on the recognized speech, and aligning the captioning with the recognized speech based on the word timestamps; indexing the media collection to create an indexed library based on the program segments to yield an indexed media collection; receiving at a server a search query to the indexed media collection from a user; searching the indexed library to identify a set of candidate program segments based on the search query; and presenting at a client device of the user the set of candidate program segments for the user to browse and select. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system for video indexing and delivery, the system comprising:
-
a first module configured to classify video into program segments versus television commercial segments, wherein the television commercials are presented chronologically separate from the program content and in a same display screen as the program content; a second module configured to identify speaker segments within the program segments based on speaker voice characteristics to yield identified speaker segments; a third module configured to extract stories from the identified speaker segments based on synchronizing speech with closed captioning of the identified speaker segments, the synchronizing based on; recognizing speech in the video, generating word timestamps based on the recognized speech, and aligning the captioning with the recognized speech based on the word timestamps; a fourth module configured to receive a natural language query; a fifth module configured to select key frames from identified speech segments in response to the query; and a sixth module configured to present the key forms to a user device for browsing by the user.
-
Specification