Method and apparatus for updating speech recognition databases and reindexing audio and video content using the same
First Claim
1. A method for reindexing media content for search applications, comprising:
- providing a speech recognition database that include entries defining acoustical representations for a plurality of words;
providing a searchable database containing a plurality of metadata documents descriptive of a plurality of media resources, each of the plurality of metadata documents including a sequence of speech recognized text indexed using the speech recognition database;
updating the speech recognition database with at least one word candidate; and
reindexing the sequence of speech recognized text for a subset of the plurality of metadata documents using the updated speech recognition database, the subset of metadata documents including metadata documents having a sequence of speech recognized text generated before the speech recognition database was updated with the at least one word candidate.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and apparatus for reindexing media content for search applications that includes steps and structure for providing a speech recognition database that include entries defining acoustical representations for a plurality of words; providing a searchable database containing a plurality of metadata documents descriptive of a plurality of media resources, each of the plurality of metadata documents including a sequence of speech recognized text indexed using the speech recognition database; updating the speech recognition database with at least one word candidate; and reindexing the sequence of speech recognized text for a subset of the plurality of metadata documents using the updated speech recognition database.
305 Citations
30 Claims
-
1. A method for reindexing media content for search applications, comprising:
-
providing a speech recognition database that include entries defining acoustical representations for a plurality of words;
providing a searchable database containing a plurality of metadata documents descriptive of a plurality of media resources, each of the plurality of metadata documents including a sequence of speech recognized text indexed using the speech recognition database;
updating the speech recognition database with at least one word candidate; and
reindexing the sequence of speech recognized text for a subset of the plurality of metadata documents using the updated speech recognition database, the subset of metadata documents including metadata documents having a sequence of speech recognized text generated before the speech recognition database was updated with the at least one word candidate. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28)
-
-
29. An apparatus for reindexing media content for search applications, comprising:
-
a speech recognition database that includes entries defining acoustical representations for a plurality of words;
a searchable database containing a plurality of metadata documents descriptive of a plurality of media resources, a media indexer that generates a sequence of speech recognized text included in each of the plurality of metadata documents using the speech recognition database;
an update module that updates the speech recognition database with at least one word candidate; and
a reindexing module that causes the media indexer to reindex the sequence of speech recognized text for a subset of the plurality of metadata documents using the updated speech recognition database, the subset of metadata documents including metadata documents having a sequence of speech recognized text generated before the speech recognition database was updated with the at least one word candidate.
-
-
30. An apparatus for reindexing media content for search applications, comprising:
-
means for providing a speech recognition database that include entries defining acoustical representations for a plurality of words;
means for providing a searchable database containing a plurality of metadata documents descriptive of a plurality of media resources, each of the plurality of metadata documents including a sequence of speech recognized text indexed using the speech recognition database;
means for updating the speech recognition database with at least one word candidate; and
means for reindexing the sequence of speech recognized text for a subset of the plurality of metadata documents using the updated speech recognition database, the subset of metadata documents including metadata documents having a sequence of speech recognized text generated before the speech recognition database was updated with the at least one word candidate.
-
Specification