Speech recognition training method for audio and video file indexing on a search engine
First Claim
1. A method for indexing audio/video documents through the use of a search engine, the method comprising:
providing, to the search engine, a source of training documents comprising textual content;
the search engine retrieving at least some of the training documents from the source of training documents;
the search engine extracting the textual content from the retrieved training documents;
the search engine indexing the textual content;
training a speech recognition profile using the indexed textual content;
providing, to the search engine, a source for the audio/video documents each of which comprise an associated audio content;
the search engine retrieving at least some of the audio/video documents from the source of documents;
the search engine extracting the associated content from the audio/video documents;
converting the associated audio content into transcriptions using the trained speech recognition profile;
the search engine indexing the transcriptions thereby resulting in an indexing of the audio/video documents; and
saving the indexed transcriptions;
wherein the training of the speech recognition profile comprises using summary sentences and comparing the number of sentences to a threshold to determine if all sentences will be kept for the training.
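The final limitation of the claim (comparing the number of summary sentences to a threshold to decide whether all of them are kept for training) can be illustrated with a minimal Python sketch. The function name `select_training_sentences`, the parameter `max_sentences`, and the truncation used when the threshold is exceeded are all assumptions made for illustration; the claim itself does not say how sentences are chosen when they are not all kept.

```python
from typing import List


def select_training_sentences(summary_sentences: List[str],
                              max_sentences: int) -> List[str]:
    """Decide which summary sentences are kept for training.

    Hypothetical helper: the claim only states that the sentence count is
    compared to a threshold to determine whether all sentences are kept.
    """
    if max_sentences < 0:
        raise ValueError("max_sentences must be non-negative")

    # At or below the threshold: keep every sentence.
    if len(summary_sentences) <= max_sentences:
        return list(summary_sentences)

    # Above the threshold: ASSUMPTION, keep only the first max_sentences
    # sentences (for example, if the list is already ordered by relevance).
    return list(summary_sentences[:max_sentences])


if __name__ == "__main__":
    sentences = [
        "The search engine indexes audio and video documents.",
        "Training documents supply domain vocabulary.",
        "Transcriptions are indexed like any other text.",
    ]
    for kept in select_training_sentences(sentences, max_sentences=2):
        print(kept)
```

Truncation is only one possible policy; a real system might instead rank or sample the sentences before applying the limit.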
Abstract
A method and a related system to index audio and video documents and to automatically train the language model of a speech recognition system according to the context of the documents being indexed.
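As a rough illustration of what training a language model on the text of the documents being indexed can mean, the sketch below builds bigram counts from a handful of text documents and derives conditional word probabilities from them. This is a toy maximum-likelihood bigram model chosen for brevity, not the method described in the patent; the names `tokenize`, `train_bigram_counts`, and `bigram_probability` are hypothetical.

```python
import re
from collections import Counter
from typing import Iterable, List, Tuple

Bigram = Tuple[str, str]


def tokenize(text: str) -> List[str]:
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def train_bigram_counts(documents: Iterable[str]) -> Counter:
    """Count adjacent word pairs across all documents.

    Each document is wrapped in <s> ... </s> boundary markers.
    """
    counts: Counter = Counter()
    for text in documents:
        tokens = ["<s>"] + tokenize(text) + ["</s>"]
        counts.update(zip(tokens, tokens[1:]))
    return counts


def bigram_probability(counts: Counter, prev: str, word: str) -> float:
    """Maximum-likelihood estimate of P(word | prev); 0.0 if prev is unseen."""
    total = sum(c for (p, _), c in counts.items() if p == prev)
    if total == 0:
        return 0.0
    return counts[(prev, word)] / total


if __name__ == "__main__":
    # ASSUMPTION: these strings stand in for text extracted from indexed
    # training documents.
    docs = ["search engine index", "search engine training"]
    model = train_bigram_counts(docs)
    print(bigram_probability(model, "search", "engine"))
    print(bigram_probability(model, "engine", "index"))
```

A recognizer biased toward word sequences that occur in the indexed corpus would be expected to transcribe domain-specific terms more reliably, which is the intuition behind adapting the language model to the context of the documents.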
16 Claims
1. A method for indexing audio/video documents through the use of a search engine (recited in full under First Claim above).
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
15. A search engine system for indexing audio/video documents comprising:
a search engine for:
receiving a source of training documents comprising textual content;
retrieving at least some of the training documents from the source of training documents;
extracting the textual content from the retrieved training documents;
indexing the textual content;
receiving a source for the audio/video documents each of which comprise an associated audio content;
retrieving at least some of the audio/video documents from the source of documents; and
extracting the associated audio content from the audio/video documents;
a training engine for training a speech recognition profile using the indexed textual content;
a speech recognition engine converting the associated audio content into transcriptions using the trained speech recognition profile;
the search engine further for indexing the transcriptions thereby resulting in an indexing of the audio/video documents; and
an index for saving the indexed transcriptions;
wherein the training of the speech recognition profile comprises using summary sentences and comparing the number of sentences to a threshold to determine if all sentences will be kept for the training.
Dependent claim: 16
Specification