Automatic language model update
First Claim
1. A method for generating a speech recognition model, comprising:
- accessing a baseline speech recognition model;
obtaining information related to recent language usage from search queries; and
modifying the speech recognition model to revise probabilities of a portion of a sound occurrence based on the information.
2 Assignments
0 Petitions
Accused Products
Abstract
A method for generating a speech recognition model includes accessing a baseline speech recognition model, obtaining information related to recent language usage from search queries, and modifying the speech recognition model to revise probabilities of a portion of a sound occurrence based on the information. The portion of a sound may include a word. Also, a method for generating a speech recognition model, includes receiving at a search engine from a remote device an audio recording and a transcript that substantially represents at least a portion of the audio recording, synchronizing the transcript with the audio recording, extracting one or more letters from the transcript and extracting the associated pronunciation of the one or more letters from the audio recording, and generating a dictionary entry in a pronunciation dictionary.
-
Citations
23 Claims
-
1. A method for generating a speech recognition model, comprising:
-
accessing a baseline speech recognition model;
obtaining information related to recent language usage from search queries; and
modifying the speech recognition model to revise probabilities of a portion of a sound occurrence based on the information. - View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
2. The method of claim 1, wherein the portion of the sound comprises a word.
-
14. A method for generating a speech recognition model, comprising:
-
receiving at a search engine from a remote device an audio recording and a transcript that substantially represents at least a portion of the audio recording;
synchronizing the transcript with the audio recording;
extracting one or more letters from the transcript and extracting an associated pronunciation of the one or more letters from the audio recording; and
generating a dictionary entry in a pronunciation dictionary. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21)
-
-
22. The method of claim 14, wherein the dictionary entry includes weightings associated with when the transcript was received.
-
22-1. A computer implemented method for transmitting verbal terms, comprising:
transmitting search terms from a remote device to a server device, wherein the server device generates word occurrence data associated with the search terms and modifies a language model based on the word occurrence data.
-
23. A system for updating a language model, comprising:
-
a request processor to receive search terms;
an extractor for obtaining information related to recent language usage from the search terms; and
means for modifying a language model to revise probabilities of a word occurrence based on the information.
-
-
23-2. The computer implemented method of claim 24, wherein the remote device is selected from a group consisting of a mobile telephone, a personal digital assistant, a desktop computer, and a mobile email device.
Specification