Adjusting language models
First Claim
1. A computer-implemented method, comprising:
- accessing audio data;
accessing information that indicates a first context, the first context comprising a first physical environment or physical state of a device that records the audio data;
accessing at least one term;
accessing information that indicates a second context, the second context comprising a second physical environment or physical state associated with the accessed term;
determining a similarity score that indicates a degree of similarity between the first physical environment or physical state and the second physical environment or physical state;
adjusting a language model based on the accessed term and the determined similarity score to generate an adjusted language model, wherein the adjusted language model includes the accessed term and a weighting value assigned to the accessed term based on the similarity score; and
performing speech recognition on the audio data using the adjusted language model to select one or more candidate transcriptions for a portion of the audio data.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatuses, including computer programs encoded on a computer storage medium, for adjusting language models. In one aspect, a method includes accessing audio data. Information that indicates a first context is accessed, the first context being associated with the audio data. At least one term is accessed. Information that indicates a second context is accessed, the second context being associated with the term. A similarity score is determined that indicates a degree of similarity between the second context and the first context. A language model is adjusted based on the accessed term and the determined similarity score to generate an adjusted language model. Speech recognition is performed on the audio data using the adjusted language model to select one or more candidate transcriptions for a portion of the audio data.
-
Citations
29 Claims
-
1. A computer-implemented method, comprising:
-
accessing audio data; accessing information that indicates a first context, the first context comprising a first physical environment or physical state of a device that records the audio data; accessing at least one term; accessing information that indicates a second context, the second context comprising a second physical environment or physical state associated with the accessed term; determining a similarity score that indicates a degree of similarity between the first physical environment or physical state and the second physical environment or physical state; adjusting a language model based on the accessed term and the determined similarity score to generate an adjusted language model, wherein the adjusted language model includes the accessed term and a weighting value assigned to the accessed term based on the similarity score; and performing speech recognition on the audio data using the adjusted language model to select one or more candidate transcriptions for a portion of the audio data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 24, 25, 26, 27)
-
-
12. A system comprising:
-
one or more computers; and a computer-readable medium coupled to the one or more computers having instructions stored thereon which, when executed by the one or more computers, cause the one or more computers to perform operations comprising; accessing audio data; accessing information that indicates a first context, the first context comprising a first physical environment or physical state of a device that records the audio data; accessing at least one term; accessing information that indicates a second context, the second context comprising a second physical environment or physical state associated with the accessed term; determining a similarity score that indicates a degree of similarity between the first physical environment or physical state and the second physical environment or physical state; adjusting a language model based on the accessed term and the determined similarity score to generate an adjusted language model, wherein the adjusted language model includes the accessed term and a weighting value assigned to the accessed term based on the similarity score; and performing speech recognition on the audio data using the adjusted language model to select one or more candidate transcriptions for a portion of the audio data. - View Dependent Claims (13, 14, 15, 16, 28, 29)
-
-
17. A non-transitory computer storage medium encoded with a computer program, the program comprising instructions that when executed by one or more computers cause the one or more computers to perform operations comprising:
-
accessing audio data; accessing information that indicates a first context, the first context comprising a first physical environment or physical state of a device that records the audio data; accessing at least one term; accessing information that indicates a second context, the second context comprising a second physical environment or physical state associated with the accessed term; determining a similarity score that indicates a degree of similarity between the first physical environment or physical state and the second physical environment or physical state; adjusting a language model based on the accessed term and the determined similarity score to generate an adjusted language model, wherein the adjusted language model includes the accessed term and a weighting value assigned to the accessed term based on the similarity score; and performing speech recognition on the audio data using the adjusted language model to select one or more candidate transcriptions for a portion of the audio data. - View Dependent Claims (18, 19, 20)
-
-
21. A computer-implemented method comprising:
-
transmitting, at a client device, audio data to a server system; identifying a first context of the client device, the first context comprising a first physical environment or physical state of the client device; transmitting information indicating the first context to the server system; and receiving, at the client device, a transcription of at least a portion of the audio data at the client device, the server system having accessed at least one term, accessed information that indicates a second context, the second context comprising a second physical environment or physical state associated with the accessed term, determined a similarity score that indicates a degree of similarity between the first physical environment or physical state and the second physical environment or physical state, adjusted a language model based on the accessed term and the determined similarity score to generate an adjusted language model, wherein the adjusted language model includes the accessed term and a weighting value assigned to the accessed term based on the similarity score, performed speech recognition on the audio data using the adjusted language model to select one or more candidate transcriptions for a portion of the audio data, and transmitted the transcription to the client device. - View Dependent Claims (22, 23)
-
Specification