VOICE RECOGNITION SYSTEM
First Claim
1. A method comprising:
- receiving, at an automatic speech recognition (ASR) system, a current voice input from a user, the current voice input associated with at least two contexts, each context of the at least two contexts having a respective weight indicating a likelihood that the voice input is associated with the respective context;
generating, by the ASR system, an intermediate recognition result of the current voice input from the user;
adjusting, by the ASR system, the respective weights of the at least two contexts based on the intermediate recognition result; and
transcribing, by the ASR system, the current voice input using a language model, the language model biasing the transcription of the voice input toward one of the at least two contexts based on the adjusted weights.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for voice recognition. In one aspect, a method includes the actions of receiving a voice input; determining a transcription for the voice input, wherein determining the transcription for the voice input includes, for a plurality of segments of the voice input: obtaining a first candidate transcription for a first segment of the voice input; determining one or more contexts associated with the first candidate transcription; adjusting a respective weight for each of the one or more contexts; and determining a second candidate transcription for a second segment of the voice input based in part on the adjusted weights; and providing the transcription of the plurality of segments of the voice input for output.
1 Citation
20 Claims
-
1. A method comprising:
-
receiving, at an automatic speech recognition (ASR) system, a current voice input from a user, the current voice input associated with at least two contexts, each context of the at least two contexts having a respective weight indicating a likelihood that the voice input is associated with the respective context; generating, by the ASR system, an intermediate recognition result of the current voice input from the user; adjusting, by the ASR system, the respective weights of the at least two contexts based on the intermediate recognition result; and transcribing, by the ASR system, the current voice input using a language model, the language model biasing the transcription of the voice input toward one of the at least two contexts based on the adjusted weights. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. An automatic speech recognition (ASR) system comprising:
-
data processing hardware; and memory hardware in communication with the data processing hardware and storing instructions that when executed on the data processing hardware cause the data processing hardware to perform operations comprising; receiving a current voice input from a user, the current voice input associated with at least two contexts, each context of the at least two contexts having a respective weight indicating a likelihood that the voice input is associated with the respective context; generating an intermediate recognition result of the current voice input from the user; adjusting the respective weights of the at least two contexts based on the intermediate recognition result; and transcribing the current voice input using a language model, the language model biasing the transcription of the voice input toward one of the at least two contexts based on the adjusted weights. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification