Method and apparatus including parallell processes for voice recognition
First Claim
1. A computer-implemented method comprising:
- defining, in an automated speech recognizer in which audio data is processed by a signal conditioning stage followed by a noise suppression stage followed by a language modeling stage, the signal conditioning stage including quantity i processing alternatives, the noise suppression stage including quantity j processing alternatives, and the language modeling stage including quantity k processing alternatives, quantity (i*j*k) alternative paths for processing the audio data through the multiple stages of the automated speech recognizer, i, j, and k being greater than one;
generating, for each of the quantity (i*j*k) alternative paths, a transcription of particular audio data based on processing the particular audio data through each of the stages of the automated speech recognizer according to the alternative path; and
selecting a particular transcription from among the respective transcriptions that are generated for the quantity (i*j*k) alternative paths; and
providing the particular transcription for output.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and apparatus for voice recognition performed in a voice recognition block comprising a plurality of voice recognition stages. The method includes receiving a first plurality of voice inputs, corresponding to a first phrase, into a first voice recognition stage of the plurality of voice recognition stages, wherein multiple ones of the voice recognition stages includes a plurality of voice recognition modules and multiples ones of the voice recognition stages perform a different type of voice recognition processing, wherein the first voice recognition stage processes the first plurality of voice inputs to generate a first plurality of outputs for receipt by a subsequent voice recognition stage. The method further includes, receiving by each subsequent voice recognition stage a plurality of outputs from a preceding voice recognition stage, wherein a plurality of final outputs is generated by a final voice recognition stage from which to approximate the first phrase.
57 Citations
15 Claims
-
1. A computer-implemented method comprising:
-
defining, in an automated speech recognizer in which audio data is processed by a signal conditioning stage followed by a noise suppression stage followed by a language modeling stage, the signal conditioning stage including quantity i processing alternatives, the noise suppression stage including quantity j processing alternatives, and the language modeling stage including quantity k processing alternatives, quantity (i*j*k) alternative paths for processing the audio data through the multiple stages of the automated speech recognizer, i, j, and k being greater than one; generating, for each of the quantity (i*j*k) alternative paths, a transcription of particular audio data based on processing the particular audio data through each of the stages of the automated speech recognizer according to the alternative path; and selecting a particular transcription from among the respective transcriptions that are generated for the quantity (i*j*k) alternative paths; and providing the particular transcription for output. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A system comprising:
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; defining, in an automated speech recognizer in which audio data is processed by a signal conditioning stage followed by a noise suppression stage followed by a language modeling stage, the signal conditioning stage including quantity i processing alternatives, the noise suppression stage including quantity j processing alternatives, and the language modeling stage including quantity k processing alternatives, quantity (i*j*k) alternative paths for processing the audio data through the multiple stages of the automated speech recognizer, i, j, and k being greater than one; generating, for each of the quantity (i*j*k) alternative paths, a transcription of particular audio data based on processing the particular audio data through each of the stages of the automated speech recognizer according to the alternative path; and selecting a particular transcription from among the respective transcriptions that are generated for the quantity (i*j*k) alternative paths signal conditioning stage; and providing the particular transcription for output. - View Dependent Claims (7, 8, 9, 10)
-
11. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
-
defining, in an automated speech recognizer in which audio data is processed by a signal conditioning stage followed by a noise suppression stage followed by a language modeling stage, the signal conditioning stage including quantity i processing alternatives, the noise suppression stage including quantity j processing alternatives, and the language modeling stage including quantity k processing alternatives, quantity (i*j*k) alternative paths for processing the audio data through the multiple stages of the automated speech recognizer, i, j, and k being greater than one; generating, for each of the quantity (i*j*k) alternative paths, a transcription of particular audio data based on processing the particular audio data through each of the stages of the automated speech recognizer according to the alternative path; and selecting a particular transcription from among the respective transcriptions that are generated for the quantity (i*j*k) alternative paths; and providing the particular transcription for output. - View Dependent Claims (12, 13, 14, 15)
-
Specification