Multi-Microphone Speech Recognition Systems and Related Techniques
1 Assignment
0 Petitions
Abstract
A speech recognition system for resolving impaired utterances can have a speech recognition engine configured to receive a plurality of representations of an utterance and concurrently to determine a plurality of highest-likelihood transcription candidates corresponding to each respective representation of the utterance. The recognition system can also have a selector configured to determine a most-likely accurate transcription from among the transcription candidates. As but one example, the plurality of representations of the utterance can be acquired by a microphone array, and beamforming techniques can generate independent streams of the utterance across various look directions using output from the microphone array.
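The arrangement described in the abstract can be illustrated with a minimal sketch. It assumes integer-sample delay-and-sum beamforming as the beamforming technique, which the abstract does not specify, and it treats the recognizer as a caller-supplied function returning a transcription and a score. The names `Candidate`, `delay_and_sum`, `recognize_all`, `select_transcription`, and `toy_recognize` are hypothetical and do not come from the patent.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple


@dataclass
class Candidate:
    look_direction: int    # index of the beam that produced this candidate
    text: str              # hypothesized transcription
    log_likelihood: float  # recognizer score (higher is better)


def delay_and_sum(channels: Sequence[Sequence[float]],
                  delays: Sequence[int]) -> List[float]:
    """Delay each microphone channel by a whole number of samples and
    average the aligned samples (assumed beamformer, not the patent's)."""
    n = len(channels[0])
    out = []
    for t in range(n):
        acc = 0.0
        for channel, delay in zip(channels, delays):
            idx = t - delay
            if 0 <= idx < n:  # samples shifted past the edges count as zero
                acc += channel[idx]
        out.append(acc / len(channels))
    return out


def recognize_all(channels: Sequence[Sequence[float]],
                  steering_delays: Sequence[Sequence[int]],
                  recognize: Callable[[List[float]], Tuple[str, float]]
                  ) -> List[Candidate]:
    """Form one stream per look direction and run the recognizer on all
    streams concurrently, yielding one candidate per stream."""
    beams = [delay_and_sum(channels, d) for d in steering_delays]
    with ThreadPoolExecutor() as pool:
        results = list(pool.map(recognize, beams))  # order is preserved
    return [Candidate(i, text, score)
            for i, (text, score) in enumerate(results)]


def select_transcription(candidates: Sequence[Candidate]) -> Candidate:
    """Selector: keep the candidate with the highest recognizer score."""
    return max(candidates, key=lambda c: c.log_likelihood)


def toy_recognize(beam: List[float]) -> Tuple[str, float]:
    """Stand-in recognizer: uses beam energy as a placeholder score."""
    return "placeholder", sum(x * x for x in beam)
```

In this sketch, a two-microphone capture such as `[[0, 1, 0, 0], [0, 0, 1, 0]]` with steering delays `[[0, 0], [1, 0]]` produces two streams, and the selector favors the second look direction because its delays align the two channels. A real system would replace `toy_recognize` with an actual speech recognizer and would likely use fractional delays or a more capable beamformer.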
191 Citations
31 Claims
1. A speech recognition system for resolving far-field utterances, comprising:
   a recognition engine configured to concurrently receive a plurality of representations of an utterance and to determine a highest-probability representation of the utterance; and
   an utterance decoder configured to determine a most-likely transcription corresponding to the highest-probability representation of the utterance.
   Dependent claims: 2, 3, 4, 9, 10, 11, 12, 13, 14, 15, 16, 17
5. (canceled)
6. (canceled)
7. (canceled)
8. (canceled)
18. A speech-recognition method, comprising:
   selecting a highest-probability representation of an utterance from a plurality of concurrently generated representations of the utterance; and
   determining a most-likely transcription of the utterance in correspondence to the highest-probability representation of the utterance.
   Dependent claims: 19, 20
21. (canceled)
22. (canceled)
23. (canceled)
25. A non-transitory, computer-readable media containing instructions that, when executed by a processor, cause a computing environment to perform a speech recognition method comprising:
   selecting a highest-probability representation of an utterance from a plurality of concurrently generated representations of the utterance; and
   determining a most-likely transcription of the utterance in correspondence to the highest-probability representation of the utterance.
   Dependent claims: 26, 27
28. (canceled)
29. (canceled)
30. (canceled)
31. (canceled)
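The two steps recited in the method claims (select a highest-probability representation, then determine a transcription for it) can be sketched as follows. This is only an illustration under stated assumptions: the claims do not say how a representation's probability is computed or how decoding works, so both are passed in as caller-supplied functions, and every name here (`select_representation`, `transcribe`, `score`, `decode`) is hypothetical.

```python
from typing import Callable, Sequence, Tuple, TypeVar

R = TypeVar("R")  # whatever type a "representation" takes in a given system


def select_representation(representations: Sequence[R],
                          score: Callable[[R], float]) -> Tuple[int, R]:
    """Step 1: pick the representation with the highest score.

    `score` is an assumed stand-in for the probability estimate.
    """
    if not representations:
        raise ValueError("at least one representation is required")
    best_index = max(range(len(representations)),
                     key=lambda i: score(representations[i]))
    return best_index, representations[best_index]


def transcribe(representations: Sequence[R],
               score: Callable[[R], float],
               decode: Callable[[R], str]) -> str:
    """Step 2: decode only the selected representation."""
    _, best = select_representation(representations, score)
    return decode(best)


# Toy usage: each representation carries a precomputed log-probability
# and the text a decoder would produce for it (illustrative data only).
reps = [
    {"logp": -4.0, "text": "wreck a nice beach"},
    {"logp": -1.5, "text": "recognize speech"},
]
result = transcribe(reps, score=lambda r: r["logp"],
                    decode=lambda r: r["text"])
```

One consequence of this ordering is that the decoder runs once, on the selected representation, rather than once per representation.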
Specification