Speech recognition using speech characteristic probabilities
7 Assignments
0 Petitions
Abstract
A speech recognition module includes an acoustic front-end module, a sound detection module, and a word detection module. The acoustic front-end module generates a plurality of representations of frames from a digital audio signal and generates speech characteristic probabilities for the plurality of frames. The sound detection module determines a plurality of estimated utterances from the plurality of representations and the speech characteristic probabilities. The word detection module determines one or more words based on the plurality of estimated utterances and the speech characteristic probabilities.
26 Citations
19 Claims
1. An acoustic front-end device comprising:
a frame parser receiving an acoustic signal and parsing the received acoustic signal into a plurality of frames;
a plurality of correlators, each of the correlators correlating each of the plurality of frames with one or more acoustic property sets to produce a first set of acoustic property correlations;
a controller retrieving one or more speech characteristic samples from one or more speech codebooks based on the first set of acoustic property correlations; and
a speech characteristic probability generator configured to:
generate a plurality of speech characteristic probabilities over one or more subsequent frames of the plurality of frames by individually correlating a digital signal component of the subsequent frame with the one or more acoustic property sets to produce a second set of acoustic property correlations; and
a processor configured to interpret the plurality of speech characteristic probabilities to generate at least a language probability and a language syntax bias; and
select a plurality of words from a series of plurality of words based on the language probability and the language syntax bias; and
output the plurality of words through an interface.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11.
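The frame parser and correlator elements of claim 1 can be illustrated with a minimal sketch. The claim does not specify a frame length, a hop size, or a correlation measure, so the fixed-length framing, the `hop` parameter, the normalized (cosine-style) correlation, and the two template vectors below are all assumptions chosen for illustration. The function names are hypothetical.

```python
import math

def parse_frames(signal, frame_len, hop):
    """Split a sampled signal into fixed-length frames (assumed framing scheme)."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, hop)]

def normalized_correlation(a, b):
    """Cosine-style correlation in [-1, 1]; 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def correlate_frames(frames, property_sets):
    """For each frame, correlate against every acoustic property template."""
    return [[normalized_correlation(f, p) for p in property_sets] for f in frames]

# Toy input: a short periodic signal and two hypothetical property templates.
signal = [0.0, 1.0, 0.0, -1.0, 0.0, 1.0, 0.0, -1.0]
property_sets = [
    [0.0, 1.0, 0.0, -1.0],  # hypothetical template A (matches the signal shape)
    [1.0, 1.0, 1.0, 1.0],   # hypothetical template B (constant offset)
]

frames = parse_frames(signal, frame_len=4, hop=4)
correlations = correlate_frames(frames, property_sets)
```

Here each row of `correlations` plays the role of a set of acoustic property correlations for one frame. A real front end would likely use overlapping, windowed frames and hardware correlators rather than Python lists.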
12. A method for frame-by-frame analyzing of an acoustic signal to determine speech characteristics comprising:
configuring a processor to:
parse the acoustic signal into a plurality of frames;
correlate each of the plurality of frames with one or more acoustic property sets to produce a first set of acoustic property correlations and first index;
retrieve one or more first speech characteristic samples from one or more speech codebooks based on the first index; and
generate a plurality of speech characteristic probabilities for a subsequent frame of the plurality of frames by:
individually correlating a digital signal component of the subsequent frame with the one or more acoustical property sets to produce a second set of acoustic property correlations and a second index;
addressing the one or more speech codebooks based on the second index to retrieve a second speech characteristic sample; and
correlating the digital signal component with the second speech characteristic sample to produce the speech characteristic probability for the subsequent frame; and
retrieving and outputting words based on a language probability and a language syntax bias of the plurality of speech characteristic probabilities.
Dependent claims: 13, 14, 15, 16, 17.
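The per-frame steps recited in claim 12 (correlate, derive an index, address a codebook, correlate again to obtain a probability) can be sketched as follows. The claim does not say how the index is derived from the correlations or how a correlation becomes a probability, so two assumptions are made here: the index is the position of the strongest correlation, and the final correlation is rescaled from [-1, 1] to [0, 1]. The codebook contents and all function names are hypothetical.

```python
import math

def normalized_correlation(a, b):
    """Cosine-style correlation in [-1, 1]; 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def index_from_correlations(correlations):
    """Assumption: the codebook index is the position of the strongest correlation."""
    return max(range(len(correlations)), key=lambda i: correlations[i])

def speech_characteristic_probability(frame, property_sets, codebook):
    """Correlate, index the codebook, then correlate against the retrieved sample."""
    correlations = [normalized_correlation(frame, p) for p in property_sets]
    index = index_from_correlations(correlations)
    sample = codebook[index]  # "addressing the codebook based on the index"
    score = normalized_correlation(frame, sample)
    # Assumption: map the correlation score from [-1, 1] onto [0, 1].
    return index, (score + 1.0) / 2.0

# Hypothetical property templates and codebook entries, for illustration only.
property_sets = [[1.0, 0.0, -1.0, 0.0], [1.0, 1.0, 1.0, 1.0]]
codebook = [
    [2.0, 0.0, -2.0, 0.0],  # sample stored under index 0
    [0.5, 0.5, 0.5, 0.5],   # sample stored under index 1
]

frame = [0.9, 0.1, -1.1, 0.0]
index, probability = speech_characteristic_probability(frame, property_sets, codebook)
```

With this toy frame the first template correlates most strongly, so the first codebook entry is retrieved and the resulting probability is close to 1.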
18. An acoustic front-end device comprising:
a frame parser receiving an acoustic signal and parsing the received acoustic signal into a plurality of frames;
a plurality of correlators, each of the correlators correlating each of the plurality of frames with the one or more acoustic property sets to produce a first set of acoustic property correlations;
a controller, the controller retrieving one or more first speech characteristic samples from one or more speech codebooks based on the acoustic property set correlations;
a speech characteristic probability generator generating speech characteristic probabilities for a subsequent frame of the plurality of frames by:
individually correlating a digital signal component of the subsequent frame with the one or more acoustical property sets to produce a second set of acoustic property correlations;
addressing the one or more speech codebooks based on the second set of acoustic property correlations to retrieve a second speech characteristic sample; and
correlating the digital signal component with the second speech characteristic sample to produce the speech characteristic probability for the subsequent frame; and
a processor, the processor:
interpreting the plurality of speech characteristic probabilities to generate at least a language probability, word bias, and language syntax bias;
selecting a plurality of words from a series of plurality of words based on the word bias and language probability; and
selecting a plurality of language syntaxes from a series of plurality of language syntaxes based on the language probability.
Dependent claim: 19.
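The word-selection step of the processor in claim 18 can be illustrated with a small sketch. The claim only states that words are selected "based on the word bias and language probability"; it does not say how the two are combined. Multiplying them and taking the highest-scoring candidate is an assumption made here, and the `Candidate` type, the `select_words` function, and the numeric scores are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Candidate:
    """One candidate word sequence with its (hypothetical) scores."""
    words: tuple
    language_probability: float
    word_bias: float

def select_words(candidates):
    """Assumption: pick the candidate maximizing language_probability * word_bias."""
    best = max(candidates, key=lambda c: c.language_probability * c.word_bias)
    return best.words

# Made-up scores for two competing transcriptions of the same audio.
candidates = [
    Candidate(("recognize", "speech"), language_probability=0.6, word_bias=0.9),
    Candidate(("wreck", "a", "nice", "beach"), language_probability=0.7, word_bias=0.4),
]

best = select_words(candidates)
```

In this toy case the first candidate wins because its combined score (0.6 × 0.9) exceeds the second (0.7 × 0.4), even though the second has the higher language probability on its own.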
Specification