Speech recognition
First Claim
Patent Images
1. A processor implemented method for speech recognition of a speech signal comprising:
- within a processor;
providing at least one codebook comprising codebook entries, in particular, multivariate Gaussians of feature vectors, that are frequency weighted; and
processing the speech signal for speech recognition comprising;
extracting at least one feature vector from the speech signal and matching the feature vector with the entries of the codebook;
providing at least one additional codebook comprising codebook entries, in particular, multivariate Gaussians of feature vectors, without frequency weights;
determining whether the speech signal corresponds to an utterance of a native speaker or to an utterance of a non-native speaker; and
if it is determined that the speech signal corresponds to the utterance of a native speaker, using the at least one additional codebook comprising codebook entries without frequency weights for the speech recognition;
orif it is determined that the speech signal corresponds to the utterance of a non-native speaker, using the at least one codebook comprising codebook entries that are frequency weighted for the speech recognition.
2 Assignments
0 Petitions
Accused Products
Abstract
The present invention relates to a method for speech recognition of a speech signal comprising the steps of providing at least one codebook comprising codebook entries, in particular, multivariate Gaussians of feature vectors, that are frequency weighted such that higher weights are assigned to entries corresponding to frequencies below a predetermined level than to entries corresponding to frequencies above the predetermined level and processing the speech signal for speech recognition comprising extracting at least one feature vector from the speech signal and matching the feature vector with the entries of the codebook.
19 Citations
13 Claims
-
1. A processor implemented method for speech recognition of a speech signal comprising:
-
within a processor; providing at least one codebook comprising codebook entries, in particular, multivariate Gaussians of feature vectors, that are frequency weighted; and processing the speech signal for speech recognition comprising; extracting at least one feature vector from the speech signal and matching the feature vector with the entries of the codebook; providing at least one additional codebook comprising codebook entries, in particular, multivariate Gaussians of feature vectors, without frequency weights; determining whether the speech signal corresponds to an utterance of a native speaker or to an utterance of a non-native speaker; and if it is determined that the speech signal corresponds to the utterance of a native speaker, using the at least one additional codebook comprising codebook entries without frequency weights for the speech recognition;
orif it is determined that the speech signal corresponds to the utterance of a non-native speaker, using the at least one codebook comprising codebook entries that are frequency weighted for the speech recognition. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer program product comprising a non-transitory computer readable medium having computer-executable instructions thereon for speech recognition of a speech signal comprising, the computer-executable instructions comprising:
-
computer code for providing at least one codebook comprising code-book entries, in particular, multivariate Gaussians of feature vectors, that are frequency weighted; and computer code for processing the speech signal for speech recognition comprising; computer code for extracting at least one feature vector from the speech signal and computer code for matching the feature vector with the entries of the codebook; computer code for providing at least one additional codebook comprising codebook entries, in particular, multivariate Gaussians of feature vectors, without frequency weights; computer code for determining whether the speech signal corresponds to an utterance of a native speaker or to an utterance of a non-native speaker; and computer code for using the at least one additional codebook comprising codebook entries without frequency weights for the speech recognition if it is determined that the speech signal corresponds to the utterance of a native speaker; computer code for using the at least one codebook comprising codebook entries that are frequency weighted for the speech recognition if it is determined that the speech signal corresponds to the utterance of a non-native speaker. - View Dependent Claims (8, 9, 10, 11, 12, 13)
-
Specification