Speech recognition system and method which permits a speaker's utterance to be recognized using a hidden markov model with subsequent calculation reduction
First Claim
Patent Images
1. A speech recognition system comprising:
- sound analyzing means for sound analyzing an input speech signal to obtain a plurality of two-dimensional feature parameters;
a phonetic segment dictionary for storing a plurality of types of phonetic segments including continuant segments, consonant segments, and boundary segments;
means, coupled to the sound analyzing means and the phonetic segment dictionary, for matrix-quantizing the plurality of two-dimensional features parameters obtained by the sound analyzing means using the plurality of types of phonetic segments stored in the phonetic segment dictionary to obtain a phonetic segment similarity vector sequence;
means for generating a phonemic feature vector sequence, wherein the phonemic feature vector sequence is comprised of elements, wherein each element represents similarity of the input to a stored phonetic segment;
verifying means, coupled to the generating means, for verifying the phonemic feature vector sequence obtained by the generating means by comparing the phonemic feature vector sequence with a previously stored continuous hidden markov model (HHM); and
means for recognizing speech based on a result of verifying the phonemic feature vector sequence.
0 Assignments
0 Petitions
Accused Products
Abstract
A sound analyzer sound analyzes an input speech signal to obtain feature vectors. A matrix quantizer performs a matrix quantization process between the feature vectors obtained by the sound analyzer and a phonetic segment dictionary prepared in phonetic segment units to obtain a phonetic segment similarity sequence. A PS-phoneme integrating section integrates the phonetic segment similarity sequence into a phonemic feature vector. A HMM recognizer checks the phonemic feature vector using a HMM prepared in certain units, to thereby perform a recognition process.
-
Citations
6 Claims
-
1. A speech recognition system comprising:
-
sound analyzing means for sound analyzing an input speech signal to obtain a plurality of two-dimensional feature parameters; a phonetic segment dictionary for storing a plurality of types of phonetic segments including continuant segments, consonant segments, and boundary segments; means, coupled to the sound analyzing means and the phonetic segment dictionary, for matrix-quantizing the plurality of two-dimensional features parameters obtained by the sound analyzing means using the plurality of types of phonetic segments stored in the phonetic segment dictionary to obtain a phonetic segment similarity vector sequence; means for generating a phonemic feature vector sequence, wherein the phonemic feature vector sequence is comprised of elements, wherein each element represents similarity of the input to a stored phonetic segment; verifying means, coupled to the generating means, for verifying the phonemic feature vector sequence obtained by the generating means by comparing the phonemic feature vector sequence with a previously stored continuous hidden markov model (HHM); and means for recognizing speech based on a result of verifying the phonemic feature vector sequence. - View Dependent Claims (2, 3, 5, 6)
-
-
4. A speech recognition method comprising the computser steps of:
-
sound analyzing an input speech signal to obtain a plurality of two-dimensional feature parameters; matrix-quantizing the feature parameters obtained by the sound analyzing step using a phonetic segment dictionary storing a plurality of types of phonetic segments including continuant segments, consonant segments and boundary segments, to obtain a phonetic segment similarity vector sequence; generating a phonemic feature vector sequence from the phonetic segment similarity vector sequence; verifying the phonemic feature vector obtained by the generating step by comparing the phonemic feature vector sequence with a previously stored continuous hidden markov model (HMM); and recognizing speech based on a result of verifying the phonemic feature vector sequence.
-
Specification