Method of speaker adaptive speech recognition
First Claim
1. A method for recognizing spoken words of a speech, comprising the steps of:
- extracting feature vectors from a speech signal corresponding to a spoken phrase to be recognized;
segmenting and classifying the successive extracted feature vectors into syllable oriented word subunits by means of a stored supply of word subunits to form a set of hypotheses;
comparing the set of hypotheses formed from the segmented and classified word subunits with standard pronunciations and pronunciation variants of a plurality of words stored in a reference pattern vocabulary over a three-dimensional time dynamic period to generate a set of word hypotheses; and
subjecting the generated set of word hypotheses to syntactic analysis in order to determine the spoken phrase.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for recognizing spoken words of a speech includes extracting feature vectors from a speech signal which corresponds to a spoken phrase, and segmenting and classifying the successive extracted feature vectors into syllable oriented word subunits by means of a stored supply of word subunits to form a set of hypotheses. The set of hypotheses is used to generate, by three dimensional time dynamic comparision, a set of word hypotheses by comparing the segmented and classified word subunits with standard pronunciations and pronunciation variants of a plurality of words stored in a reference pattern vocabulary. The generated set of word hypotheses are then subjected to syntactic analysis to determine the spoken phrase.
-
Citations
8 Claims
-
1. A method for recognizing spoken words of a speech, comprising the steps of:
-
extracting feature vectors from a speech signal corresponding to a spoken phrase to be recognized; segmenting and classifying the successive extracted feature vectors into syllable oriented word subunits by means of a stored supply of word subunits to form a set of hypotheses; comparing the set of hypotheses formed from the segmented and classified word subunits with standard pronunciations and pronunciation variants of a plurality of words stored in a reference pattern vocabulary over a three-dimensional time dynamic period to generate a set of word hypotheses; and subjecting the generated set of word hypotheses to syntactic analysis in order to determine the spoken phrase. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
Specification