Speech recognition by neural network adapted to reference pattern learning
First Claim
1. A pattern recognition method for recognizing syllables and sound elements on the basis of the comparison of input time series patterns expressed as feature vectors of the syllables and sound elements with reference pattern models using a finite status transition network, in which each status of said finite status transition network has a predictor, comprising the steps of:
- (a) calculating, in each predictor, a predicted feature vector at time t from a plurality of input feature vectors between time (t-1) and time (t-τ
F) and a plurality of input feature vectors between time (t+1) and time (t+τ
B), wherein said τ
B and τ
F are predetermined natural number;
(b) determining a local distance at every t between said input feature vectors and t-th status of said finite transition network by using said input feature vectors, said predicted feature vector and a covariance matrix which accompanies t-th status of said finite status transition network;
(c) calculating an accumulated value of said local distances for every reference pattern defined by said status of said finite state transition network;
(d) detecting a minimum of said accumulated values for every reference pattern; and
(e) outputting a category of the reference pattern corresponding to said minimum as a recognition result.
0 Assignments
0 Petitions
Accused Products
Abstract
A speech recognition method according to the present invention uses distances calculated through a variance weighting process using covariance matrixes as the local distances (prediction residuals) between the feature vectors of input syllables/sound elements and predicted vectors formed by different statuses of reference neural prediction models (NPM'"'"'s) using finite status transition networks. The category to minimize the accumulated value of these local distances along the status transitions of all the prediction models is figured out by dynamic programming, and used as the recognition output. Learning of the reference prediction models used in this recognition method is accomplished by repeating said distance calculating process and the process to correct the parameters of the different statuses and the covariance matrixes of said prediction models in the direction of reducing the distance between the learning patterns whose category is known and the prediction models of the same category as this known category, and what have satisfied prescribed conditions of convergence through these calculating and correcting processes are determined as reference pattern models.
49 Citations
2 Claims
-
1. A pattern recognition method for recognizing syllables and sound elements on the basis of the comparison of input time series patterns expressed as feature vectors of the syllables and sound elements with reference pattern models using a finite status transition network, in which each status of said finite status transition network has a predictor, comprising the steps of:
-
(a) calculating, in each predictor, a predicted feature vector at time t from a plurality of input feature vectors between time (t-1) and time (t-τ
F) and a plurality of input feature vectors between time (t+1) and time (t+τ
B), wherein said τ
B and τ
F are predetermined natural number;(b) determining a local distance at every t between said input feature vectors and t-th status of said finite transition network by using said input feature vectors, said predicted feature vector and a covariance matrix which accompanies t-th status of said finite status transition network; (c) calculating an accumulated value of said local distances for every reference pattern defined by said status of said finite state transition network; (d) detecting a minimum of said accumulated values for every reference pattern; and (e) outputting a category of the reference pattern corresponding to said minimum as a recognition result. - View Dependent Claims (2)
-
Specification