Speaker independent speech recognition system and method
First Claim
1. A method of generating command models from a set of spoken commands, each spoken command being represented by a set of feature vectors determined from speech signals, the method comprising the steps of:
- vectorily summing each feature vectors associated with each spoken command to create a single command vector for each spoken command;
summing each single command vector associated with each spoken command to create a command set vector;
scaling each single command vector inversely proportional to a number of said feature vectors of said set representing each spoken command; and
adding the command set vector to each single command vector to create a scaled single command vector for each spoken command to create an individual command model for each command, the individual command model being a single vector.
4 Assignments
0 Petitions
Accused Products
Abstract
An improved method of training a SISRS uses less processing and memory resources by operating on vectors instead of matrices which represent spoken commands. Memory requirements are linearly proportional to the number of spoken commands for storing each command model. A spoken command is identified from the set of spoken commands by a command recognition procedure (200). The command recognition procedure (200) includes sampling the speaker'"'"'s speech, deriving cepstral coefficients and delta-cepstral coefficients, and performing a polynomial expansion on cepstral coefficients. The identified spoken command is selected using the dot product of the command model data and the average command structure representing the unidentified spoken command.
49 Citations
16 Claims
-
1. A method of generating command models from a set of spoken commands, each spoken command being represented by a set of feature vectors determined from speech signals, the method comprising the steps of:
-
vectorily summing each feature vectors associated with each spoken command to create a single command vector for each spoken command; summing each single command vector associated with each spoken command to create a command set vector; scaling each single command vector inversely proportional to a number of said feature vectors of said set representing each spoken command; and adding the command set vector to each single command vector to create a scaled single command vector for each spoken command to create an individual command model for each command, the individual command model being a single vector. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method of generating command models for a set of commands, each command being represented by a set of feature vectors, the method comprising the steps of:
-
combining the set of feature vectors for each command to create a high order command structure vector for each command; summing each high order command structure vector to create a total command structure vector; adding the total command structure vector to a scaled version of each high order command structure vector to create a scaled individual command structure vector for each command; computing an individual command model for each command using the scaled individual command structure vector for each command and the set of feature vectors for each command; and identifying an unidentified spoken command, said unidentified spoken command being represented by a plurality of spoken feature vectors, the identifying step further comprising the steps of; averaging the plurality of spoken feature vectors to produce an average command structure for the unidentified spoken command; performing a dot product with said average command structure and each individual command model to create a set of score values, each score value being associated with one command of the set of commands; and selecting a command from said set of commands based on a score value. - View Dependent Claims (7, 8, 9)
-
-
10. A method of identifying an unidentified spoken command from a set of individual command models, said unidentified spoken command being represented by a plurality of spoken feature vectors, the method comprising the steps of:
-
averaging the plurality of spoken feature vectors to produce an average command structure for the unidentified spoken command; performing a dot product with said average command structure and each individual command model to create a set of score values, each score value being associated with one command of a set of commands; and selecting a command from said set of commands based on a score value from said set of score values. - View Dependent Claims (11, 12, 13)
-
-
14. A speech recognition system for identifying an unidentified spoken command from a set of individual command models, said unidentified spoken command being represented by a plurality of spoken feature vectors, the speech recognition system comprising:
-
a command model memory for storing individual command models for a set of commands; a pattern classifier for averaging the plurality of spoken feature vectors to produce an average command structure for the unidentified spoken command, performing a dot product with said average command structure and each individual command model to create a set of score values, each score value being associated with a command of the set of commands; and a command selector for selecting one command from said set of commands based on a score value. - View Dependent Claims (15, 16)
-
Specification