Apparatuses and methods for training and operating speech recognition systems
Abstract
A word recognition system can:
respond to the input of a character string from a user by limiting the words it will recognize to words having a related, but not necessarily the same, string;
score signals generated after a user has been prompted to generate a given word against words other than the prompted word to determine if the signal should be used to train the prompted word;
vary the number of signals a user is prompted to generate to train a given word as a function of how well the training signals score against each other or prior models for the prompted word;
create a new acoustic model of a phrase by concatenating prior acoustic models of the words in the phrase;
obtain information from another program running on the same computer, such as its commands or the context of text being entered into it, and use that information to vary which words it can recognize;
determine which program unit, such as an application program or dialog box, currently has input focus on its computer and create a vocabulary state associated with that program unit into which vocabulary words which will be made active when that program group has the focus can be put;
detect the available computational resources and alter the instructions it executes in response;
test if its ability to respond to voice input has been shut off without user confirmation, and, if so, turn that ability back on and prompt the user to confirm if that ability is to be turned off;
store both a first and a second set of models for individual vocabulary words and enable a user to selectively cause the recognizer to disregard the second set of models for a selected word;
and/or score a signal representing a given word against models for that word from different word model sets to select which model should be used for future recognition.
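Two of the capabilities listed in the abstract lend themselves to a short illustration: restricting recognition to words related to a typed string, and building a phrase model by concatenating existing word models. The sketch below is only one possible reading. The function names, the case-insensitive prefix rule, and the representation of an acoustic model as a list of frames are all assumptions made for illustration and are not taken from the patent.

```python
def related_words(vocabulary, typed):
    """Keep words whose spelling starts with the typed string, ignoring case.

    Assumption: "related, but not necessarily the same, string" is read here
    as a case-insensitive prefix match.
    """
    key = typed.casefold()
    return [w for w in vocabulary if w.casefold().startswith(key)]


def phrase_model(phrase, word_models):
    """Build a phrase model by concatenating the frame lists of its words.

    Assumption: each word model is simply a list of feature frames.
    """
    frames = []
    for word in phrase.split():
        frames.extend(word_models[word])
    return frames


# Hypothetical vocabulary and toy one-dimensional word models.
vocabulary = ["Print", "print preview", "paste", "Open"]
word_models = {"print": [[0.1], [0.2]], "preview": [[0.7], [0.8], [0.9]]}

print(related_words(vocabulary, "pr"))
print(phrase_model("print preview", word_models))
```

A real recognizer would use a far richer notion of "related" and of acoustic model; the point here is only the shape of the two operations.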
17 Claims
1. A computerized method of training pattern information used by a word recognition system to identify the word signals which represent individual vocabulary words, said method comprising the steps of:
storing a plurality of vocabulary words, including storing pattern information for identifying the word signals which represent each such vocabulary word;
prompting the user to generate a word signal representing a given vocabulary word;
receiving a post-prompt word signal generated after the user was prompted to generate the prompted word;
scoring the match of the post-prompt word signal against each word in an active vocabulary including one or more of said stored plurality of vocabulary words other than said prompted word;
responding, when said scoring of the post-prompt word signal against one or more of said active vocabulary words other than said prompted word is better than a certain measure, by reducing the use of the post-prompt word signal to train pattern information associated with the prompted word; and
responding, when said scoring of the post-prompt word signal against said active vocabulary words other than the prompted word which is worse than a certain measure, by increasing the use of the post-prompt word signal to train pattern information associated with the prompted word.
Dependent claims: 2, 3, 4, 5, 6
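The steps of claim 1 can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: word models are toy feature vectors, the match score is a Euclidean distance (so a lower score is a better match), "reducing" and "increasing" the use of a signal are modeled as a training weight of 0.0 or 1.0, and the names `training_weight`, `train`, and `reject_below` are hypothetical.

```python
from math import dist


def score(signal, model):
    """Toy match score: Euclidean distance, so lower means a better match."""
    return dist(signal, model)


def training_weight(signal, prompted, models, reject_below=1.0):
    """Decide how much to use `signal` when training the prompted word.

    If any active word other than the prompted word matches better than the
    threshold, the user probably said something else, so the weight is 0.0.
    Otherwise the signal is used at full weight.
    """
    others = [score(signal, m) for w, m in models.items() if w != prompted]
    if others and min(others) < reject_below:
        return 0.0
    return 1.0


def train(models, prompted, signal, weight, rate=0.5):
    """Move the prompted word's model toward the signal, scaled by weight."""
    step = rate * weight
    models[prompted] = [
        m + step * (s - m) for m, s in zip(models[prompted], signal)
    ]


# Hypothetical active vocabulary with two-dimensional toy models.
models = {"open": [0.0, 0.0], "close": [4.0, 4.0], "stop": [8.0, 0.0]}

# The user is prompted for "open" but the signal looks like "close".
w_bad = training_weight([3.9, 4.1], "open", models)
# The signal matches no other word well, so it is used for training.
w_good = training_weight([0.5, 0.5], "open", models)
train(models, "open", [0.5, 0.5], w_good)
```

A graded weight between 0 and 1 would fit the claim language equally well; the binary choice is only the simplest case.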
7. A computerized method for training pattern information used by a word recognition system to identify which word signals generated by one or more users correspond to which vocabulary words, said method comprising the steps of:
prompting a user to generate a number of word signals representing a given word;
scoring the match of one or more post-prompt word signals, that is, word signals generated after the user is prompted to generate the prompted word, against pattern information associated with the prompted word to produce a score for each such match;
controlling the number of word signals the user is prompted to generate for the prompted word as a function of the scores produced by said scoring; and
incorporating information from the prompted word's post-prompt word signals into pattern information associated with the prompted word.
Dependent claims: 8, 9
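One way to picture the "controlling the number of word signals" step of claim 7 is a loop whose target count grows when tokens score poorly. The sketch below assumes that lower scores are better and that each poorly scoring token earns one extra prompt, up to a cap. The names `total_prompts` and `collect_tokens`, the thresholds, and the policy itself are illustrative assumptions rather than anything stated in the claim.

```python
def total_prompts(scores, good_enough=1.0, min_prompts=2, max_prompts=6):
    """Number of tokens to request in total, given the scores seen so far.

    Assumption: one extra prompt per poorly scoring token, capped.
    """
    poor = sum(1 for s in scores if s > good_enough)
    return min(max_prompts, min_prompts + poor)


def collect_tokens(get_signal, score_fn):
    """Prompt repeatedly until the score-dependent target count is reached."""
    tokens, scores = [], []
    while len(tokens) < total_prompts(scores):
        signal = get_signal()  # stands in for prompting the user
        tokens.append(signal)
        scores.append(score_fn(signal))
    return tokens


# Simulated user input: the "signal" is just its own score here.
clean = iter([0.3, 0.4, 0.2, 0.1])
noisy = iter([2.0, 0.5, 0.5, 0.5])

clean_tokens = collect_tokens(lambda: next(clean), lambda s: s)
noisy_tokens = collect_tokens(lambda: next(noisy), lambda s: s)
```

With consistently good scores the loop stops at the minimum; a single poor token adds one more prompt.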
10. A computerized word recognition system comprising;
means for receiving user generated word signals representing words to be recognized;
means for storing a plurality of vocabulary words, including storing, for each such word, pattern information for identifying word signals which represent that word;
recognition means for performing pattern matching between word signals and said pattern information to produce scores indicating the relative probabilities that a given word signal corresponds to different ones of said vocabulary words; and
means for training the pattern information associated with vocabulary words to better represent the word signals generated for their respective words under given circumstances, including;
means for repeatedly prompting a user to generate a word signal representing a vocabulary word whose associated pattern information is to be trained;
means for causing the recognition means to pattern match each post-prompt word signal, that is, each word signal generated after a user has been prompted to generate a word signal for the prompted word, against prior pattern information associated with the prompted word and to produce a score for each such match;
repeat control means for causing the means for repeatedly prompting to stop repeatedly prompting the user to generate a word signal of the prompted word once a scoring requirement has been met, that is, once one or more post-prompt word signals received for the prompted word have scored better than a certain measure against the pattern information for the prompted word; and
means for incorporating information from the post-prompt word for the prompted word into the pattern information associated with the prompted word.
Dependent claims: 11, 12, 13, 14, 15, 16, 17
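The repeat control described in claim 10 amounts to a stopping rule: keep prompting until enough post-prompt signals score well against the prior model, then fold the collected signals into that model. The sketch below is a simplified illustration. It assumes toy vector models, Euclidean distance as the score (lower is better), simple averaging as the way information is incorporated, and a hypothetical `max_prompts` safety cap; none of these details come from the claim.

```python
from math import dist


def train_word(model, signals, threshold=1.0, required_good=1, max_prompts=5):
    """Collect signals until the scoring requirement is met, then retrain.

    `signals` stands in for successive user responses to the prompt.
    Returns the updated model and the number of prompts actually used.
    """
    collected = []
    good = 0
    for prompts, signal in enumerate(signals, start=1):
        collected.append(signal)
        # Score the post-prompt signal against the prior model.
        if dist(signal, model) < threshold:
            good += 1
        # Repeat control: stop once enough signals have scored well.
        if good >= required_good or prompts >= max_prompts:
            break
    # Incorporate the collected signals by averaging them with the prior model.
    n = len(collected) + 1
    new_model = [
        (m + sum(t[i] for t in collected)) / n for i, m in enumerate(model)
    ]
    return new_model, len(collected)


# The first response is far from the prior model; the second is close.
responses = [[4.0, 0.0], [0.5, 0.0], [0.1, 0.1]]
new_model, used = train_word([0.0, 0.0], responses)
```

Here the third response is never requested, because the second one already satisfies the scoring requirement.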
Specification