Speech recognizer
First Claim
Patent Images
1. A speech recognizer for performing word recognition on input speech by using information on models of speech units each shorter than a word, the speech recognizer comprising:
- vocabulary label network accumulation means for accumulating label series of said speech units for generic words commonly used to perform word recognition on input speech of unspecified speakers;
registered word label series extraction means for generating label series of said speech units for registered words from input speech of a particular speaker; and
registration means for storing the label series of speech units for the generic words commonly used for word recognition of input speech of said unspecified speakers and the generated registered word label series in the form of parallel networks in said vocabulary label network accumulation means;
wherein said speech units are acoustic events generated by dividing a Hidden Markov Model of phoneme into individual states while maintaining the values of a transition probability and an output probability and the number of states.
1 Assignment
0 Petitions
Accused Products
Abstract
The generic word label series used for recognition of words uttered by unspecified speakers are stored in the vocabulary label network accumulation processing. The speech of a particular speaker is entered. Based on the input speech, the registered word label series extraction processing generates the registered word label series. The registered word label series of the particular speaker can then be registered with the vocabulary label network accumulation processing.
-
Citations
12 Claims
-
1. A speech recognizer for performing word recognition on input speech by using information on models of speech units each shorter than a word, the speech recognizer comprising:
-
vocabulary label network accumulation means for accumulating label series of said speech units for generic words commonly used to perform word recognition on input speech of unspecified speakers;
registered word label series extraction means for generating label series of said speech units for registered words from input speech of a particular speaker; and
registration means for storing the label series of speech units for the generic words commonly used for word recognition of input speech of said unspecified speakers and the generated registered word label series in the form of parallel networks in said vocabulary label network accumulation means;
wherein said speech units are acoustic events generated by dividing a Hidden Markov Model of phoneme into individual states while maintaining the values of a transition probability and an output probability and the number of states. - View Dependent Claims (2)
-
-
3. The speech recognizer for performing word recognition on input speech by using information on models of speech units each shorter than a word the speech recognizer comprising:
-
vocabulary label network accumulation means for accumulating label series of said speech units for generic words commonly use to perform word recognition on input speech of unspecified speakers;
registered word label series extraction means for generating label series satisfying a connection of said speech units and having the highest probability in the label series of said speech units for registered words from input speech of a particular speaker by using a network in which said connection of the speech units related to the connections of speech units is described; and
registration means for registering to add the generated registered word label series to said vocabulary label network accumulation means;
wherein said speech units are acoustic events generated by dividing a Hidden Markov Model of phoneme into individual states while maintaining the values of a transition probability and an output probability and the number of states. - View Dependent Claims (4)
-
-
5. The speech recognizer for performing word recognition on input speech by using information on models of speech units each shorter than a word, the speech recognizer comprising:
-
vocabulary label network accumulation means for accumulating label series of said speech units for generic words commonly used to perform word recognition on input speech of unspecified speakers;
registered word label series extraction means for generating label series satisfying a connection of said speech units and having the highest probability in the label series of said speech units for registered words from input speech of a particular speaker by using a network in which said connection of the speech units related to the connections of speech units is described; and
registration means for storing said label series of speech units for generic words commonly used to perform word recognition on input speech of unspecified speakers and the generated registered word label series in the form of parallel networks in said vocabulary label network accumulation means;
wherein said speech units are acoustic events generated by dividing a Hidden Markov Model of phoneme into individual states while maintaining the values of a transition probability and an output probability and the number of states. - View Dependent Claims (6)
-
-
7. A speech recognition method for performing word recognition on input speech by using information on models of speech units each shorter than a word,
wherein label series of said speech units for generic words commonly used to perform word recognition on input speech of unspecified speakers are accumulated in vocabulary label network accumulation means; -
said method comprising steps of;
generating label series of said speech units for registered words from input speech of a particular speaker; and
storing said label series of speech units for generic words commonly used to perform word recognition on input speech of unspecified speakers and the generated registered word label series in the form of parallel networks in said vocabulary label network accumulation means wherein said speech units are acoustic events generated by dividing a Hidden Markov Model of phoneme into individual states while maintaining the values of a transition probability and an output probability and the number of states. - View Dependent Claims (8)
-
-
9. A speech recognition method for performing word recognition on input speech by using information on models of speech units each shorter than a word.
wherein label series of said speech units for generic words commonly used to perform word recognition on input speech of unspecified speakers are accumulated in vocabulary label network accumulation means; -
said method comprising steps of;
generated label series satisfying a connection of said speech units and having the highest probability in the label series of said speech units for registered words from input speech of a particular speaker by using a network in which said connection of the speech units related to the connections of speech units is described; and
registering to add the generated registered word label series to said vocabulary label network accumulation means;
wherein said speech units are acoustic events generated by dividing a Hidden Markov Model of phoneme into individual states while maintaining the values of a transition probability and an output probability and the number of states. - View Dependent Claims (10)
-
-
11. A speech recognition method for performing word recognition on input speech by using information on models of speech units each shorter than a word.
wherein label series of said speech units for generic words commonly used to perform word recognition on input speech of unspecified speakers are accumulated in vocabulary label network accumulation means: -
said method comprising steps of;
generating label series satisfying a connection of said speech units and having the highest probability in the label series of said speech units for registered words from input speech of a particular speaker by using a network in which said connection of the speech units related to the connections of speech units is described; and
storing said label series of speech units for generic words commonly use to perform word recognition in input speech of unspecified speakers and the generated registered word label series in said vocabulary label network accumulation means;
wherein said speech units are acoustic events generated by dividing a Hidden Markov Model of phoneme into individual states while maintaining the values of a transition probability and an output probability and the number of states. - View Dependent Claims (12)
-
Specification