Speech processing device and method, and program for recognition of out-of-vocabulary words in continuous speech
First Claim
Patent Images
1. A speech processing device, including a processor, for processing an input utterance and registering a word contained in the input utterance based on the processing result, comprising:
- recognition means for recognizing a continuous input utterance;
unknown word determination means for determining whether a result of recognition by the recognition means contains an unknown word;
network generating means for generating a network having two kinds of paths when the unknown word determination means determines that the result of recognition by the recognition means contains the unknown word;
wherein a first kind of path includes particular sub-words corresponding to word boundaries on one side or both sides of the unknown word, andwherein a second kind of path does not include the particular sub-words,wherein the particular sub-words are sub-words at positions where boundaries of the unknown words correspond to halfway points of the sub-words;
acquiring means for acquiring a word corresponding to the unknown word when the unknown word determination means determines that the result of recognition by the recognition means contains the unknown word; and
registering means for registering the word acquired by the acquiring means while associating the word with other information.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech processing device and a speech processing method, a storage medium, and a program decreases deletion errors and increases a speech recognition rate. A network of words and syllables is generated, and the network has two kinds of paths: paths that do not contain a particular syllable and paths that contain the syllable at a position corresponding to a boundary between words. Thus, an optimal sub-word sequence on the network is selected for an input utterance.
-
Citations
12 Claims
-
1. A speech processing device, including a processor, for processing an input utterance and registering a word contained in the input utterance based on the processing result, comprising:
-
recognition means for recognizing a continuous input utterance; unknown word determination means for determining whether a result of recognition by the recognition means contains an unknown word; network generating means for generating a network having two kinds of paths when the unknown word determination means determines that the result of recognition by the recognition means contains the unknown word; wherein a first kind of path includes particular sub-words corresponding to word boundaries on one side or both sides of the unknown word, and wherein a second kind of path does not include the particular sub-words, wherein the particular sub-words are sub-words at positions where boundaries of the unknown words correspond to halfway points of the sub-words; acquiring means for acquiring a word corresponding to the unknown word when the unknown word determination means determines that the result of recognition by the recognition means contains the unknown word; and registering means for registering the word acquired by the acquiring means while associating the word with other information. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A speech processing method, utilizing a processor in a speech processing device that processes an input utterance and registers a word contained in the input utterance based on the processing result, comprising the steps of:
-
recognizing a continuous input utterance, utilizing a recognition unit; determining whether a result of recognition in the recognition step contains an unknown word, utilizing an unknown word determination unit; generating a network having two kinds of paths, utilizing a network generating unit, wherein a first kind of path includes particular sub-words corresponding to word boundaries on one side or both sides of the unknown word, and wherein a second kind of path does not include the particular sub-words when the determining step determines that the result of recognition contains the unknown word; wherein the particular sub-words are sub-words at positions where boundaries of the unknown words corresponds to halfway points of the sub-words; acquiring a word corresponding to the unknown word when the determining step determines that the result of recognition contains the unknown word, utilizing an acquiring unit; and registering the word acquired in the acquiring step while associating the word with other information, utilizing a registering unit.
-
-
12. A storage medium for storing a computer-readable program executed in a speech processing device for processing input utterances and registering words contained in the input utterances, wherein the computer-readable program comprises the steps of:
-
recognizing a continuous input utterance; determining whether a result of recognition in the recognition step contains an unknown word; generating a network having two kinds of paths, when the determining step determines that the result of recognition contains the unknown word, wherein a first one kind of path includes particular sub-words corresponding to word boundaries on one side or both sides of the unknown word, and wherein a second kind of path does not include the particular sub-words, wherein the particular sub-words are sub-words at positions where boundaries of the unknown words correspond to halfway points of the sub-words; acquiring a word corresponding to the unknown word when the determining step determines that the result of recognition contains the unknown word; and registering the word acquired in the acquiring step while associating the word with other information.
-
Specification