Symbol insertion apparatus and method
Abstract
An apparatus and method are provided for inserting punctuation marks at appropriate positions in a sentence. An acoustic processor processes input utterances to extract voice data and transforms the data into a feature vector. When automatic insertion of punctuation marks is not performed, a language decoder processes the feature vector using only a general-purpose language model and inserts a comma at a location marked in the voice data by an explicit entry such as “ten,” which is clearly a location at which a comma should be inserted. When automatic punctuation insertion is performed, the language decoder employs both the general-purpose language model and the punctuation mark language model to identify an unvoiced pause location for the insertion of a punctuation mark, such as a comma.
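The two operating modes described in the abstract can be illustrated with a minimal Python sketch. It is not the patented decoder. The `<pause>` marker, the `comma_after` probability table standing in for the punctuation mark language model, and the `threshold` parameter are all hypothetical names introduced for illustration, and treating the spoken entry “ten” as a comma in both modes is an assumption.

```python
PAUSE = "<pause>"      # hypothetical marker for an unvoiced pause in the voice data
SPOKEN_COMMA = "ten"   # the spoken entry the abstract cites as an explicit comma


def insert_punctuation(tokens, auto, comma_after=None, threshold=0.5):
    """Return a new token list with commas inserted.

    auto=False: only the explicit spoken entry becomes a comma; pauses are dropped.
    auto=True:  a pause also becomes a comma when the stand-in punctuation model
                (comma_after: previous word -> probability) reaches the threshold.
    """
    comma_after = comma_after or {}
    out = []
    for tok in tokens:
        if tok == SPOKEN_COMMA:
            # Explicitly dictated comma (assumed to apply in both modes).
            out.append(",")
        elif tok == PAUSE:
            prev = out[-1] if out else None
            if auto and comma_after.get(prev, 0.0) >= threshold:
                out.append(",")
            # Otherwise the pause is simply dropped.
        else:
            out.append(tok)
    return out


# Toy stand-in for the punctuation mark language model (made-up value).
toy_model = {"however": 0.9}

manual = insert_punctuation(["however", PAUSE, "it", "works"], auto=False)
automatic = insert_punctuation(["however", PAUSE, "it", "works"], auto=True,
                               comma_after=toy_model)
dictated = insert_punctuation(["however", SPOKEN_COMMA, "it", "works"], auto=False)
```

With these toy inputs, the pause yields a comma only in automatic mode, while the dictated entry yields one even when automatic insertion is off.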
16 Claims
1. A speech recognition apparatus comprising:
a transformer for transforming sequences of phonemes extracted from an utterance into one or more word sequences, and for assigning to said word sequences appearance probabilities, in accordance with which said word sequences are originally represented by said phoneme sequences;
a renewer for renewing said appearance probability assigned to each of said word sequences by employing a renewal value represented by a language model corresponding to each of said word sequences; and
a speech recognizer for selecting the word sequence having the highest appearance probability, by screening all said word sequences assigned appearance probabilities, according to which said word sequences are originally represented by said phoneme sequences, wherein said renewer calculates said renewal value using a first language model which is employed when each of said word sequences always includes a specific symbol as a word and a second language model which is employed in other situations to renew said appearance probabilities based on said renewal value.
15. A method of speech recognition comprising the steps of:
transforming sequences of phonemes extracted from an utterance into one or more word sequences, and of assigning to said word sequences appearance probabilities, in accordance with which said word sequences are originally represented by said phoneme sequences;
renewing said appearance probability assigned to each of said word sequences by employing a renewal value represented by a language model corresponding to each of said word sequences; and
selecting the word sequence having the highest appearance probability, by screening all said word sequences assigned appearance probabilities, according to which said word sequences are originally represented by said phoneme sequences, wherein, during said renewing, said renewal value is calculated using a first language model which is employed when each of said word sequences always includes a specific symbol as a word and a second language model which is employed in other situations to renew said appearance probabilities based on said renewal value.
16. A program storage device readable by machine, tangibly embodying a program of instructions executable by the machine to perform method steps for providing symbol insertion, said method comprising the steps of:
transforming sequences of phonemes extracted from an utterance into one or more word sequences, and of assigning to said word sequences appearance probabilities, in accordance with which said word sequences are originally represented by said phoneme sequences;
renewing said appearance probability assigned to each of said word sequences by employing a renewal value represented by a language model corresponding to each of said word sequences; and
selecting the word sequence having the highest appearance probability, by screening all said word sequences assigned appearance probabilities, according to which said word sequences are originally represented by said phoneme sequences, wherein, during said renewing, said renewal value is calculated using a first language model which is employed when each of said word sequences always includes a specific symbol as a word and a second language model which is employed to renew said appearance probabilities based on said renewal value.
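The transform, renew, and select steps recited in the independent claims can be sketched as a small rescoring routine. This is an illustrative reading only, not the claimed implementation. The claims do not specify the arithmetic, so working in log probabilities and adding the language-model score as the renewal value are assumptions. The table-based models, the `","` token standing in for the "specific symbol", and every function name are hypothetical.

```python
import math

SYMBOL = ","  # hypothetical token standing in for the claims' "specific symbol"


def table_lm(table, floor=math.log(1e-6)):
    """Build a toy language model: word tuple -> log probability (floor if unseen)."""
    return lambda words: table.get(tuple(words), floor)


def renew(hypotheses, symbol_lm, general_lm, symbol=SYMBOL):
    """Renew each hypothesis' appearance probability.

    hypotheses: list of (words, log_prob) pairs produced by the transform step.
    The first model is applied when the word sequence contains the symbol as a
    word; the second model is applied otherwise. Adding log scores is an assumption.
    """
    renewed = []
    for words, log_prob in hypotheses:
        lm = symbol_lm if symbol in words else general_lm
        renewed.append((words, log_prob + lm(words)))
    return renewed


def select_best(renewed):
    """Select the word sequence with the highest renewed probability."""
    return max(renewed, key=lambda pair: pair[1])[0]


# Toy word sequences with made-up probabilities from the transform step.
hypotheses = [
    (("hello", ",", "world"), math.log(0.4)),
    (("hello", "world"), math.log(0.6)),
]
symbol_lm = table_lm({("hello", ",", "world"): math.log(0.5)})
general_lm = table_lm({("hello", "world"): math.log(0.2)})

best = select_best(renew(hypotheses, symbol_lm, general_lm))
```

In this toy case the punctuated hypothesis starts with the lower probability (0.4 against 0.6) but wins after renewal, since 0.4 × 0.5 = 0.20 exceeds 0.6 × 0.2 = 0.12.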
Specification