Language model generation and accumulation device, speech recognition device, language model creation method, and speech recognition method
First Claim
1. A language model generation and accumulation apparatus that generates and accumulates language models for speech recognition, the apparatus comprising:
- a higher-level N-gram language model generation and accumulation unit operable to generate and accumulate a higher-level N-gram language model that is obtained by modeling each of a plurality of texts as a sequence of words that includes a word string class having a specific linguistic property; and
a lower-level N-gram language model generation and accumulation unit operable to generate and accumulate a lower-level N-gram language model that is obtained by modeling a sequence of words within the word string class.
3 Assignments
0 Petitions
Accused Products
Abstract
A language model generation and accumulation apparatus (10) that generates and accumulates language models for speech recognition is comprised of: a higher-level N-gram generation and accumulation unit (11) that generates and accumulates a higher-level N-gram language model obtained by modeling each of a plurality of texts as a string of words including a word string class having a specific linguistic property; and a lower class dependent word N-gram generation and accumulation unit (12) that generates and accumulates a lower-level N-gram language model obtained by modeling a sequence of words included in each word string class.
-
Citations
30 Claims
-
1. A language model generation and accumulation apparatus that generates and accumulates language models for speech recognition, the apparatus comprising:
-
a higher-level N-gram language model generation and accumulation unit operable to generate and accumulate a higher-level N-gram language model that is obtained by modeling each of a plurality of texts as a sequence of words that includes a word string class having a specific linguistic property; and
a lower-level N-gram language model generation and accumulation unit operable to generate and accumulate a lower-level N-gram language model that is obtained by modeling a sequence of words within the word string class. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A speech recognition apparatus that recognizes a speech which is a sequence of uttered words, using the following:
-
a higher-level N-gram language model that is obtained by modeling each of a plurality of texts as a sequence of words that includes a word string class having a specific linguistic property; and
a lower-level N-gram language model that is obtained by modeling a sequence of words within the word string class.
-
-
14. A speech recognition apparatus that recognizes a sequence of uttered words, comprising
a language model generation and accumulation apparatus that generates and accumulates language models for speech recognition, wherein the language model generation and accumulation apparatus includes: -
a higher-level N-gram language model generation and accumulation unit operable to generate and accumulate a higher-level N-gram language model that is obtained by modeling each of a plurality of texts as a sequence of words that includes a word string class having a specific linguistic property; and
a lower-level N-gram language model generation and accumulation unit operable to generate and accumulate a lower-level N-gram language model that is obtained by modeling a sequence of words within the word string class, and the speech recognition apparatus recognizes the speech by use of the higher-level N-gram language model that is accumulated by the higher-level N-gram language model generation and accumulation unit and the lower-level N-gram language model that is accumulated by the lower-level N-gram language model generation and accumulation unit. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25)
-
-
26. A language model generation method for generating language models for speech recognition, comprising:
-
a higher-level N-gram language model generation and accumulation step of generating and accumulating a higher-level N-gram language model that is obtained by modeling each of a plurality of texts as a sequence of words that includes a word string class having a specific linguistic property; and
a lower-level N-gram language model generation and accumulation step of generating and accumulating a lower-level N-gram language model that is obtained by modeling a sequence of words within the word string class.
-
-
27. A speech recognition method for recognizing a speech which is a sequence of uttered words, using the following:
-
a higher-level N-gram language model that is obtained by modeling each of a plurality of texts as a sequence of words that includes a word string class having a specific linguistic property; and
a lower-level N-gram language model that is obtained by modeling a sequence of words within the word string class.
-
-
28. The speech recognition method comprising:
-
a step of categorizing each word string having a specific linguistic property as a word string class, and providing, to said each word string, a language likelihood which is a logarithm value of a probability, by use of class dependent word N-grams that are obtained by modeling said word string class in dependency on said word string class based on a linguistic relationship between words constituting said word string class;
a step of analyzing a text into a word and the word string class, and providing, to a sequence of said word and the word string class, a language likelihood which is a logarithm value of a probability, by use of class N-grams that are obtained by modeling said sequence of the word and the word string class based on a linguistic relationship; and
a step of (i) comparing features parameters extracted from a series of speeches with a pronunciation as well as an acoustic characteristic of each word and generating a set of word hypotheses including an utterance segment of said each word and an acoustic likelihood of said each word, (ii) generating a word string hypothesis from said set of word string hypotheses with reference to the class N-grams and the class dependent word N-grams, and (iii) outputting a result of the speech recognition.
-
-
29. A program for performing a language model generation method that is intended for generating a language model for speech recognition, the program causing a computer to execute the following steps:
-
a higher-level N-gram language model generation and accumulation step of generating and accumulating a higher-level N-gram language model that is obtained by modeling each of a plurality of texts as a sequence of words that includes a word string class having a specific linguistic property; and
a lower-level N-gram language model generation and accumulation step of generating and accumulating a lower-level N-gram language model that is obtained by modeling a sequence of words within the word string class.
-
-
30. A program for performing a speech recognition method that is intended for recognizing a sequence of uttered words, the program causing a computer to execute a speech recognition step that is performed by use of the following:
-
a higher-level N-gram language model that is obtained by modeling each of a plurality of texts as a sequence of words that includes a word string class having a specific linguistic property; and
a lower-level N-gram language model that is obtained by modeling a sequence of words within the word string class.
-
Specification