Speech recognition apparatus and speech recognition method
First Claim
1. A speech recognition apparatus which obtains and recognizes speech, said apparatus comprising:
- a language model storage unit operable to store language models for recognizing speech;
a tag information storage unit operable to store a piece of tag information for each of the language models, the tag information indicating a feature of each language model;
a relevance degree holding unit operable to hold a relevance degree between each piece of tag information and each of words;
an importance degree holding unit operable to hold an importance degree of each piece of tag information to a corresponding one of the language models;
a word obtainment unit operable to obtain one of the words;
a relevance degree derivation unit operable to derive the relevance degree between each piece of tag information and the word obtained by said word obtainment unit, from the respective relevance degrees held by said relevance degree holding unit;
a combination coefficient calculation unit operable to calculate, as a combination coefficient, a weight of each language model which corresponds to the obtained word, based on the relevance degrees derived by said relevance degree derivation unit and the importance degrees held by said importance degree holding unit, each of the relevance degrees indicating a relevance degree between the obtained word and one of the pieces of tag information of each language model;
a probability calculation unit operable to calculate a probability of appearance of a predetermined word using, in combination, a specific model probability and a combination coefficient, the specific model probability being derived for each of the language models and indicating the probability that the predetermined word will appear in the speech, and the combination coefficient for each of the language models being calculated by said combination coefficient calculation unit; and
a recognition unit operable to recognize the speech using the probability calculated by said probability calculation unit, wherein said word obtainment unit is operable to obtain the one of the words adapted to the speech recognized by said recognition unit.
2 Assignments
0 Petitions
Accused Products
Abstract
To provide a speech recognition apparatus which appropriately performs speech recognition by generating, in real time, language models adapted to a new topic even in the case where topics are changed. The speech recognition apparatus includes: a word specification unit for obtaining and specifying a word; a language model information storage unit for storing language models for recognizing speech and the respectively corresponding pieces of tag information; a combination coefficient calculation unit for calculating the weights of the respective language models, as combination coefficients, according to the word obtained by the word specification unit, based on the relevance degree between the word obtained by the word specification unit and the tag information of each language model; a language probability calculation unit for calculating the probabilities of word appearance by combining the respective language models according to the calculated combination coefficients; and a speech recognition unit for recognizing speech using the calculated probabilities of word appearance.
-
Citations
7 Claims
-
1. A speech recognition apparatus which obtains and recognizes speech, said apparatus comprising:
-
a language model storage unit operable to store language models for recognizing speech;
a tag information storage unit operable to store a piece of tag information for each of the language models, the tag information indicating a feature of each language model;
a relevance degree holding unit operable to hold a relevance degree between each piece of tag information and each of words;
an importance degree holding unit operable to hold an importance degree of each piece of tag information to a corresponding one of the language models;
a word obtainment unit operable to obtain one of the words;
a relevance degree derivation unit operable to derive the relevance degree between each piece of tag information and the word obtained by said word obtainment unit, from the respective relevance degrees held by said relevance degree holding unit;
a combination coefficient calculation unit operable to calculate, as a combination coefficient, a weight of each language model which corresponds to the obtained word, based on the relevance degrees derived by said relevance degree derivation unit and the importance degrees held by said importance degree holding unit, each of the relevance degrees indicating a relevance degree between the obtained word and one of the pieces of tag information of each language model;
a probability calculation unit operable to calculate a probability of appearance of a predetermined word using, in combination, a specific model probability and a combination coefficient, the specific model probability being derived for each of the language models and indicating the probability that the predetermined word will appear in the speech, and the combination coefficient for each of the language models being calculated by said combination coefficient calculation unit; and
a recognition unit operable to recognize the speech using the probability calculated by said probability calculation unit, wherein said word obtainment unit is operable to obtain the one of the words adapted to the speech recognized by said recognition unit. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A speech recognition method for obtaining speech and recognizing the data stored in a recording medium,
wherein the recording medium includes: -
a language model storage unit operable to store language models for recognizing speech;
a tag information storage unit operable to store a piece of tag information for each of the language models, the tag information indicating a feature of each language model;
a relevance degree holding unit operable to hold a relevance degree between each piece of tag information and each of words; and
an importance degree holding unit operable to hold an importance degree of each piece of tag information to a corresponding one of the language models, said speech recognition method comprises;
obtainment of one of the words;
derivation of the relevance degree between each piece of tag information and the word obtained by said obtainment of the word, from the respective relevance degrees held by the relevance degree holding unit;
calculation of, as a combination coefficient, a weight of each language model which corresponds to the obtained word, based on the relevance degrees derived by said derivation of the relevance degrees and the importance degrees held by the importance degree holding unit, each of the relevance degrees indicating a relevance degree between the obtained word and one of the pieces of tag information of each language model;
calculation of a probability of appearance of a predetermined word using, in combination, a specific model probability and a combination coefficient, the specific model probability being derived for each of the language models and indicating the probability that the predetermined word will appear in the speech, and the combination coefficient for each of the language models being calculated in said calculation of the combination coefficient;
recognition of the speech using the probability calculated in said calculation of the probability, wherein, said obtainment of the word includes obtainment of the one of the words adapted to the speech recognized in said recognition of the speech.
-
-
7. A program causing a computer to obtain speech and recognize the speech using the data stored on a recording medium,
wherein the recording medium includes: -
a language model storage unit operable to store language models for recognizing speech;
a tag information storage unit operable to store a piece of tag information for each of the language models, the tag information indicating a feature of each language model;
a relevance degree holding unit operable to hold a relevance degree between each piece of tag information and each of words; and
an importance degree holding unit operable to hold an importance degree of each piece of tag information to a corresponding one of the language models, said program causes a computer to execute;
obtainment of one of the words;
derivation of the relevance degree between each piece of tag information and the word obtained by said obtainment of the word, from the respective relevance degrees held by the relevance degree holding unit;
calculation of, as a combination coefficient, a weight of each language model which corresponds to the obtained word, based on the relevance degrees derived by said derivation of the relevance degrees and the importance degrees held by the importance degree holding unit, each of the relevance degrees indicating a relevance degree between the obtained word and one of the pieces of tag information of each language model;
calculation of a probability of appearance of a predetermined word using, in combination, a specific model probability and a combination coefficient, the specific model probability being derived for each of the language models and indicating the probability that the predetermined word will appear in the speech, and the combination coefficient for each of the language models being calculated in said calculation of the combination coefficient;
recognition of the speech using the probability calculated in said calculation of the probability, wherein, said obtainment of the word includes obtainment of the one of the words adapted to the speech recognized in said recognition of the speech.
-
Specification