Speech recognition apparatus, speech recognition method, and storage medium
First Claim
1. A speech recognition apparatus for recognizing an input speech as a recognized speech, comprising:
- a feature extracting means for extracting feature amounts from the input speech;
a preliminary word-selecting means for selecting words on the basis of the feature amounts by referring to a first database;
a matching means for calculating acoustic and linguistic scores for the selected words and forming a word string serving as a candidate for the recognized speech by referring to a second database;
wherein the second database incorporates more precise acoustic model, phoneme information, and grammar rules than the first database;
a control means for generating word-connection-information between words in the word string;
the word-connection-information including acoustic and linguistic scores for each word in the word string;
a re-evaluation means for re-evaluating the word string and correcting the word-connection-information by referring to a third database;
wherein the third database incorporates more precise acoustic models, phoneme information, and grammar rules than the second database; and
the control means determining the recognized speech by correcting the word string on the basis of the corrected word-connection-information.
1 Assignment
0 Petitions
Accused Products
Abstract
A preliminary word-selecting section selects one or more words following words which have been obtained in a word string serving as a candidate for a result of speech recognition; and a matching section calculates acoustic or linguistic scores for the selected words, and forms a word string serving as a candidate for a result of speech recognition according to the scores. A control section generates word-connection relationships between words in the word string serving as a candidate for a result of speech recognition, sends them to a word-connection-information storage section, and stores them in it. A re-evaluation section corrects the word-connection relationships stored in the word-connection-information storage section 16, and the control section determines a word string serving as the result of speech recognition according to the corrected word-connection relationships.
-
Citations
7 Claims
-
1. A speech recognition apparatus for recognizing an input speech as a recognized speech, comprising:
-
a feature extracting means for extracting feature amounts from the input speech; a preliminary word-selecting means for selecting words on the basis of the feature amounts by referring to a first database; a matching means for calculating acoustic and linguistic scores for the selected words and forming a word string serving as a candidate for the recognized speech by referring to a second database;
wherein the second database incorporates more precise acoustic model, phoneme information, and grammar rules than the first database;a control means for generating word-connection-information between words in the word string;
the word-connection-information including acoustic and linguistic scores for each word in the word string;a re-evaluation means for re-evaluating the word string and correcting the word-connection-information by referring to a third database;
wherein the third database incorporates more precise acoustic models, phoneme information, and grammar rules than the second database; andthe control means determining the recognized speech by correcting the word string on the basis of the corrected word-connection-information. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A speech recognition method of recognizing an input speech as a recognized speech, comprising the steps of:
-
a feature extracting step of extracting feature amounts from the input speech; a preliminary word-selecting step of selecting words on the basis of the feature amounts by referring to a first database; a matching step of calculating acoustic and linguistic scores for the selected words and forming a word string serving as a candidate for the recognized speech by referring to a second database;
wherein the second database incorporates more precise acoustic model, phoneme information, and grammar rules than the first database;a control step of generating word-connection-information between words in the word string;
the word-connection-information including acoustic and linguistic scores for each word in the word string;a re-evaluation step of re-evaluating the word string and correcting the word-connection-information by referring to a third database;
wherein the third database incorporates more precise acoustic models, phoneme information, and grammar rules than the second database; anda second control step of determining the recognized speech by correcting the word string on the basis of the corrected word-connection-information.
-
-
7. A recording medium for storing a program which executes on a computer for recognizing an input speech as a recognized speech, the program comprising:
-
a feature extracting step of extracting feature amounts from the input speech; a preliminary word-selecting step of selecting words on the basis of the feature amounts by referring to a first database; a matching step of calculating acoustic and linguistic scores for the selected words and forming a word string serving as a candidate for the recognized speech by referring to a second database;
wherein the second database incorporates more precise acoustic model, phoneme information, and grammar rules than the first database;a control step of generating word-connection-information between words in the word string;
the word-connection-information including acoustic and linguistic scores for each word in the word string;a re-evaluation step of re-evaluating the word string and correcting the word-connection-information by referring to a third database;
wherein the third database incorporates more precise acoustic models, phoneme information, and grammar rules than the second database; anda second control step of determining the recognized speech by correcting the word string on the basis of the corrected word-connection-information.
-
Specification