Apparatus, method and computer program product for recognizing speech
First Claim
1. A speech recognition apparatus comprising:
- a semantic-relation storage unit that stores semantic relation among words and relevance ratio indicating degree of the semantic relation in association with each other;
a first input accepting unit that accepts an input of a first speech;
a first candidate producing unit that recognizes the first speech and produces first recognition candidates and first likelihood of the first recognition candidates;
a first-candidate selecting unit that selects one of the first recognition candidates as a recognition result of the first speech based on the first likelihood of the first recognition candidates;
a second input accepting unit that accepts an input of a second speech including an object word and a clue word, the object word is contained in the first recognition candidates, the clue word that provides a clue for correcting the object word;
a second candidate producing unit that recognizes the second speech and produces second recognition candidates and second likelihood of the second recognition candidates;
a word extracting unit that extracts recognition candidates of the object word and recognition candidates of the clue word from the second recognition candidates;
a second-candidate selecting unit that acquires the relevance ratio associated with the semantic relation between the extracted recognition candidates of the objected word and the extracted recognition candidates of the clue word, from the semantic-relation storage unit, and selects one of the second recognition candidates as a recognition result of the second speech based on the acquired relevance ratio;
a correction-portion identifying unit that compares the recognition result of the first speech with the recognition result of the second speech, and identifies a portion corresponding to the object word; and
a correcting unit that corrects the identified portion corresponding to the object word.
4 Assignments
0 Petitions
Accused Products
Abstract
A speech recognition apparatus includes a first-candidate selecting unit that selects a recognition result of a first speech from first recognition candidates based on likelihood of the first recognition candidates; a second-candidate selecting unit that extracts recognition candidates of a object word contained in the first speech and recognition candidates of a clue word from second recognition candidates, acquires the relevance ratio associated with the semantic relation between the extracted recognition candidates of the object word and the extracted recognition candidates of the clue word, and selects a recognition result of the second speech based on the acquired relevance ratio; a correction-portion identifying unit that identifies a portion corresponding to the object word in the first speech; and a correcting unit that corrects the word on identified portion.
-
Citations
15 Claims
-
1. A speech recognition apparatus comprising:
-
a semantic-relation storage unit that stores semantic relation among words and relevance ratio indicating degree of the semantic relation in association with each other; a first input accepting unit that accepts an input of a first speech; a first candidate producing unit that recognizes the first speech and produces first recognition candidates and first likelihood of the first recognition candidates; a first-candidate selecting unit that selects one of the first recognition candidates as a recognition result of the first speech based on the first likelihood of the first recognition candidates; a second input accepting unit that accepts an input of a second speech including an object word and a clue word, the object word is contained in the first recognition candidates, the clue word that provides a clue for correcting the object word; a second candidate producing unit that recognizes the second speech and produces second recognition candidates and second likelihood of the second recognition candidates; a word extracting unit that extracts recognition candidates of the object word and recognition candidates of the clue word from the second recognition candidates; a second-candidate selecting unit that acquires the relevance ratio associated with the semantic relation between the extracted recognition candidates of the objected word and the extracted recognition candidates of the clue word, from the semantic-relation storage unit, and selects one of the second recognition candidates as a recognition result of the second speech based on the acquired relevance ratio; a correction-portion identifying unit that compares the recognition result of the first speech with the recognition result of the second speech, and identifies a portion corresponding to the object word; and a correcting unit that corrects the identified portion corresponding to the object word. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A speech recognition method comprising:
-
accepting a first speech; recognizing the accepted first speech to produce first recognition candidates and first likelihood of the first recognition candidates; selecting one of the first recognition candidates produced for a first speech as the recognition result of the first speech based on the first likelihood of the first recognition candidates; accepting a second speech that includes a object word and a clue word, the object word is contained in the first recognition candidates, the clue word that provides a clue for correcting the object word; recognizing the accepted second speech to produce second recognition candidates and second likelihood of the second recognition candidates; extracting recognition candidates of the object word and recognition candidates of the clue word from the produced second recognition candidates; acquiring a relevance ratio associated with the semantic relation between the extracted recognition candidates of the object word and the extracted recognition candidates of the clue word from a semantic-relation storage unit that stores therein semantic relation among words and relevance ratio indicating degree of the semantic relation in association with each other; selecting one of the second recognition candidates as the recognition result of the second speech based on the acquired relevance ratio; comparing the recognition result of the first speech with the recognition result of the second speech; identifying a portion corresponding to the object word in the first speech; and correcting the identified portion corresponding to the object word.
-
-
15. A computer program product having a computer readable medium including programmed instructions for recognizing speech, wherein the instructions, when executed by a computer, cause the computer to perform:
-
accepting a first speech; recognizing the accepted first speech to produce first recognition candidates and first likelihood of the first recognition candidates; selecting one of the first recognition candidates produced for a first speech as the recognition result of the first speech based on the first likelihood of the first recognition candidates; accepting a second speech that includes a object word and a clue word, the object word is contained in the first recognition candidates, the clue word that provides a clue for correcting the object word; recognizing the accepted second speech to produce second recognition candidates and second likelihood of the second recognition candidates; extracting recognition candidates of the object word and recognition candidates of the clue word from the produced second recognition candidates; acquiring a relevance ratio associated with the semantic relation between the extracted recognition candidates of the object word and the extracted recognition candidates of the clue word from a semantic-relation storage unit that stores therein semantic relation among words and relevance ratio indicating degree of the semantic relation in association with each other; selecting one of the second recognition candidates as the recognition result of the second speech based on the acquired relevance ratio; comparing the recognition result of the first speech with the recognition result of the second speech; identifying a portion corresponding to the object word in the first speech; and correcting the identified portion corresponding to the object word.
-
Specification