Apparatus, method, and computer program product for speech recognition allowing for recognition of character string in speech input
First Claim
1. A speech recognition apparatus comprising:
- a generation unit configured to receive a speech utterance and to generate at least one recognition candidate associating to the speech utterance and a likelihood of the recognition candidate;
a storing unit configured to store at least the one recognition candidate and the likelihood;
a selecting unit configured to select one of at least the one recognition candidate as a recognition result of a first speech utterance based on the likelihood;
an utterance relation determining unit configured to determine, when a first speech utterance and a second speech utterance are sequentially input, at least whether the second speech utterance which is input after the input of the first speech utterance is a speech re-utterance of a whole of the first speech utterance or a speech re-utterance of a part of the first speech utterance;
a whole correcting unit configured to correct the recognition candidate of the whole of the first speech utterance based on the second speech utterance and to display the corrected recognition result when the utterance relation determining unit determines that the second speech utterance is the speech re-utterance of the whole of the first speech utterance; and
a part correcting unit configured to correct the recognition candidate for the part of the first speech utterance, the part corresponding to the second speech utterance, based on the second speech utterance and to display the corrected recognition result when the utterance relation determining unit determines that the second speech utterance is the speech re-utterance of the part of the first speech utterance.
4 Assignments
0 Petitions
Accused Products
Abstract
A speech recognition apparatus includes a generation unit configured to receive a speech utterance and to generate at least one recognition candidate associating to the speech utterance and a likelihood of the recognition candidate; a storing unit configured to store at least the one recognition candidate and the likelihood; a selecting unit configured to select one of at least the one recognition candidate as a recognition result of a first speech utterance based on the likelihood; an utterance relation determining unit configured to determine, when a first speech utterance and a second speech utterance are sequentially input, at least whether the second speech utterance which is input after the input of the first speech utterance is a speech re-utterance of a whole of the first speech utterance or a speech re-utterance of a part of the first speech utterance; a whole correcting unit configured to correct the recognition candidate of the whole of the first speech utterance based on the second speech utterance and to display the corrected recognition result when the utterance relation determining unit determines that the second speech utterance is the speech re-utterance of the whole of the first speech utterance; and a part correcting unit configured to correct the recognition candidate for the part of the first speech utterance, the part corresponding to the second speech utterance, based on the second speech utterance and to display the corrected recognition result when the utterance relation determining unit determines that the second speech utterance is the speech re-utterance of the part of the first speech utterance.
246 Citations
15 Claims
-
1. A speech recognition apparatus comprising:
-
a generation unit configured to receive a speech utterance and to generate at least one recognition candidate associating to the speech utterance and a likelihood of the recognition candidate;
a storing unit configured to store at least the one recognition candidate and the likelihood;
a selecting unit configured to select one of at least the one recognition candidate as a recognition result of a first speech utterance based on the likelihood;
an utterance relation determining unit configured to determine, when a first speech utterance and a second speech utterance are sequentially input, at least whether the second speech utterance which is input after the input of the first speech utterance is a speech re-utterance of a whole of the first speech utterance or a speech re-utterance of a part of the first speech utterance;
a whole correcting unit configured to correct the recognition candidate of the whole of the first speech utterance based on the second speech utterance and to display the corrected recognition result when the utterance relation determining unit determines that the second speech utterance is the speech re-utterance of the whole of the first speech utterance; and
a part correcting unit configured to correct the recognition candidate for the part of the first speech utterance, the part corresponding to the second speech utterance, based on the second speech utterance and to display the corrected recognition result when the utterance relation determining unit determines that the second speech utterance is the speech re-utterance of the part of the first speech utterance. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A method of speech recognition comprising:
-
receiving a speech utterance;
generating at least one recognition candidate associating to the speech utterance and a likelihood of the recognition candidate;
selecting one of at least the one recognition candidate as a recognition result of a first speech utterance based on the likelihood;
determining, when a first speech utterance and a second speech utterance are sequentially input, at least whether the second speech utterance which is input after the input of the first speech utterance is a speech re-utterance of a whole of the first speech utterance or a speech re-utterance of a part of the first speech utterance;
correcting the recognition candidate of the whole of the first speech utterance based on the second speech utterance to display the corrected recognition result, when the utterance relation determining unit determines that the second speech utterance is the speech re-utterance of the whole of the first speech utterance; and
correcting the recognition candidate for the part of the first speech utterance, the part corresponding to the second speech utterance, based on the second speech utterance to display the corrected recognition result, when the second speech utterance is determined to be the speech re-utterance of the part of the first speech utterance.
-
-
15. A computer program product having a computer readable medium including programmed instructions for performing a speech recognition process, wherein the instructions, when executed by a computer, cause the computer to perform:
-
receiving a speech utterance;
generating at least one recognition candidate associating to the speech utterance and a likelihood of the recognition candidate;
selecting one of at least the one recognition candidate as a recognition result of a first speech utterance based on the likelihood;
determining, when a first speech utterance and a second speech utterance are sequentially input, at least whether the second speech utterance which is input after the input of the first speech utterance is a speech re-utterance of a whole of the first speech utterance or a speech re-utterance of a part of the first speech utterance;
correcting the recognition candidate of the whole of the first speech utterance based on the second speech utterance to display the corrected recognition result, when the utterance relation determining unit determines that the second speech utterance is the speech re-utterance of the whole of the first speech utterance; and
correcting the recognition candidate for the part of the first speech utterance, the part corresponding to the second speech utterance, based on the second speech utterance to display the corrected recognition result, when the second speech utterance is determined to be the speech re-utterance of the part of the first speech utterance.
-
Specification