Speech recognition method and speech recognition apparatus
First Claim
1. A speech recognition method comprising:
- analyzing an input speech input a plurality of times to recognize the input speech and generate a plurality of recognized speech information items;
detecting a rephrased speech information item corresponding to a rephrased speech from the recognition speech information items;
detecting a recognition error in units of a character string from an original speech information item corresponding to the rephrased speech information item;
removing an error character string corresponding to the recognition error from the original speech information item; and
generating a speech recognition result by using the rephrased speech information item and the original speech information item from which the error character string is removed.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech recognition method comprises analyzing an input speech input a plurality of times to recognize the input speech and generate a plurality of recognized speech information items, detecting a rephrased speech information item corresponding to a rephrased speech from the recognition speech information items, detecting a recognition error in units of a character string from an original speech information item corresponding to the rephrased speech information item, removing an error character string corresponding to the recognition error from the original speech information item, and generating a speech recognition result by using the rephrased speech information item and the original speech information item from which the error character string is removed.
-
Citations
20 Claims
-
1. A speech recognition method comprising:
-
analyzing an input speech input a plurality of times to recognize the input speech and generate a plurality of recognized speech information items;
detecting a rephrased speech information item corresponding to a rephrased speech from the recognition speech information items;
detecting a recognition error in units of a character string from an original speech information item corresponding to the rephrased speech information item;
removing an error character string corresponding to the recognition error from the original speech information item; and
generating a speech recognition result by using the rephrased speech information item and the original speech information item from which the error character string is removed. - View Dependent Claims (2, 3)
-
-
4. A speech recognition method comprising:
-
receiving an input speech a plurality of times to generate a plurality of input speech signals corresponding to an original speech and a rephrased speech;
analyzing the input speech signals to output feature information expressing a feature of the input speech;
collating the feature information with a dictionary storage to extract at least one recognition candidate information similar to the feature information;
storing the feature information corresponding to the input speech and the extracted candidate information in a history storage;
outputting interval information based on the feature information corresponding to at least two of the input speech signals and the extracted candidate information, referring to the history storage, the interval information representing at least one of one of a coincident interval and a similar speech interval and one of a non-similar interval and a non-coincident interval with respect to the rephrased speech and the original speech; and
reconstructing the input speech using the candidate information of the rephrased speech and the original speech based on the interval information. - View Dependent Claims (5, 6, 7, 8, 9)
-
-
10. A speech recognition apparatus comprising:
-
an input speech analyzer to analyze an input speech input a plurality of times to recognize the input speech and generate a plurality of recognized speech information items;
a rephrased speech detector to detect a rephrased speech information item corresponding to a rephrased speech from the recognition speech information items;
a recognition error detector to detect a recognition error in units of a character string from an original speech information item corresponding to the rephrased speech information item;
an error remover to remove an error character string corresponding to the recognition error from the original speech information item; and
a reconstruction unit to reconstruct the input speech by using the rephrased speech information item and the original speech information item from which the error character string is removed. - View Dependent Claims (11, 12)
-
-
13. A speech recognition apparatus comprising:
-
a speech input unit to receive an input speech a plurality of times to generate a plurality of input speech signals corresponding to an original speech and a rephrased speech;
a speech analysis unit to analyze the input speech signal to output feature information expressing a feature of the input speech;
a dictionary storage which stores recognition candidate information;
a collation unit configured to collate the feature information with the dictionary storage to extract at least one recognition candidate information similar to the feature information;
a history storage to store the feature information corresponding to the input speech and the extracted candidate information;
an interval information output unit to output interval information based on the feature information corresponding to at least two of the input speech signals and the extracted candidate information, referring to the history storage, the interval information representing at least one of one of a coincident interval and a similar speech interval and one of a non-similar interval and a non-coincident interval with respect to the rephrased speech and the original speech; and
a reconstruction unit to reconstruct the input speech using the candidate information of the rephrased speech and the original speech based on the interval information. - View Dependent Claims (14, 15, 16, 17, 18)
-
-
19. A speech recognition program stored on a computer readable medium comprising:
-
means for instructing a computer to analyze an input speech input a plurality of times to recognize the input speech and generate a plurality of recognized speech information items;
means for instructing the computer to detect a rephrased speech information item corresponding to a rephrased speech from the recognition speech information items;
means for instructing the computer to detect a recognition error in units of a character string from an original speech information item corresponding to the rephrased speech information item;
means for instructing the computer to remove an error character string corresponding to the recognition error from the original speech information item; and
means for instructing the computer to generate a speech recognition result by using the rephrased speech information item and the original speech information item from which the error character string is removed.
-
-
20. A speech recognition program stored on a computer readable medium comprising:
-
means for instructing the computer to take in an input speech a plurality of times to generate a plurality of input speech signals corresponding to an original speech and a rephrased speech;
means for instructing the computer to analyze the input speech signal to output feature information expressing a feature of the input speech;
means for instructing the computer to collate the feature information with a dictionary storage to extract at least one recognition candidate information similar to the feature information;
means for instructing the computer to store the feature information corresponding to the input speech and the extracted candidate information in a history storage;
means for instructing the computer to output interval information based on the feature information corresponding to at least two of the input speech signals and the extracted candidate information, referring to the history storage, the interval information representing at least one of one of a coincident interval and a similar speech interval and one of a non-similar interval and a non-coincident interval with respect to the rephrased speech and the original speech; and
means for instructing the computer to reconstruct the input speech using the candidate information of the rephrased speech and the original speech based on the interval information.
-
Specification