Detection of end of utterance in speech recognition system
First Claim
1. A speech recognition system comprising a speech recognizer with end of utterance detection, wherein the speech recognizer is configured to determine whether recognition result determined from received speech data is stabilized, the speech recognizer is configured to process values of best state scores and best token scores associated with frames of received speech data for end of utterance detection purposes, and the speech recognizer is configured to determine whether end of utterance is detected or not, based on the processing, if the recognition result is stabilized.
10 Assignments
0 Petitions
Accused Products
Abstract
The present invention relates to speech recognition systems, especially to arranging detection of end-of utterance in such systems. A speech recognizer of the system is configured to determine whether recognition result determined from received speech data is stabilized. The speech recognizer is configured to process values of best state scores and best token scores associated with frames of received speech data for end of utterance detection purposes. Further, the speech recognizer is configured to determine whether end of utterance is detected or not, based on the processing, if the recognition result is stabilized.
-
Citations
31 Claims
-
1. A speech recognition system comprising a speech recognizer with end of utterance detection, wherein the speech recognizer is configured to determine whether recognition result determined from received speech data is stabilized,
the speech recognizer is configured to process values of best state scores and best token scores associated with frames of received speech data for end of utterance detection purposes, and the speech recognizer is configured to determine whether end of utterance is detected or not, based on the processing, if the recognition result is stabilized.
-
13. A method for arranging detection of end-of utterance in a speech recognition system, the method comprising:
-
processing values of best state scores and best token scores associated with frames of received speech data for end of utterance detection purposes, determining whether recognition result determined from received speech data is stabilized, and determining whether end of utterance is detected or not, based on the processing, if the recognition result is stabilized. - View Dependent Claims (14, 15, 16, 17)
-
-
18. An electronic device comprising a speech recognizer, wherein the speech recognizer is configured to determine whether recognition result determined from received speech data is stabilized,
the speech recognizer is configured to process values of best state scores and best token scores associated with frames of received speech data for end of utterance detection purposes, and the speech recognizer is configured to determine whether end of utterance is detected or not, based on the processing, if the recognition result is stabilized.
-
31. A computer program product, loadable into the memory of a data processing device, for arranging detection of end-of utterance in an electronic device comprising a speech recognizer, the computer program product comprising:
-
program code for processing values of best state scores and best token scores associated with frames of received speech data for end of utterance detection purposes, program code for determining whether recognition result determined from received speech data is stabilized, and program code for determining whether end of utterance is detected or not, based on the processing, if the recognition result is stabilized.
-
Specification