Technologies for end-of-sentence detection using syntactic coherence
First Claim
Patent Images
1. An automatic speech recognition device comprising:
- a speech data capture module to acquire speech data;
a phoneme recognition module to recognize, based on the speech data, phonemes of the speech data;
a word recognition module to recognize, based on the phonemes, words of the speech data;
a syntactic parser module to parse, based on the words, the speech data to determine a syntactic coherence of the speech data; and
an end-of-sentence determination module to;
determine, based on the words, a word statistics end-of-sentence score;
determine, based on the syntactic coherence and the word statistics end-of-sentence score, an end of a sentence of the speech data; and
determine, based on the determined end of the sentence, a speech recognition result.
1 Assignment
0 Petitions
Accused Products
Abstract
Technologies for detecting an end of a sentence in automatic speech recognition are disclosed. An automatic speech recognition device may acquire speech data, and identify phonemes and words of the speech data. The automatic speech recognition device may perform a syntactic parse based on the recognized words, and determine an end of a sentence based on the syntactic parse. For example, if the syntactic parse indicates that a certain set of consecutive recognized words form a syntactically complete and correct sentence, the automatic speech recognition device may determine that there is an end of a sentence at the end of that set of words.
14 Citations
25 Claims
-
1. An automatic speech recognition device comprising:
-
a speech data capture module to acquire speech data; a phoneme recognition module to recognize, based on the speech data, phonemes of the speech data; a word recognition module to recognize, based on the phonemes, words of the speech data; a syntactic parser module to parse, based on the words, the speech data to determine a syntactic coherence of the speech data; and an end-of-sentence determination module to; determine, based on the words, a word statistics end-of-sentence score; determine, based on the syntactic coherence and the word statistics end-of-sentence score, an end of a sentence of the speech data; and determine, based on the determined end of the sentence, a speech recognition result. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. One or more non-transitory, machine-readable storage media comprising a plurality of instructions stored thereon that, when executed, cause an automatic speech recognition device to:
-
acquire speech data; recognize, based on the speech data, phonemes of the speech data; recognize, based on the phonemes, words of the speech data; parse, based on the words, the speech data to determine a syntactic coherence of the speech data; determine, based on the words, a word statistics end-of-sentence score; determine, based on the syntactic coherence and the word statistics end-of-sentence score, an end of a sentence of the speech data; and determine, based on the determined end of the sentence, a speech recognition result. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17)
-
-
18. A method for determining an end of a sentence of speech data, the method comprising:
-
acquiring, by an automatic speech recognition device, speech data; recognizing, by the automatic speech recognition device and based on the speech data, phonemes of the speech data; recognizing, by the automatic speech recognition device and based on the phonemes, words of the speech data; parsing, by the automatic speech recognition device and based on the words, the speech data to determine a syntactic coherence of the speech data; determine, by the automatic speech recognition device and based on the words, a word statistics end-of-sentence score; determining, by the automatic speech recognition device and based on the syntactic coherence and the word statistics end-of-sentence score, an end of a sentence of the speech data; determining, by the automatic speech recognition device and based on the determined end of the sentence, a speech recognition result. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25)
-
Specification