Method and apparatus for speech recognition
First Claim
1. A speech recognition method, comprising:
generating, by a processor, a word sequence based on a phoneme sequence generated from a speech signal;
generating, by the processor, a syllable sequence based on the phoneme sequence, in response to a word element among words included in the word sequence having a lower recognition rate than a threshold value; and
determining, by the processor, a text corresponding to a recognition result of the speech signal based on the word sequence and the syllable sequence,
wherein the syllable sequence corresponds to the word element,
wherein the generating of the syllable sequence comprises generating the syllable sequence by decoding a portion corresponding to the word element, among phonemes included in the phoneme sequence, and
wherein the word sequence is generated by decoding the phoneme sequence and corresponds to at least the text corresponding to the recognition result of the speech signal.
Abstract
A method and apparatus for speech recognition are disclosed. The speech recognition apparatus includes a processor configured to process a received speech signal, generate a word sequence based on a phoneme sequence generated from the speech signal, generate a syllable sequence corresponding to a word element among words comprised in the word sequence based on the phoneme sequence, and determine a text corresponding to a recognition result of the speech signal based on the word sequence and the syllable sequence.
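The flow summarized above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the names `WordHypothesis`, `recognize`, `toy_word_decoder`, and `toy_syllable_decoder` are hypothetical, the per-word `score` field stands in for the "recognition rate", the phoneme labels and syllable table are invented toy data, and a real system would use trained acoustic and language models rather than lookup tables.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass
class WordHypothesis:
    """One decoded word plus the phoneme span it was decoded from (hypothetical type)."""
    text: str      # decoded word
    start: int     # index of the first phoneme of the word (inclusive)
    end: int       # index one past the last phoneme of the word (exclusive)
    score: float   # assumed stand-in for the word's recognition rate, in [0, 1]


def recognize(
    phonemes: Sequence[str],
    word_decoder: Callable[[Sequence[str]], List[WordHypothesis]],
    syllable_decoder: Callable[[Sequence[str]], List[str]],
    threshold: float,
) -> str:
    """Decode words; re-decode low-scoring words as syllables; join into text."""
    words = word_decoder(phonemes)
    pieces = []
    for word in words:
        if word.score < threshold:
            # Only the phonemes belonging to the low-scoring word are re-decoded.
            syllables = syllable_decoder(phonemes[word.start:word.end])
            pieces.append("".join(syllables))
        else:
            pieces.append(word.text)
    return " ".join(pieces)


# --- Toy stand-ins for the two decoders (assumptions, for illustration only) ---

PHONEMES = ["K", "AO", "L", "K", "IY", "R", "AH"]

# Hypothetical phoneme-to-syllable table.
SYLLABLES = {("K", "IY"): "ki", ("R", "AH"): "ra"}


def toy_word_decoder(phonemes: Sequence[str]) -> List[WordHypothesis]:
    # Pretend the second word is out of vocabulary and was decoded with a low score.
    return [
        WordHypothesis("call", 0, 3, 0.92),
        WordHypothesis("key", 3, 7, 0.31),
    ]


def toy_syllable_decoder(segment: Sequence[str]) -> List[str]:
    # Greedy longest-match lookup; unknown phonemes fall through unchanged.
    out, i = [], 0
    while i < len(segment):
        for length in (3, 2, 1):
            key = tuple(segment[i:i + length])
            if key in SYLLABLES:
                out.append(SYLLABLES[key])
                i += len(key)
                break
        else:
            out.append(segment[i].lower())
            i += 1
    return out


if __name__ == "__main__":
    print(recognize(PHONEMES, toy_word_decoder, toy_syllable_decoder, threshold=0.5))
```

In this sketch the threshold decides whether the word-level result is trusted; lowering it below the score of the second word leaves the word sequence unchanged.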
12 Claims
1. A speech recognition method, comprising:
generating, by a processor, a word sequence based on a phoneme sequence generated from a speech signal;
generating, by the processor, a syllable sequence based on the phoneme sequence, in response to a word element among words included in the word sequence having a lower recognition rate than a threshold value; and
determining, by the processor, a text corresponding to a recognition result of the speech signal based on the word sequence and the syllable sequence,
wherein the syllable sequence corresponds to the word element,
wherein the generating of the syllable sequence comprises generating the syllable sequence by decoding a portion corresponding to the word element, among phonemes included in the phoneme sequence, and
wherein the word sequence is generated by decoding the phoneme sequence and corresponds to at least the text corresponding to the recognition result of the speech signal.
Dependent claims: 2, 3, 4, 5, 9
6. A speech recognition method, comprising:
generating, by a processor, a word sequence by decoding a phoneme sequence generated from a speech signal;
generating, by the processor, a syllable sequence corresponding to a word element, among words included in the word sequence, based on the phoneme sequence, in response to the word element comprising a lower recognition rate than a threshold value; and
determining, by the processor, a text corresponding to a recognition result of the speech signal based on the word sequence and the syllable sequence,
wherein the generating of the syllable sequence comprises generating the syllable sequence by decoding a portion corresponding to the word element among phonemes included in the phoneme sequence.
Dependent claims: 7, 8
10. A speech recognition apparatus, comprising:
a processor configured to
process a received speech signal,
generate a word sequence based on a phoneme sequence generated from the speech signal,
generate a syllable sequence corresponding to a word element among words included in the word sequence, based on decoding a portion corresponding to the word element among phonemes included in the phoneme sequence, in response to the word element having a lower recognition rate than a threshold value, and
determine a text corresponding to a recognition result of the speech signal based on the word sequence and the syllable sequence,
wherein the word sequence is generated by decoding the phoneme sequence and corresponds to at least the text corresponding to the recognition result of the speech signal.
Dependent claims: 11, 12
Specification