METHOD AND APPARATUS FOR SPEECH RECOGNITION AND GENERATION OF SPEECH RECOGNITION ENGINE
First Claim
Patent Images
1. A method of speech recognition, the method comprising:
- receiving a speech input;
transmitting the speech input to a speech recognition engine; and
receiving a speech recognition result from the speech recognition engine,wherein the speech recognition engine is configured to obtain a phoneme sequence from the speech input and provide the speech recognition result based on a phonetic distance of the phoneme sequence.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and apparatus for speech recognition and for generation of speech recognition engine, and a speech recognition engine are provided. The method of speech recognition involves receiving a speech input, transmitting the speech input to a speech recognition engine, and receiving a speech recognition result from the speech recognition engine, in which the speech recognition engine obtains a phoneme sequence from the speech input and provides the speech recognition result based on a phonetic distance of the phoneme sequence.
20 Citations
20 Claims
-
1. A method of speech recognition, the method comprising:
-
receiving a speech input; transmitting the speech input to a speech recognition engine; and receiving a speech recognition result from the speech recognition engine, wherein the speech recognition engine is configured to obtain a phoneme sequence from the speech input and provide the speech recognition result based on a phonetic distance of the phoneme sequence. - View Dependent Claims (2, 3, 4)
-
-
5. A method of generating speech recognition engine, the method comprising:
-
obtaining phoneme sequences of words; determining phonetic similarities between the phoneme sequences by comparing phonemes comprised in the phoneme sequences; calculating phonetic distances between the words based on the determined phonetic similarities between the phoneme sequences; and generating embedding vectors based on the calculated phonetic distances between the words. - View Dependent Claims (6, 7, 8, 9, 10)
-
-
11. A method of speech recognition, the method comprising:
-
receiving a speech input; obtaining a phoneme sequence from the speech input; selecting an embedding vector closest in a phonetic distance to the phoneme sequence among embedding vectors arranged on an N-dimensional embedding space; and outputting a speech recognition result based on a phoneme sequence mapped to the selected embedding vector. - View Dependent Claims (12, 13)
-
-
14. An apparatus comprising:
-
a microphone configured to receive a speech input; a phoneme sequence processor configured to obtain a phoneme sequence from the speech input; and a speech recognition engine configured to generate a speech recognition result based on a phonetic distance of the phoneme sequence. - View Dependent Claims (15, 16, 17)
-
-
18. A speech recognition engine, comprising:
-
an embedded vector processor configured to select an embedding vector corresponding to a phoneme sequence among embedding vectors arranged on an embedding space; and a speech recognition result synthesizer configured to recognize a word in the speech input based on the selected embedding vector. - View Dependent Claims (19, 20)
-
Specification