SPEECH RECOGNITION METHOD AND ELECTRONIC APPARATUS
First Claim
1. A speech recognition method, adapted to an electronic apparatus, comprising:
obtaining a phonetic transcription sequence of a speech signal according to an acoustic model;
obtaining a plurality of possible syllable sequences and a plurality of corresponding phonetic spelling matching probabilities according to the phonetic transcription sequence and a syllable acoustic lexicon;
obtaining, from a language model, a probability of a plurality of text sequences appeared in the language model; and
selecting the text sequence corresponding to a largest one among a plurality of associated probabilities to be used as a recognition result of the speech signal.
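The selection step of the claim can be illustrated with a toy sketch. It assumes that the "associated probability" of a candidate is the product of its phonetic spelling matching probability and its language model probability; the claim text above does not define how the two are combined, so the product, the function name `recognize`, and all toy values below are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

Syllables = Tuple[str, ...]

# Hypothetical lexicon output: possible syllable sequences for one phonetic
# transcription sequence, each with a phonetic spelling matching probability.
SYLLABLE_CANDIDATES: List[Tuple[Syllables, float]] = [
    (("ni3", "hao3"), 0.6),
    (("li3", "hao3"), 0.4),
]

# Hypothetical language model: text sequences reachable from each syllable
# sequence, each with the probability of that text sequence in the model.
LANGUAGE_MODEL: Dict[Syllables, List[Tuple[str, float]]] = {
    ("ni3", "hao3"): [("hello", 0.05), ("draft ready", 0.001)],
    ("li3", "hao3"): [("Li Hao", 0.002)],
}


def recognize(
    candidates: List[Tuple[Syllables, float]],
    language_model: Dict[Syllables, List[Tuple[str, float]]],
) -> Tuple[Optional[str], float]:
    """Return the text sequence with the largest associated probability."""
    best_text: Optional[str] = None
    best_score = 0.0
    for syllables, match_prob in candidates:
        for text, lm_prob in language_model.get(syllables, []):
            # Assumed combination rule: matching probability times
            # language model probability.
            score = match_prob * lm_prob
            if score > best_score:
                best_text, best_score = text, score
    return best_text, best_score


text, score = recognize(SYLLABLE_CANDIDATES, LANGUAGE_MODEL)
print(text, score)
```

With these toy values the candidate "hello" wins, because its combined score is larger than that of any other text sequence. A practical system would more likely sum log-probabilities to avoid numerical underflow on long sequences.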
Abstract
A speech recognition method and an electronic apparatus are provided. The speech recognition method includes the following steps. A plurality of phonetic transcriptions of a speech signal are obtained according to an acoustic model. A plurality of phonetic spellings and intonation information matching the phonetic transcriptions are obtained according to the phonetic transcription sequence and a syllable acoustic lexicon. According to the phonetic spellings and the intonation information, a plurality of phonetic spelling sequences and a plurality of phonetic spelling sequence probabilities are obtained from a language model. The phonetic spelling sequence corresponding to the largest of the phonetic spelling sequence probabilities is selected as a recognition result of the speech signal.
20 Claims
1. A speech recognition method, adapted to an electronic apparatus, comprising:
obtaining a phonetic transcription sequence of a speech signal according to an acoustic model;
obtaining a plurality of possible syllable sequences and a plurality of corresponding phonetic spelling matching probabilities according to the phonetic transcription sequence and a syllable acoustic lexicon;
obtaining, from a language model, a probability of a plurality of text sequences appeared in the language model; and
selecting the text sequence corresponding to a largest one among a plurality of associated probabilities to be used as a recognition result of the speech signal.
Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
11. An electronic apparatus, comprising:
an input unit, receiving a speech signal;
a storage unit, storing a plurality of program code segments; and
a processing unit, coupled to the input unit and the storage unit, the processing unit executing a plurality of commands through the program code segments, and the commands comprising:
obtaining a phonetic transcription sequence of the speech signal according to an acoustic model;
obtaining a plurality of syllable sequences and a plurality of corresponding phonetic spelling matching probabilities according to the phonetic transcription sequence and a syllable acoustic lexicon;
obtaining, from a language model, a probability of a plurality of text sequences appeared in the language model; and
selecting the text sequence corresponding to a largest one among a plurality of associated probabilities to be used as a recognition result of the speech signal.
Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
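The arrangement of units in claim 11 can be pictured as a small composition of objects. The sketch below is only an illustration under stated assumptions: the class names, the shared `context` dictionary, and the four stub commands with their fixed toy outputs are all hypothetical, and the stubs stand in for the acoustic model, syllable acoustic lexicon, and language model rather than implementing them.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List

# A "command" here is any callable that reads and updates a shared context.
Command = Callable[[Dict[str, Any]], None]


@dataclass
class StorageUnit:
    """Holds the program code segments (modeled as a list of commands)."""
    code_segments: List[Command] = field(default_factory=list)


@dataclass
class ElectronicApparatus:
    storage: StorageUnit

    def receive(self, speech_signal: Any) -> Dict[str, Any]:
        """Input unit: accept a speech signal and start a fresh context."""
        return {"speech_signal": speech_signal}

    def process(self, speech_signal: Any) -> Any:
        """Processing unit: run the stored commands in order."""
        context = self.receive(speech_signal)
        for command in self.storage.code_segments:
            command(context)
        return context.get("result")


# Hypothetical stub commands with fixed toy outputs.
def transcribe(ctx: Dict[str, Any]) -> None:
    ctx["transcription"] = ["n", "i", "h", "ao"]  # stands in for the acoustic model


def match_syllables(ctx: Dict[str, Any]) -> None:
    # Syllable sequence -> phonetic spelling matching probability.
    ctx["syllables"] = {("ni", "hao"): 0.6, ("li", "hao"): 0.4}


def score_texts(ctx: Dict[str, Any]) -> None:
    # Text sequence -> (source syllable sequence, language model probability).
    ctx["texts"] = {"hello": (("ni", "hao"), 0.05), "Li Hao": (("li", "hao"), 0.002)}


def select_best(ctx: Dict[str, Any]) -> None:
    # Assumed combination rule: product of the two probabilities.
    def associated(text: str) -> float:
        syllables, lm_prob = ctx["texts"][text]
        return ctx["syllables"][syllables] * lm_prob

    ctx["result"] = max(ctx["texts"], key=associated)


apparatus = ElectronicApparatus(
    StorageUnit([transcribe, match_syllables, score_texts, select_best])
)
print(apparatus.process(speech_signal=b"\x00\x01"))
```

Keeping each step as a separate command mirrors the claim's wording, in which the processing unit executes a plurality of commands through stored program code segments; any step can then be swapped out without touching the others.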
Specification