METHOD FOR BUILDING LANGUAGE MODEL, SPEECH RECOGNITION METHOD AND ELECTRONIC APPARATUS
Abstract
A method for building a language model, a speech recognition method and an electronic apparatus are provided. The speech recognition method includes the following steps. Phonetic transcriptions of a speech signal are obtained from an acoustic model. Phonetic spellings matching the phonetic transcriptions are obtained according to the phonetic transcriptions and a syllable acoustic lexicon. According to the phonetic spellings, a plurality of text sequences and a plurality of text sequence probabilities are obtained from a language model. Each phonetic spelling is matched to a candidate sentence table; a word probability of each phonetic spelling matching a word in a sentence of the candidate sentence table is obtained; and the word probabilities of the phonetic spellings are calculated so as to obtain the text sequence probabilities. The text sequence corresponding to the largest one of the text sequence probabilities is selected as a recognition result of the speech signal.
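The matching and selection steps described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the table layout, the ASCII phonetic spellings, the sample probabilities and the function names are all hypothetical, and combining the word probabilities by multiplication is an assumption, since the abstract only says they are "calculated".

```python
from math import prod

# Hypothetical candidate sentence table: each candidate sentence maps to one
# entry per word, and each entry maps a phonetic spelling to a word probability.
# All spellings and probabilities below are made up for illustration.
CANDIDATE_SENTENCE_TABLE = {
    "I read books": [{"ai": 1.0}, {"reed": 0.6, "red": 0.4}, {"buks": 1.0}],
    "I need books": [{"ai": 1.0}, {"need": 1.0}, {"buks": 1.0}],
}


def text_sequence_probability(spellings, word_entries):
    """Combine per-word probabilities into one text sequence probability.

    Assumption: the combination is a plain product, and a spelling that does
    not match the word at the same position contributes probability 0.
    """
    if len(spellings) != len(word_entries):
        return 0.0
    return prod(entry.get(s, 0.0) for s, entry in zip(spellings, word_entries))


def recognize(spellings, table):
    """Return the candidate sentence with the largest text sequence probability."""
    scores = {
        sentence: text_sequence_probability(spellings, entries)
        for sentence, entries in table.items()
    }
    best = max(scores, key=scores.get)
    return best, scores


best_sentence, all_scores = recognize(["ai", "red", "buks"], CANDIDATE_SENTENCE_TABLE)
print(best_sentence, all_scores)
```

In this sketch a candidate sentence whose words do not match the observed spellings scores zero, so only sentences consistent with every phonetic spelling compete for the largest probability.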
37 Citations
28 Claims
1. A method for building a language model, adapted to an electronic apparatus, the method comprising:
- receiving a plurality of candidate sentences; and
- obtaining a plurality of phonetic spellings matching each of words in each of the candidate sentences and a plurality of word probabilities according to a text corpus, so as to obtain a candidate sentence table corresponding to the candidate sentences.
Dependent claims: 2, 3
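As a rough illustration of claim 1, the sketch below builds a candidate sentence table from a small corpus. Everything in it is an assumption made for the example: the corpus is taken to be a list of (word, phonetic spelling) pairs, a word probability is estimated as the relative frequency of a spelling for that word, and the function names and sample data are hypothetical. The claim itself does not specify how the probabilities are derived from the text corpus.

```python
from collections import Counter, defaultdict


def estimate_word_probabilities(tagged_corpus):
    """Estimate P(spelling | word) as a relative frequency (an assumption)."""
    counts = defaultdict(Counter)
    for word, spelling in tagged_corpus:
        counts[word][spelling] += 1
    return {
        word: {s: c / sum(ctr.values()) for s, c in ctr.items()}
        for word, ctr in counts.items()
    }


def build_candidate_sentence_table(candidate_sentences, tagged_corpus):
    """Map each candidate sentence to (word, {spelling: probability}) entries."""
    probabilities = estimate_word_probabilities(tagged_corpus)
    return {
        sentence: [(word, probabilities.get(word, {})) for word in sentence.split()]
        for sentence in candidate_sentences
    }


# Made-up corpus of (word, phonetic spelling) observations.
corpus = [
    ("I", "ai"),
    ("read", "reed"),
    ("read", "reed"),
    ("read", "red"),
    ("books", "buks"),
]
table = build_candidate_sentence_table(["I read books"], corpus)
print(table["I read books"])
```

Words that never occur in the corpus receive an empty entry here; a real system would need some policy for unseen words, which this sketch leaves open.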
4. A speech recognition method, adapted to an electronic apparatus, comprising:
- obtaining a phonetic transcription sequence of a speech signal according to an acoustic model, and the phonetic transcription sequence including a plurality of phones;
- obtaining a plurality of phonetic spellings matching the phonetic transcription sequence according to the phonetic transcription sequence and a syllable acoustic lexicon;
- obtaining a plurality of text sequences and a plurality of text sequence probabilities from a language model according to the phonetic spellings, and matching each of the phonetic spellings with a candidate sentence table, so as to obtain a word probability of each of the phonetic spellings corresponding to each of the words in the candidate sentences; and
- calculating the word probabilities of the phonetic spellings, so as to obtain the text sequence probabilities, wherein the candidate sentences corresponding to the text sequence probabilities are the text sequences; and
- selecting the text sequence corresponding to a largest one among the text sequence probabilities as a recognition result of the speech signal.
Dependent claims: 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
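The second step of claim 4, turning a phone sequence into phonetic spellings with a syllable acoustic lexicon, might look like the sketch below. The lexicon format (a tuple of phones mapped to a list of spellings), the phone symbols, and the exhaustive segmentation strategy are all assumptions chosen for illustration; the claim does not state how the lexicon is organized or searched.

```python
# Hypothetical syllable acoustic lexicon: a tuple of phones maps to the
# phonetic spellings it may represent. Symbols and entries are made up.
SYLLABLE_ACOUSTIC_LEXICON = {
    ("ay",): ["ai"],
    ("r", "iy", "d"): ["reed"],
    ("r", "eh", "d"): ["red"],
    ("b", "uh", "k", "s"): ["buks"],
}


def phonetic_spelling_sequences(phones, lexicon):
    """Enumerate every way to split the phones into lexicon entries.

    Returns a list of spelling sequences; an empty list means the phone
    sequence cannot be covered by the lexicon.
    """
    phones = tuple(phones)
    if not phones:
        return [[]]
    results = []
    for end in range(1, len(phones) + 1):
        for spelling in lexicon.get(phones[:end], []):
            for rest in phonetic_spelling_sequences(phones[end:], lexicon):
                results.append([spelling] + rest)
    return results


phones = ["ay", "r", "eh", "d", "b", "uh", "k", "s"]
print(phonetic_spelling_sequences(phones, SYLLABLE_ACOUSTIC_LEXICON))
```

Exhaustive enumeration is only practical for short inputs; it is used here because it keeps the relationship between phones, lexicon entries and spellings easy to follow.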
15. An electronic apparatus, comprising:
- a storage unit, storing a plurality of program code segments; and
- a processing unit, coupled to the storage unit, the processing unit executing a plurality of commands through the program code segments, and the commands comprising:
  - receiving a plurality of candidate sentences; and
  - obtaining a plurality of phonetic spellings matching each of words in each of the candidate sentences and a plurality of word probabilities according to a text corpus, so as to obtain a candidate sentence table corresponding to the candidate sentences.
Dependent claims: 16, 17
18. An electronic apparatus, comprising:
- an input unit, receiving a speech signal;
- a storage unit, storing a plurality of program code segments; and
- a processing unit, coupled to the input unit and the storage unit, the processing unit executing a plurality of commands through the program code segments, and the commands comprising:
  - obtaining a phonetic transcription sequence of the speech signal according to an acoustic model, and the phonetic transcription sequence including a plurality of phones;
  - obtaining a plurality of phonetic spellings matching the phonetic transcription sequence according to the phonetic transcription sequence and a syllable acoustic lexicon;
  - obtaining a plurality of text sequences and a plurality of text sequence probabilities from a language model according to the phonetic spellings, and matching each of the phonetic spellings with a candidate sentence table, so as to obtain a word probability of each of the phonetic spellings corresponding to each of the words in the candidate sentences; and
  - calculating the word probabilities of the phonetic spellings, so as to obtain the text sequence probabilities, wherein the candidate sentences corresponding to the text sequence probabilities are the text sequences; and
  - selecting the text sequence corresponding to a largest one among the text sequence probabilities as a recognition result of the speech signal.
Dependent claims: 19, 20, 21, 22, 23, 24, 25, 26, 27, 28
Specification