System and method for effectively implementing a Mandarin Chinese speech recognition dictionary
First Claim
1. A system for performing a speech recognition procedure with an electronic device, comprising:
- a recognizer configured to compare input speech data to phone strings from a vocabulary dictionary to thereby generate and output one or more recognized words from said vocabulary dictionary, said vocabulary dictionary being implemented according to an optimized phone set, each of said phone strings being implemented as a sequence of phonemes that are serially configured, said optimized phone set being implemented in a compact manner by utilizing an allophone variation technique that maps different pronunciations of said input speech data to a respective one of said phone strings, said vocabulary dictionary being implemented by utilizing one or more dictionary optimization techniques, said optimized phone set representing sounds of a Mandarin Chinese language without utilizing corresponding tonal information as part of different phones in said optimized phone set, said recognizer thus performing said speech recognition procedure without utilizing any type of tone data to thereby output said one or more recognized words as a final speech recognition result; and
a processor configured to control said recognizer to thereby perform said speech recognition procedure.
5 Assignments
0 Petitions
Accused Products
Abstract
The present invention comprises a system and method for effectively implementing a Mandarin Chinese speech recognition dictionary, and may include a recognizer configured to compare input speech data to phone strings from a vocabulary dictionary that is implemented according to an optimized Mandarin Chinese phone set. The optimized Mandarin Chinese phone set may efficiently be implemented by utilizing an allophone and phonemic variation technique. In addition, the foregoing vocabulary dictionary may be implemented by utilizing unified dictionary optimization techniques to provide robust and accurate speech recognition. Furthermore, the vocabulary dictionary may be implemented as an optimized dictionary to accurately recognize either Northern Mandarin Chinese speech or Southern Mandarin Chinese speech during the speech recognition procedure.
8 Citations
41 Claims
-
1. A system for performing a speech recognition procedure with an electronic device, comprising:
-
a recognizer configured to compare input speech data to phone strings from a vocabulary dictionary to thereby generate and output one or more recognized words from said vocabulary dictionary, said vocabulary dictionary being implemented according to an optimized phone set, each of said phone strings being implemented as a sequence of phonemes that are serially configured, said optimized phone set being implemented in a compact manner by utilizing an allophone variation technique that maps different pronunciations of said input speech data to a respective one of said phone strings, said vocabulary dictionary being implemented by utilizing one or more dictionary optimization techniques, said optimized phone set representing sounds of a Mandarin Chinese language without utilizing corresponding tonal information as part of different phones in said optimized phone set, said recognizer thus performing said speech recognition procedure without utilizing any type of tone data to thereby output said one or more recognized words as a final speech recognition result; and a processor configured to control said recognizer to thereby perform said speech recognition procedure. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 41)
-
-
20. A method for performing a speech recognition procedure with an electronic device, comprising the steps of:
-
configuring a recognizer to compare input speech data to phone strings from a vocabulary dictionary to thereby generate and output one or more recognized words from said vocabulary dictionary, said vocabulary dictionary being implemented according to an optimized phone set, each of said phone strings being implemented as a sequence of phonemes that are serially configured, said optimized phone set being implemented in a compact manner by utilizing an phonemic and allophonic variation technique that maps different pronunciations of said input speech data to a respective one of said phone strings, said vocabulary dictionary being implemented by utilizing one or more dictionary optimization techniques, said optimized phone set representing sounds of a Mandarin Chinese language without utilizing corresponding tonal information as part of different phones in said optimized phone set, said recognizer thus performing said speech recognition procedure without utilizing any type of tone data to thereby output said one or more recognized words as a final speech recognition result; and controlling said recognizer with a processor to thereby perform said speech recognition procedure. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38)
-
-
39. A computer-readable medium encoded with a computer program for performing a speech recognition, by performing the steps of:
-
configuring a recognizer to compare input speech data to phone strings from a vocabulary dictionary to thereby generate and output one or more recognized words from said vocabulary dictionary, said vocabulary dictionary being implemented according to an optimized phone set, each of said phone strings being implemented as a sequence of phonemes that are serially configured, said optimized phone set being implemented in a compact manner by utilizing a phonemic and allophonic variation technique that maps different pronunciations of said input speech data to a respective one of said phone strings, said vocabulary dictionary being implemented by utilizing one or more dictionary optimization techniques, said optimized phone set representing sounds of a Mandarin Chinese language without utilizing corresponding tonal information as part of different phones in said optimized phone set, said recognizer thus performing said Cantonese speech recognition procedure without utilizing any type of tone data to thereby output said one or more recognized words as a final speech recognition result; and controlling said recognizer with a processor to thereby perform said speech recognition procedure.
-
-
40. A system for performing a speech recognition procedure with an electronic device, comprising:
-
means for comparing input speech data to phone strings from a vocabulary dictionary to thereby generate and output one or more recognized words from said vocabulary dictionary, said vocabulary dictionary being implemented according to an optimized phone set, each of said phone strings being implemented as a sequence of phonemes that are serially configured, said optimized phone set being implemented in a compact manner by utilizing a phonemic and allophonic variation technique that maps different pronunciations of said input speech data to a respective one of said phone strings, said vocabulary dictionary being implemented by utilizing one or more dictionary optimization techniques, said optimized phone set representing sounds of a Mandarin Chinese language without utilizing corresponding tonal information as part of different phones in said optimized phone set, said means for comparing thus performing said speech recognition procedure without utilizing any type of tone data to thereby output said one or more recognized words as a final speech recognition result; and means for controlling said means for comparing to thereby perform said speech recognition procedure.
-
Specification