System and method for Cantonese speech recognition using an optimized phone set
Abstract
The present invention comprises a system and method for implementing a Cantonese speech recognizer with an optimized phone set, and may include a recognizer configured to compare input speech data to phone strings from a vocabulary dictionary that is implemented according to an optimized Cantonese phone set. The optimized Cantonese phone set may be implemented with a phonetic technique to separately include consonantal phones and vocalic phones. For reasons of system efficiency, the optimized Cantonese phone set may preferably be implemented in a compact manner to include only a minimum required number of consonantal phones and vocalic phones to accurately represent Cantonese speech during the speech recognition procedure.
43 Claims
1. A system for performing a Cantonese speech recognition procedure with an electronic device, comprising:
a recognizer configured to compare input speech data to phone strings from a vocabulary dictionary to thereby generate and output one or more recognized words from said vocabulary dictionary, said vocabulary dictionary being implemented according to an optimized phone set, said optimized phone set being implemented with a phonetic technique to separately provide consonantal phones and vocalic phones, one or more of said phone strings including more than two phones from said consonantal phones and said vocalic phones, said optimized phone set being implemented in a compact manner to include only a minimum required number of said consonantal phones and said vocalic phones, said optimized phone set representing sounds of a Cantonese language without utilizing corresponding tonal information as part of different phones in said optimized phone set, said recognizer thus performing said Cantonese speech recognition procedure without utilizing any type of tone data to thereby output said one or more recognized words as a final speech recognition result; and
a processor configured to control said recognizer to thereby perform said Cantonese speech recognition procedure.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 43
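As an illustration only, and not the patentee's implementation, the arrangement recited in claim 1 can be sketched in Python. The sketch assumes a Jyutping-style romanization with trailing tone digits as the raw dictionary input, a toy phone inventory (the claims do not list the actual optimized phone set), and a plain edit-distance comparison standing in for the acoustic matching a real recognizer would perform. All function and variable names are hypothetical.

```python
import re

# Hypothetical toy inventory: consonantal and vocalic phones are kept separate.
# None of these phones carries tone information.
CONSONANTS = ["gw", "kw", "ng", "b", "d", "g", "p", "t", "k",
              "m", "n", "l", "f", "s", "h", "w", "j", "z", "c"]
VOWELS = ["aa", "oe", "eo", "yu", "a", "e", "i", "o", "u"]
# Longest phones first so that greedy matching prefers "ng" over "n".
INVENTORY = sorted(CONSONANTS + VOWELS, key=len, reverse=True)


def strip_tone(syllable):
    """Drop a trailing tone digit (1-6), e.g. 'gwong2' becomes 'gwong'."""
    return re.sub(r"[1-6]$", "", syllable)


def to_phones(syllable):
    """Split one toneless syllable into phones by greedy longest match."""
    text = strip_tone(syllable)
    phones, i = [], 0
    while i < len(text):
        for phone in INVENTORY:
            if text.startswith(phone, i):
                phones.append(phone)
                i += len(phone)
                break
        else:
            raise ValueError(f"no phone matches {text[i:]!r}")
    return phones


def build_dictionary(entries):
    """Map each word to a toneless phone string (a tuple of phones)."""
    return {word: tuple(p for syl in syllables for p in to_phones(syl))
            for word, syllables in entries.items()}


def edit_distance(a, b):
    """Levenshtein distance between two phone sequences."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        cur = [i]
        for j, y in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (x != y)))
        prev = cur
    return prev[-1]


def recognize(input_phones, dictionary):
    """Return the dictionary word whose phone string is closest to the input."""
    return min(dictionary, key=lambda w: edit_distance(input_phones, dictionary[w]))


dictionary = build_dictionary({
    "gwongdungwaa": ["gwong2", "dung1", "waa2"],
    "si": ["si1"],
})
# One phone is misheard ("n" for "ng"); the closest entry is still selected.
result = recognize(["gw", "o", "ng", "d", "u", "n", "w", "aa"], dictionary)
```

Because tone digits are stripped before segmentation, syllables that differ only in tone (for example `si1` and `si6`) map to the same phone string, so the inventory needs no tone-specific variants of each phone.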
16. A system for performing a Cantonese speech recognition procedure with an electronic device, comprising:
a recognizer configured to compare input speech data to phone strings from a vocabulary dictionary to thereby generate and output one or more recognized words from said vocabulary dictionary, said vocabulary dictionary being implemented according to an optimized phone set, said optimized phone set being implemented with a phonetic technique to separately provide consonantal phones and vocalic phones, one or more of said phone strings including more than two phones from said consonantal phones and said vocalic phones, said optimized phone set being implemented in a compact manner to include only a minimum required number of said consonantal phones and said vocalic phones, said optimized phone set representing sounds of a Cantonese language without utilizing corresponding tonal information as part of different phones in said optimized phone set, said input speech data including a syllable-initial context in which a stop is located at a beginning of a syllable, said optimized phone set responsively utilizing an appropriate consonant phone “p”, “t”, or “k” in said syllable-initial context to represent a corresponding consonant and a preceding closure; and
a processor configured to control said recognizer to thereby perform said Cantonese speech recognition procedure.
17. A system for performing a Cantonese speech recognition procedure with an electronic device, comprising:
a recognizer configured to compare input speech data to phone strings from a vocabulary dictionary to thereby generate and output one or more recognized words from said vocabulary dictionary, said vocabulary dictionary being implemented according to an optimized phone set, said optimized phone set being implemented with a phonetic technique to separately provide consonantal phones and vocalic phones, one or more of said phone strings including more than two phones from said consonantal phones and said vocalic phones, said optimized phone set being implemented in a compact manner to include only a minimum required number of said consonantal phones and said vocalic phones, said optimized phone set representing sounds of a Cantonese language without utilizing corresponding tonal information as part of different phones in said optimized phone set, said input speech data including a syllable-final/midphrase context in which a stop is located at an end of a word in a middle of a phrase, said optimized phone set responsively utilizing an appropriate consonant phone “p”, “t”, or “k” in said syllable-final/midphrase context to represent a corresponding consonant and a preceding closure; and
a processor configured to control said recognizer to thereby perform said Cantonese speech recognition procedure.
18. A system for performing a Cantonese speech recognition procedure with an electronic device, comprising:
a recognizer configured to compare input speech data to phone strings from a vocabulary dictionary to thereby generate and output one or more recognized words from said vocabulary dictionary, said vocabulary dictionary being implemented according to an optimized phone set, said optimized phone set being implemented with a phonetic technique to separately provide consonantal phones and vocalic phones, one or more of said phone strings including more than two phones from said consonantal phones and said vocalic phones, said optimized phone set being implemented in a compact manner to include only a minimum required number of said consonantal phones and said vocalic phones, said optimized phone set representing sounds of a Cantonese language without utilizing corresponding tonal information as part of different phones in said optimized phone set, said input speech data including a syllable-final/phrase-end context in which a stop is located at an end of a word at an end of a phrase, said optimized phone set responsively utilizing a same identical closure phone “cl” in said syllable-final/phrase-end context to represent either “p”, “t”, or “k” consonants as a closure only without any subsequent releasing consonant sound; and
a processor configured to control said recognizer to thereby perform said Cantonese speech recognition procedure.
19. A system for performing a Cantonese speech recognition procedure with an electronic device, comprising:
a recognizer configured to compare input speech data to phone strings from a vocabulary dictionary to thereby generate and output one or more recognized words from said vocabulary dictionary, said vocabulary dictionary being implemented according to an optimized phone set, said optimized phone set being implemented with a phonetic technique to separately provide consonantal phones and vocalic phones, one or more of said phone strings including more than two phones from said consonantal phones and said vocalic phones, said optimized phone set being implemented in a compact manner to include only a minimum required number of said consonantal phones and said vocalic phones, said optimized phone set representing sounds of a Cantonese language without utilizing corresponding tonal information as part of different phones in said optimized phone set, said input speech data including a syllable-initial context in which a first stop is located at a beginning of a syllable, a syllable-final/midphrase context in which a second stop is located at an end of a first word in a middle of a phrase, and a syllable-final/phrase-end context in which a third stop is located at an end of a second word at an end of said phrase, said optimized phone set utilizing an appropriate consonant phone “b”, “d”, “p”, “t”, or “k” in said syllable-initial context to represent a corresponding consonant and a preceding closure, said optimized phone set responsively utilizing said appropriate consonant phone “p”, “t”, or “k” in said syllable-final/midphrase context to represent said corresponding consonant and said preceding closure, said optimized phone set responsively utilizing a same identical closure phone “cl” in said syllable-final/phrase-end context to represent either “p”, “t”, or “k” as a closure only without any subsequent releasing consonant sound; and
a processor configured to control said recognizer to thereby perform said Cantonese speech recognition procedure.
21. A method for performing a Cantonese speech recognition procedure with an electronic device, comprising the steps of:
configuring a recognizer to compare input speech data to phone strings from a vocabulary dictionary to thereby generate and output one or more recognized words from said vocabulary dictionary, said vocabulary dictionary being implemented according to an optimized phone set, said optimized phone set being implemented with a phonetic technique to separately provide consonantal phones and vocalic phones, one or more of said phone strings including more than two phones from said consonantal phones and said vocalic phones, said optimized phone set being implemented in a compact manner to include only a minimum required number of said consonantal phones and said vocalic phones, said optimized phone set representing sounds of a Cantonese language without utilizing corresponding tonal information as part of different phones in said optimized phone set, said recognizer thus performing said Cantonese speech recognition procedure without utilizing any type of tone data to thereby output said one or more recognized words as a final speech recognition result; and
controlling said recognizer with a processor to thereby perform said Cantonese speech recognition procedure.
Dependent claims: 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 40
36. A method for performing a Cantonese speech recognition procedure with an electronic device, comprising the steps of:
configuring a recognizer to compare input speech data to phone strings from a vocabulary dictionary to thereby generate and output one or more recognized words from said vocabulary dictionary, said vocabulary dictionary being implemented according to an optimized phone set, said optimized phone set being implemented with a phonetic technique to separately provide consonantal phones and vocalic phones, one or more of said phone strings including more than two phones from said consonantal phones and said vocalic phones, said optimized phone set being implemented in a compact manner to include only a minimum required number of said consonantal phones and said vocalic phones, said optimized phone set representing sounds of a Cantonese language without utilizing corresponding tonal information as part of different phones in said optimized phone set, said input speech data including a syllable-initial context in which a stop is located at a beginning of a syllable, said optimized phone set responsively utilizing an appropriate consonant phone “b”, “d”, “g”, “p”, “t”, or “k” in said syllable-initial context to represent a corresponding consonant and a preceding closure; and
controlling said recognizer with a processor to thereby perform said Cantonese speech recognition procedure.
37. A method for performing a Cantonese speech recognition procedure with an electronic device, comprising the steps of:
configuring a recognizer to compare input speech data to phone strings from a vocabulary dictionary to thereby generate and output one or more recognized words from said vocabulary dictionary, said vocabulary dictionary being implemented according to an optimized phone set, said optimized phone set being implemented with a phonetic technique to separately provide consonantal phones and vocalic phones, one or more of said phone strings including more than two phones from said consonantal phones and said vocalic phones, said optimized phone set being implemented in a compact manner to include only a minimum required number of said consonantal phones and said vocalic phones, said optimized phone set representing sounds of a Cantonese language without utilizing corresponding tonal information as part of different phones in said optimized phone set, said input speech data including a syllable-final/midphrase context in which a stop is located at an end of a word in a middle of a phrase, said optimized phone set responsively utilizing an appropriate consonant phone “p”, “t”, or “k” in said syllable-final/midphrase context to represent a corresponding consonant and a preceding closure; and
controlling said recognizer with a processor to thereby perform said Cantonese speech recognition procedure.
38. A method for performing a Cantonese speech recognition procedure with an electronic device, comprising the steps of:
configuring a recognizer to compare input speech data to phone strings from a vocabulary dictionary to thereby generate and output one or more recognized words from said vocabulary dictionary, said vocabulary dictionary being implemented according to an optimized phone set, said optimized phone set being implemented with a phonetic technique to separately provide consonantal phones and vocalic phones, one or more of said phone strings including more than two phones from said consonantal phones and said vocalic phones, said optimized phone set being implemented in a compact manner to include only a minimum required number of said consonantal phones and said vocalic phones, said optimized phone set representing sounds of a Cantonese language without utilizing corresponding tonal information as part of different phones in said optimized phone set, said input speech data including a syllable-final/phrase-end context in which a stop is located at an end of a word at an end of a phrase, said optimized phone set responsively utilizing a same identical closure phone “cl” in said syllable-final/phrase-end context to represent either “p”, “t”, or “k” as a closure only without any subsequent releasing consonant sound; and
controlling said recognizer with a processor to thereby perform said Cantonese speech recognition procedure.
39. A method for performing a Cantonese speech recognition procedure with an electronic device, comprising the steps of:
configuring a recognizer to compare input speech data to phone strings from a vocabulary dictionary to thereby generate and output one or more recognized words from said vocabulary dictionary, said vocabulary dictionary being implemented according to an optimized phone set, said optimized phone set being implemented with a phonetic technique to separately provide consonantal phones and vocalic phones, one or more of said phone strings including more than two phones from said consonantal phones and said vocalic phones, said optimized phone set being implemented in a compact manner to include only a minimum required number of said consonantal phones and said vocalic phones, said optimized phone set representing sounds of a Cantonese language without utilizing corresponding tonal information as part of different phones in said optimized phone set, said input speech data including a syllable-initial context in which a first stop is located at a beginning of a syllable, a syllable-final/midphrase context in which a second stop is located at an end of a first word in a middle of a phrase, and a syllable-final/phrase-end context in which a third stop is located at an end of a second word at an end of said phrase, said optimized phone set utilizing an appropriate consonant phone “b”, “d”, “g”, “p”, “t”, or “k” in said syllable-initial context to represent a corresponding consonant and a preceding closure, said optimized phone set responsively utilizing an appropriate consonant phone “p”, “t”, or “k” in said syllable-final/midphrase context to represent said corresponding consonant and a preceding closure, said optimized phone set responsively utilizing a same identical closure phone “cl” in said syllable-final/phrase-end context to represent either “p”, “t”, or “k” as a closure only without any subsequent releasing consonant sound; and
controlling said recognizer with a processor to thereby perform said Cantonese speech recognition procedure.
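The three stop contexts recited in claim 39 can be summarized as a small lookup, sketched here in Python. This is a hypothetical illustration rather than the patented implementation: the names `StopContext` and `stop_phone` are invented for the example, and the sketch assumes the caller has already determined which context a stop occurs in.

```python
from enum import Enum


class StopContext(Enum):
    SYLLABLE_INITIAL = "syllable-initial"
    FINAL_MIDPHRASE = "syllable-final/midphrase"
    FINAL_PHRASE_END = "syllable-final/phrase-end"


# Stop lists as recited in claim 39.
INITIAL_STOPS = frozenset({"b", "d", "g", "p", "t", "k"})
FINAL_STOPS = frozenset({"p", "t", "k"})
CLOSURE_PHONE = "cl"


def stop_phone(stop, context):
    """Return the phone used to represent a stop consonant in a given context."""
    if context is StopContext.SYLLABLE_INITIAL:
        if stop not in INITIAL_STOPS:
            raise ValueError(f"{stop!r} is not a syllable-initial stop")
        # One phone covers both the closure and the following release.
        return stop
    if stop not in FINAL_STOPS:
        raise ValueError(f"{stop!r} is not a syllable-final stop")
    if context is StopContext.FINAL_MIDPHRASE:
        # Mid-phrase finals keep their own phone (closure plus consonant).
        return stop
    # Phrase-end finals are unreleased, so "p", "t" and "k" share one closure phone.
    return CLOSURE_PHONE


phrase_end_phones = {s: stop_phone(s, StopContext.FINAL_PHRASE_END) for s in "ptk"}
```

Collapsing the three phrase-end stops into the single phone `cl` is one way the inventory stays compact: no separate closure phones are needed for “p”, “t”, and “k” in that position.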
41. A computer-readable medium encoded with a computer program for performing a Cantonese speech recognition procedure, by performing the steps of:
configuring a recognizer to compare input speech data to phone strings from a vocabulary dictionary to thereby generate and output one or more recognized words from said vocabulary dictionary, said vocabulary dictionary being implemented according to an optimized phone set, said optimized phone set being implemented with a phonetic technique to separately provide consonantal phones and vocalic phones, one or more of said phone strings including more than two phones from said consonantal phones and said vocalic phones, said optimized phone set being implemented in a compact manner to include only a minimum required number of said consonantal phones and said vocalic phones, said optimized phone set representing sounds of a Cantonese language without utilizing corresponding tonal information as part of different phones in said optimized phone set, said recognizer thus performing said Cantonese speech recognition procedure without utilizing any type of tone data to thereby output said one or more recognized words as a final speech recognition result; and controlling said recognizer with a processor to thereby perform said Cantonese speech recognition procedure.
42. A system for performing a Cantonese speech recognition procedure with an electronic device, comprising the steps of:
means for comparing input speech data to phone strings from a vocabulary dictionary to thereby generate and output one or more recognized words from said vocabulary dictionary, said vocabulary dictionary being implemented according to an optimized phone set, said optimized phone set being implemented with a phonetic technique to separately provide consonantal phones and vocalic phones, one or more of said phone strings including more than two phones from said consonantal phones and said vocalic phones, said optimized phone set being implemented in a compact manner to include only a minimum required number of said consonantal phones and said vocalic phones, said optimized phone set representing sounds of a Cantonese language without utilizing corresponding tonal information as part of different phones in said optimized phone set, said means for comparing thus performing said Cantonese speech recognition procedure without utilizing any type of tone data to thereby output said one or more recognized words as a final speech recognition result; and
means for controlling said means for comparing to thereby perform said Cantonese speech recognition procedure.
Specification