Segmental tonal modeling for tonal languages
First Claim
1. A speech processing system adapted to receive an input related to one of speech and text and process the input to provide an output related to one of speech and text, the speech processing system accessing a module derived from a phone set having a plurality of phones for a tonal language, the phones being used to model syllables used in the module, the syllables having an initial and final part, wherein the final part comprises a plurality of phones that jointly and implicitly carry the tonal information.
2 Assignments
0 Petitions
Accused Products
Abstract
A phone set for use in speech processing such as speech recognition or text-to-speech conversion is used to model or form syllables of a tonal language having a plurality of different tones. Each syllable includes an initial part that can be glide dependent and a final part. The final part includes a plurality of phones. Each phones carries partial tonal information such that the phones taken together implicitly and jointly represent the different tones.
25 Citations
29 Claims
- 1. A speech processing system adapted to receive an input related to one of speech and text and process the input to provide an output related to one of speech and text, the speech processing system accessing a module derived from a phone set having a plurality of phones for a tonal language, the phones being used to model syllables used in the module, the syllables having an initial and final part, wherein the final part comprises a plurality of phones that jointly and implicitly carry the tonal information.
- 20. A speech processing system adapted to receive an input related to one of speech and text and process the input to perform one of speech recognition and text-to-speech conversion in order to provide an output related to one of speech and text, the speech processing system accessing a module derived from a phone set having a plurality of phones for a tonal language comprising a plurality of different tones with different levels of pitch, the phones being used to model syllables used in the module, at least some of the syllables having an initial and final part, wherein a first set of the plurality of phones are used to describe glide dependent initials, and a second set of the plurality of phones are used to describe the final part, wherein the final part comprises a plurality of phones, each phone including partial tonal information.
-
29. A method of speech processing comprising:
-
accessing a module having a phone set comprising a plurality of phones for a tonal language, the phones being used to model syllables, the syllables having an initial and final part, wherein the final part comprises a plurality of phones that jointly and implicitly carry the tonal information;
utilizing the phone set to identify syllables corresponding to the input for performing one of speech recognition and text-to-speech conversion; and
providing an output corresponding to one of speech recognition and text-to-speech conversion.
-
Specification