Segmental tonal modeling for tonal languages

US 20050159954A1
Filed: 01/21/2004
Published: 07/21/2005
Est. Priority Date: 01/21/2004
Status: Active Grant

First Claim

Patent Images

1. A speech processing system adapted to receive an input related to one of speech and text and process the input to provide an output related to one of speech and text, the speech processing system accessing a module derived from a phone set having a plurality of phones for a tonal language, the phones being used to model syllables used in the module, the syllables having an initial and final part, wherein the final part comprises a plurality of phones that jointly and implicitly carry the tonal information.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A phone set for use in speech processing such as speech recognition or text-to-speech conversion is used to model or form syllables of a tonal language having a plurality of different tones. Each syllable includes an initial part that can be glide dependent and a final part. The final part includes a plurality of phones. Each phones carries partial tonal information such that the phones taken together implicitly and jointly represent the different tones.

25 Citations

View as Search Results

29 Claims

1. A speech processing system adapted to receive an input related to one of speech and text and process the input to provide an output related to one of speech and text, the speech processing system accessing a module derived from a phone set having a plurality of phones for a tonal language, the phones being used to model syllables used in the module, the syllables having an initial and final part, wherein the final part comprises a plurality of phones that jointly and implicitly carry the tonal information.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
- - 2. The speech processing system of claim 1 wherein each phone of the final part includes information about the tone.
  - 3. The speech processing system of claim 1 wherein the tonal language comprises a plurality of different tones with different levels of pitch.
  - 4. The speech processing system of claim 3 wherein the different levels of pitch comprise two categorical levels, and wherein each phone has a categorical level associated with it.
  - 5. The speech processing system of claim 3 wherein the different levels of pitch comprise three categorical levels, and wherein each phone has a categorical level associated with it.
  - 6. The speech processing system of claim 3 wherein the different levels of pitch comprise five categorical levels, and wherein each phone has a categorical level associated with it.
  - 7. The speech processing system of claim 1 wherein each syllable comprises the same form having the initial and the final, the final having two phones carrying partial tonal information each.
  - 8. The speech processing system of claim 1 wherein at least some of the syllables of the tonal language include a glide, the glide being embodied in the initial.
  - 9. The speech processing system of claim 1 wherein the speech processing system comprises one of a speech recognition system and a text-to-speech converter.
  - 10. The speech processing system of claim 9 wherein each phone of the final part includes partial information about the tone.
  - 11. The speech processing system of claim 9 wherein the tonal language comprises a plurality of different tones with different levels of pitch.
  - 12. The speech processing system of claim 11 wherein the different levels of pitch comprise two categorical levels, and wherein each phone has a categorical level associated with it.
  - 13. The speech processing system of claim 11 wherein the different levels of pitch comprise three categorical levels, and wherein each phone has a categorical level associated with it.
  - 14. The speech processing system of claim 11 wherein the different levels of pitch comprise five categorical levels, and wherein each phone has a categorical level associated with it.
  - 15. The speech processing system of claim 9 wherein each syllable comprises the same form having the initial and the final, the final having two phones carrying partial tonal information.
  - 16. The speech processing system of claim 9 wherein at least some of the syllables of the tonal language include a glide, the glide being embodied in the initial.
  - 17. The speech processing system of claim 16 wherein the tonal language comprises Chinese or a dialect thereof, such as Cantonese.
  - 18. The speech processing system of claim 16 wherein the tonal language comprises Thai or a tonal dialect thereof.
  - 19. The speech processing system of claim 16 wherein the tonal language comprises Vietnamese or a tonal dialect thereof.

20. A speech processing system adapted to receive an input related to one of speech and text and process the input to perform one of speech recognition and text-to-speech conversion in order to provide an output related to one of speech and text, the speech processing system accessing a module derived from a phone set having a plurality of phones for a tonal language comprising a plurality of different tones with different levels of pitch, the phones being used to model syllables used in the module, at least some of the syllables having an initial and final part, wherein a first set of the plurality of phones are used to describe glide dependent initials, and a second set of the plurality of phones are used to describe the final part, wherein the final part comprises a plurality of phones, each phone including partial tonal information.
- View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28)
- - 21. The speech processing system of claim 20 wherein the different levels of pitch comprise two categorical levels, and wherein each phone has a categorical level associated with it.
  - 22. The speech processing system of claim 20 wherein the different levels of pitch comprise three categorical levels, and wherein each phone has a categorical level associated with it.
  - 23. The speech processing system of claim 20 wherein the different levels of pitch comprise five categorical levels, and wherein each phone has a categorical level associated with it.
  - 24. The speech processing system of claim 20 wherein at least one syllable comprises only the final having two phones carrying partial tonal information each.
  - 25. The speech processing system of claim 20 wherein each syllable comprises the same form having the initial and the final, the final having two phones carrying partial tonal information each.
  - 26. The speech processing system of claim 20 wherein the tonal language comprises Chinese or a dialect thereof such as Cantonese.
  - 27. The speech processing system of claim 20 wherein the tonal language comprises Thai or a tonal dialect thereof.
  - 28. The speech processing system of claim 20 wherein the tonal language comprises Vietnamese or a tonal dialect thereof.

29. A method of speech processing comprising:
- accessing a module having a phone set comprising a plurality of phones for a tonal language, the phones being used to model syllables, the syllables having an initial and final part, wherein the final part comprises a plurality of phones that jointly and implicitly carry the tonal information;
  
  utilizing the phone set to identify syllables corresponding to the input for performing one of speech recognition and text-to-speech conversion; and
  
  providing an output corresponding to one of speech recognition and text-to-speech conversion.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Microsoft Technology Licensing LLC (Microsoft Corporation)
Original Assignee
Microsoft Corporation
Inventors
Huang, Chao, Chu, Min

Granted Patent

US 7,684,987 B2
Time in Patent Office

Days
Field of Search
US Class Current

704/254
CPC Class Codes

G10L 15/02   Feature extraction for spee...

G10L 2015/027   Syllables being the recogni...

G10L 25/15   the extracted parameters be...

Segmental tonal modeling for tonal languages

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

25 Citations

29 Claims

Specification

Use Cases

Quick Links

Others

Segmental tonal modeling for tonal languages

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

25 Citations

29 Claims

Specification

Subscription Required

Use Cases

Quick Links

Others