Text to speech

US 20030144842A1
Filed: 01/29/2002
Published: 07/31/2003
Est. Priority Date: 01/29/2002
Status: Active Grant

Abstract

A preferred embodiment of a method for converting text to speech using a computing device having a memory is disclosed. Text, made up of a plurality of words, is received into the memory of the computing device. A plurality of phonemes is derived from the text. Each of the phonemes is associated with a prosody record based on a database of prosody records associated with a plurality of words. A first set of artificial intelligence rules is applied to determine context information associated with the text. The context influenced prosody changes for each of the phonemes are determined. Then a second set of rules, based on Lessac theory, is applied to determine Lessac derived prosody changes for each of the phonemes. The prosody record for each of the phonemes is amended in response to the context influenced prosody changes and the Lessac derived prosody changes. Sound information associated with the phonemes is then read from the memory. The sound information is amended, based on the prosody record as amended in response to the context influenced prosody changes and the Lessac derived prosody changes, to generate amended sound information for each of the phonemes. Then the sound information is output to generate a speech signal.

20 Claims

1. A method for converting text to speech using a computing device having memory, comprising:
- (a) receiving text into said memory of said computing device;
  
  (b) applying a set of the lexical parsing rules to parse said text into a plurality of components;
  
  (c) associating pronunciation, and meaning information with said components;
  
  (d) applying a set of phrase parsing rules to generate marked up text;
  
  (e) phonetically parsing said marked up text using phonetic parsing rules;
  
  (f) parsing said marked up text using Lessac expressive parsing rules; and
  
  (g) storing a plurality of sounds in memory, each of said sounds being associated with said pronunciation information; and
  
  (h) recalling the sounds associated with said text to generate a raw speech signal from said marked up text after said parsing using phonetic and expressive parsing rules.
- View Dependent Claims (2)
- - 2. A method as in claim 1, further comprising:
    - (h) filtering said raw speech signal to generate an output speech signal.
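
The order of steps in claims 1 and 2 can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the toy `LEXICON` and `SOUNDS` tables, the punctuation-based phrase rule, and the single "lengthen the final phoneme" rule standing in for the Lessac expressive parsing rules, which the claims name but do not spell out. The moving-average `smooth` function is likewise only one possible reading of "filtering".

```python
import re

# Hypothetical toy lexicon: word -> (pronunciation, meaning tag). Not from the patent.
LEXICON = {
    "hello": (["HH", "AH", "L", "OW"], "greeting"),
    "world": (["W", "ER", "L", "D"], "noun"),
}

# Step (g): hypothetical sound store, phoneme -> two placeholder samples.
SOUNDS = {p: [float(i + 1)] * 2 for i, p in enumerate(
    ["HH", "AH", "L", "OW", "W", "ER", "D"])}


def lexical_parse(text):
    """Step (b): split text into word and punctuation components."""
    return re.findall(r"[A-Za-z']+|[.,!?;]", text)


def associate(components):
    """Step (c): attach pronunciation and meaning information."""
    items = []
    for comp in components:
        pron, meaning = LEXICON.get(comp.lower(), ([], "unknown"))
        items.append({"text": comp, "pron": pron, "meaning": meaning})
    return items


def phrase_parse(items):
    """Step (d): mark phrase boundaries at punctuation (placeholder rule)."""
    for item in items:
        item["boundary"] = item["text"] in ".,!?;"
    return items


def phonetic_parse(items):
    """Step (e): flatten the marked-up items into a phoneme sequence."""
    return [{"phoneme": p, "stress": 1.0} for it in items for p in it["pron"]]


def expressive_parse(phonemes):
    """Step (f): stand-in expressive rule that emphasises the final phoneme."""
    if phonemes:
        phonemes[-1]["stress"] = 1.5
    return phonemes


def synthesize(text):
    """Step (h): recall stored sounds and concatenate a raw signal."""
    marked = expressive_parse(
        phonetic_parse(phrase_parse(associate(lexical_parse(text)))))
    raw = []
    for ph in marked:
        raw.extend(s * ph["stress"] for s in SOUNDS[ph["phoneme"]])
    return raw


def smooth(signal):
    """Claim 2: a two-point moving average as an example output filter."""
    return [(a + b) / 2 for a, b in zip(signal, signal[1:])] or list(signal)
```

Calling `smooth(synthesize("Hello, world!"))` runs the whole chain; a real system would replace each placeholder rule set with the rule sets the specification describes.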

3. A method for converting text to speech using a computing device having a memory, comprising:
- (a) receiving a text comprising a plurality of words into said memory of said computing device;
  
  (b) deriving a plurality of phonemes from said text;
  
  (c) associating with each of said phonemes a prosody record based on a database of prosody records associated with a plurality of words;
  
  (d) applying a first set of the artificial intelligence rules to determine context information associated with said text;
  
  (e) for each of said phonemes:
  
  (i) determining context influenced prosody changes;
  
  (ii) applying a second set of rules based on Lessac theory to determine Lessac derived prosody changes;
  
  (iii) amending the prosody record in response to said context influenced prosody changes and said Lessac derived prosody changes;
  
  (iv) reading from said memory sound information associated with said phonemes;
  
  (v) amending said sound information based on the prosody record as amended in response to said context influenced prosody changes and said Lessac derived prosody changes to generate amended sound information; and
  
  (f) outputting said sound information to generate a speech signal.
- View Dependent Claims (4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
- - 4. A method for converting text to speech as in claim 3, wherein the prosody of said speech signal is varied whereby increased realism is achieved in said speech signal.
  - 5. A method for converting text to speech as in claim 3, wherein the prosody of said speech signal is varied in a manner which is random or which appears to be random, whereby increased realism is achieved in said speech signal.
  - 6. A method for converting text to speech as in claim 3, wherein said sound information is associated with different speakers, and a set of artificial intelligence rules are used to determine the identity of the speaker associated with the sound information that is to be output.
  - 7. A method of converting text to speech as in claim 3, wherein said amending of the prosody record in response to said context influenced prosody changes is based on the words in said text and their sequence.
  - 8. A method of converting text to speech as in claim 3, wherein said amending of the prosody record in response to said context influenced prosody changes is based on the emotional context of words in said text.
  - 9. A method for converting text to speech as in claim 8, wherein the prosody of said speech signal is varied whereby increased realism is achieved in said speech signal.
  - 10. A method for converting text to speech as in claim 9, wherein the prosody of said speech signal is varied in a manner which is random or which appears to be random, whereby increased realism is achieved in said speech signal.
  - 11. A method for converting text to speech as in claim 10, wherein said sound information is associated with different speakers, and a set of artificial intelligence rules are used to determine the identity of the speaker associated with the sound information that is to be output.
  - 12. A method of converting text to speech as in claim 11, wherein said amending of the prosody record in response to said context influenced prosody changes is based on the words in said text and their sequence.
  - 13. A method as in claim 12, further comprising filtering said speech signal to obtain a filtered amended sound information signal, said filtered amended sound information signal being output to generate a speech signal.
  - 14. A method as in claim 13, wherein said filtering of said amended sound information comprises introducing echo.
  - 15. A method as in claim 13, wherein said filtering of said speech signal comprises passing said amended sound information through an analog or digital resonant circuit wherein the resonance characteristics keyed to vowel information.
  - 16. A method as in claim 13, wherein said filtering of said speech signal comprises damping said amended sound information.
  - 17. A method as in claim 12, further comprising filtering said speech signal by introducing echo, passing said amended sound information through an analog or digital resonant circuit wherein the resonance characteristics keyed to vowel information, and damping said amended sound information.
  - 18. A method as in claim 3, further comprising filtering said speech signal by introducing echo, passing said amended sound information through an analog or digital resonant circuit wherein the resonance characteristics keyed to vowel information, and damping said amended sound information.
  - 19. A method as in claim 3, further comprising adding background sound logically consistent with the context of said text in response to artificial intelligence rules operating on said text and/or in response to a human input.
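
Steps (e)(iii) and (e)(v) of claim 3, together with the echo and damping filters of claims 14 and 16, might be sketched as follows. The `ProsodyRecord` fields, the choice of multiplicative changes, and the sample-repetition approach to duration are all assumptions; the claims do not enumerate what a prosody record contains or how it is applied to sound information. Pitch is carried in the record but not applied in this toy example.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class ProsodyRecord:
    # Field names are assumptions; the claims do not list them.
    pitch: float = 1.0      # relative pitch multiplier (unused below)
    duration: float = 1.0   # relative duration multiplier
    amplitude: float = 1.0  # relative loudness multiplier


def amend(record, context_change, lessac_change):
    """Step (e)(iii): fold both sets of multiplicative changes into the record."""
    merged = {}
    for name in ("pitch", "duration", "amplitude"):
        merged[name] = (getattr(record, name)
                        * context_change.get(name, 1.0)
                        * lessac_change.get(name, 1.0))
    return replace(record, **merged)


def apply_prosody(samples, record):
    """Step (e)(v): scale loudness and stretch duration by repeating samples."""
    repeat = max(1, round(record.duration))
    return [s * record.amplitude for s in samples for _ in range(repeat)]


def add_echo(signal, delay, gain):
    """Claim 14: mix in a delayed, attenuated copy of the signal."""
    out = list(signal) + [0.0] * delay
    for i, s in enumerate(signal):
        out[i + delay] += gain * s
    return out


def damp(signal, decay):
    """Claim 16: apply an exponentially decaying envelope."""
    return [s * (decay ** i) for i, s in enumerate(signal)]
```

Keeping the record immutable and returning a new one from `amend` makes it easy to compare the stored prosody with the amended prosody for the same phoneme; that is a design choice of this sketch, not something the claims require.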

20. A method for converting text to speech using a computing device having a memory, comprising:
- (a) receiving a text comprising a plurality of words into said memory of said computing device;
  
  (b) deriving a plurality of phonemes from said text;
  
  (c) associating with each of said phonemes a prosody record based on a database of prosody records associated with a plurality of words;
  
  (d) applying a first set of the artificial intelligence rules to determine context information associated with said text;
  
  (e) determining prosody changes for each of said phonemes to generate determined prosody changes;
  
  (f) reading from said memory sound information associated with said phonemes;
  
  (g) amending said sound information based on the prosody record as amended in response to said determined prosody changes;
  
  (h) varying said determined prosody changes in said speech signal in a manner which is random or which appears to be random, whereby increased realism is achieved in output speech; and
  
  (i) outputting said sound information to generate a speech signal.
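
Step (h) of claim 20 calls for variation that "is random or which appears to be random". One way to read that, offered only as an assumption, is a bounded jitter drawn from a pseudo-random generator: a seeded generator gives variation that appears random yet is reproducible. The function name `vary`, the `spread` parameter, and the dictionary representation of prosody changes are hypothetical.

```python
import random


def vary(changes, spread=0.05, rng=None):
    """Jitter each prosody change by a relative amount within +/- spread."""
    if rng is None:
        rng = random.Random()
    return {name: value * (1.0 + rng.uniform(-spread, spread))
            for name, value in changes.items()}
```

Passing `rng=random.Random(seed)` with a fixed seed reproduces the same variation on every run, which can help when comparing output speech across changes to other rules.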

Current Assignee
Lessac Technologies Incorporated
Original Assignee
Lessac Technologies Incorporated
Inventors
Krebs, Nancy; Wilson, H. Donald; Marple, Gary; Handal, Anthony H.; Addison, Edwin R.

Granted Patent

US 6,847,931 B2
US Class Current

704/260
CPC Class Codes

G10L 13/10 Prosody rules derived from ...
