Methods for controlling the generation of speech from text representing names and addresses

US 5,732,395 A
Filed: 01/29/1997
Issued: 03/24/1998
Est. Priority Date: 03/19/1993
Status: Expired due to Term

First Claim

Patent Images

1. A method of synthesizing speech from a series of characters representing an address, the series of characters including a plurality of address components including a street address component, each address component including at least one word where a word is any mixture of printable nonblank characters, the method comprising the steps of:

analyzing a first word of a street address component to determine if the first word includes only digits;

if it is determined that the first word includes only digits analyzing a second word in the street address component to determine if the second word includes only alphabetic characters, or is a digit string followed by at least one letter;

if it is determined that the first word includes only digits, and the second word includes only alphabetic characters, inserting, between the first and second words, a prosodic boundary including a pause having a first duration;

if it is determined that the first word includes only digits, and the second word includes digits followed by at least one letter, inserting, between the first and second words, a prosodic boundary including a pause having a second duration that is longer than the first duration; and

generating speech from the series of characters representing the address and any inserted prosodic boundaries.

View all claims

6 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Improved automated synthesis of human audible speech from text is disclosed. Performance enhancement of the underlying text comprehensibility is obtained through prosodic treatment of the synthesized material, improved speaking rate treatment, and improved methods of spelling words or terms for the system user. Prosodic shaping of text sequences appropriate for the discourse in large groupings of text segments, with prosodic boundaries developed to indicate conceptual units within the text groupings, is implemented in a preferred embodiment.

277 Citations

12 Claims

1. A method of synthesizing speech from a series of characters representing an address, the series of characters including a plurality of address components including a street address component, each address component including at least one word where a word is any mixture of printable nonblank characters, the method comprising the steps of:
- analyzing a first word of a street address component to determine if the first word includes only digits;
  
  if it is determined that the first word includes only digits analyzing a second word in the street address component to determine if the second word includes only alphabetic characters, or is a digit string followed by at least one letter;
  
  if it is determined that the first word includes only digits, and the second word includes only alphabetic characters, inserting, between the first and second words, a prosodic boundary including a pause having a first duration;
  
  if it is determined that the first word includes only digits, and the second word includes digits followed by at least one letter, inserting, between the first and second words, a prosodic boundary including a pause having a second duration that is longer than the first duration; and
  
  generating speech from the series of characters representing the address and any inserted prosodic boundaries.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
- - 2. The method of claim 1, wherein the plurality of address components further includes a post office box component which includes a number, the method further comprising the step of:
    - generating speech from the post office box component by performing the steps of;
      
      i. synthesizing audible speech corresponding to the phrase "post office box" with the most stress within said phrase being assigned to the word post, the least stress within the phrase being assigned to the word office and an intermediate amount of stress to the word box, andii. synthesizing audible corresponding to the number included in the post office box component.
  - 3. The method of claim 2, further comprising the step of;
    - deaccenting the word street if it is included as the last word of the street address component.
  - 4. The method of claim 3, wherein the series of characters further includes characters representing a name, the method further comprising the step of:
    - inserting a pause between the characters representing the name and the characters representing the address.
  - 5. The method of claim 4, further comprising the step of:
    - determining the duration of the pause inserted between the characters representing the name and the characters representing the address as a function of the complexity of the represented name.
  - 6. The method of claim 4, wherein the characters representing the name include multiple words, the method further comprising the step of:
    - determining the duration of the pause inserted between the characters representing the name and the characters representing the address as a function of the number of words included in the characters representing the name.
  - 7. The method of claim 6, wherein the duration of the pause inserted between the characters representing the name and the characters representing the address, is longer for names including multiple words than it is for names including fewer words.
  - 8. The method of claim 7, wherein the plurality of address components further includes a zip code component, the method further comprising the step of:
    - treating the zip code component as a single word declarative sentence when generating speech therefrom.
  - 9. The method of claim 2, wherein the plurality of address components further includes a zip code component, the method further comprising the step of:
    - treating the zip code component as a single word declarative sentence when generating speech therefrom.

10. A method of synthesizing speech from a first series of characters representing a name and a second series of characters representing an address, the series of characters representing the name and the series of characters representing the address each including at least one word where a word is any mixture of alphanumeric nonblank characters, the method comprising the steps of:
- determining, as a function of the complexity of the represented name, the length of a pause to be inserted between the series of characters representing the name and the series of characters representing the address;
  
  inserting a pause of the determined length between the series of characters representing the name and the series of characters representing the address; and
  
  generating speech from the series of characters representing the name and the address as a function of the inserted pause.
- View Dependent Claims (11, 12)
- - 11. The method of claim 10,wherein the series of characters representing the name includes a plurality of words;
    - andwherein the complexity of the represented name is a function of the number of words included in the series of characters representing the name.
  - 12. The method of claim 11,wherein the duration of the inserted pause is determined to be longer for names represented using several words as compared to names represented using fewer words.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Google LLC (Alphabet Inc.)
Original Assignee
Nynex Science & Technology
Inventors
Alexander Silverman, Kim Ernest
Primary Examiner(s)
Hafiz, Tariq R.

Application Number

US08/790,581
Time in Patent Office

419 Days
Field of Search

395/2.76, 395/2.69, 395/2.77, 395/2.65, 395/2.4, 395/2.67, 704/267, 704/260, 704/268, 704/256, 704/231, 704/258
US Class Current

704/260
CPC Class Codes

G10L 13/04   Details of speech synthesis...

G10L 13/08   Text analysis or generation...

G10L 13/10   Prosody rules derived from ...

Methods for controlling the generation of speech from text representing names and addresses

First Claim

6 Assignments

0 Petitions

Accused Products

Abstract

277 Citations

12 Claims

Specification

Solutions

Use Cases

Quick Links

Methods for controlling the generation of speech from text representing names and addresses

First Claim

6 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

277 Citations

12 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links