System and method for distributed text-to-speech synthesis and intelligibility

US 9,761,219 B2
Filed: 04/21/2009
Issued: 09/12/2017
Est. Priority Date: 04/21/2009
Status: Active Grant

First Claim

Patent Images

1. A system for distributed text-to-speech synthesis comprising:

a guest device configured for transmitting text input in the form of a text string;

a host device configured to receive the text string and process the text string by converting the text string to an audio index representation of an audio file associated with the text string, the host device comprising;

a text analyzer configurable to process the text string to produce phonetic information and linguistic information;

a prosody analyzer configurable to generate prosodic information based on at least the phonetic information and linguistic information,wherein the converting at the host device being based on at least the phonetic information and prosodic information, and includes identifying audio units from a first audio unit synthesis inventory on the host device,wherein the guest device comprises;

a second audio unit synthesis inventory where audio units are selected from and selection of audio units from the second audio unit synthesis inventory being based on the audio index representation sent from the host device; and

a unit-concatenative module for concatenating the selected audio units.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method and system for distributed text-to-speech synthesis and intelligibility, and more particularly to distributed text-to-speech synthesis on handheld portable computing devices that can be used for example to generate intelligible audio prompts that help a user interact with a user interface of the handheld portable computing device. The text-to-speech distributed system 70 receives a text string from the guest devices and comprises a text analyzer 72, a prosody analyzer 74, a database 14 that the text analyzer and prosody analyzer refer to, and a speech synthesizer 80. Elements of the speech synthesizer 80 are resident on the host device and the guest device and an audio index representation of the audio file associated with the text string is produced at the host device and transmitted to the guest device for producing the audio file at the guest device.

Citations

3 Claims

1. A system for distributed text-to-speech synthesis comprising:
- a guest device configured for transmitting text input in the form of a text string;
  
  a host device configured to receive the text string and process the text string by converting the text string to an audio index representation of an audio file associated with the text string, the host device comprising;
  
  a text analyzer configurable to process the text string to produce phonetic information and linguistic information;
  
  a prosody analyzer configurable to generate prosodic information based on at least the phonetic information and linguistic information,wherein the converting at the host device being based on at least the phonetic information and prosodic information, and includes identifying audio units from a first audio unit synthesis inventory on the host device,wherein the guest device comprises;
  
  a second audio unit synthesis inventory where audio units are selected from and selection of audio units from the second audio unit synthesis inventory being based on the audio index representation sent from the host device; and
  
  a unit-concatenative module for concatenating the selected audio units.
- View Dependent Claims (2, 3)
- - 2. The system as recited in claim 1 wherein the host device and the guest device are in communication with each other, the host device adapted to receive a text input in a form of text string from either the guest device or any other source;
    - the host device having a unit-selection module configured to create an audio index representation of an audio file from the text string on the host device and to convert the text string to an audio index representation of an audio file associated with the text string at a text-to-speech synthesizer, the unit-selection module being arranged to identify audio units from the first audio unit synthesis inventory, the identified audio units forming the audio file, the identified audio units being represented by the audio index representation.
  - 3. The system of claim 1 wherein the guest device is a portable handheld device.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Creative Technology Ltd.
Original Assignee
Creative Technology Ltd.
Inventors
Xu, Jun, Lee, Teck Chee
Primary Examiner(s)
Sharma, Neeraj

Application Number

US12/427,526
Publication Number

US 20100268539A1
Time in Patent Office

3,066 Days
Field of Search

704260, 704267, 704233, 704251, 704 3, 704 9, 704258, 704257, 704277, 704263, 715201, 382114, 345698, 600300
US Class Current
CPC Class Codes

G10L 13/04   Details of speech synthesis...

G10L 13/07   Concatenation rules

G10L 13/08   Text analysis or generation...

System and method for distributed text-to-speech synthesis and intelligibility

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

Citations

3 Claims

Specification

Solutions

Use Cases

Quick Links

System and method for distributed text-to-speech synthesis and intelligibility

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

3 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links