Middleware layer between speech related applications and engines

US 7,139,709 B2
Filed: 12/29/2000
Issued: 11/21/2006
Est. Priority Date: 07/20/2000
Status: Expired due to Term

First Claim

Patent Images

1. A multi-voice speech synthesis middleware layer of computer-readable instructions embedded on a computer-readable medium, the instructions being configured to, when executed, facilitate communication between one or more applications and a plurality of text-to-speech (TTS) engines, the multi-voice speech synthesis middleware layer comprising:

at least a first voice object having an application interface configured to receive TTS engine attribute information from the application and to instantiate first and second TTS engines based on the TTS attribute information, to receive a speak request requesting at least one of the TTS engines to speak a message, and to receive priority information associated with each speak request indicative of a precedence each speak request is to take;

wherein the first voice object has an engine interface configured to call a specified one of the first and second TTS engines to synthesize input data;

wherein the at least first voice object is configured to receive a normal priority associated with a message and to call the TTS engines so the message with normal priority is spoken in turn; and

wherein the at least first voice object is configured to receive a speakover priority associated with a message and to call the TTS engines so the message with speakover priority is spoken at a same time as other currently speaking messages.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

The present invention provides an application-independent and engine-independent middleware layer between applications and engines. The middleware provides speech-related services to both applications and engines, thereby making it far easier for application vendors and engine vendors to bring their technology to consumers.

44 Citations

View as Search Results

3 Claims

1. A multi-voice speech synthesis middleware layer of computer-readable instructions embedded on a computer-readable medium, the instructions being configured to, when executed, facilitate communication between one or more applications and a plurality of text-to-speech (TTS) engines, the multi-voice speech synthesis middleware layer comprising:
- at least a first voice object having an application interface configured to receive TTS engine attribute information from the application and to instantiate first and second TTS engines based on the TTS attribute information, to receive a speak request requesting at least one of the TTS engines to speak a message, and to receive priority information associated with each speak request indicative of a precedence each speak request is to take;
  
  wherein the first voice object has an engine interface configured to call a specified one of the first and second TTS engines to synthesize input data;
  
  wherein the at least first voice object is configured to receive a normal priority associated with a message and to call the TTS engines so the message with normal priority is spoken in turn; and
  
  wherein the at least first voice object is configured to receive a speakover priority associated with a message and to call the TTS engines so the message with speakover priority is spoken at a same time as other currently speaking messages.
- View Dependent Claims (2)
- - 2. The multi-voice speech synthesis middleware layer of claim 1 wherein the at least first voice object is configured to receive an alert priority associated with a message and to call the TTS engines so the message with alert priority is spoken with precedence over messages with normal and speakover priority.

3. A method of formatting data for use by a speech engine and an audio device, comprisingobtaining, at a middleware layer which facilitates communication between the speech engine and an application, a data format for data used by the engine;
- obtaining, at the middleware layer, a data format of data used by the audio device;
  
  determining, at the middleware layer, whether the engine data format and the audio data format are consistent; and
  
  if not, utilizing the middleware layer to reconfigure the engine to change the data format of the data used by the engine.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Microsoft Technology Licensing LLC (Microsoft Corporation)
Original Assignee
Microsoft Corporation
Inventors
Schmid, Philipp Heinz, Lipe, Ralph, Chambers, Robert, Connell, Edward
Primary Examiner(s)
Hudspeth, David R
Assistant Examiner(s)
Sked, Matthew J

Application Number

US09/751,836
Publication Number

US 20020069065A1
Time in Patent Office

2,153 Days
Field of Search

704/235, 704/251, 704/257, 704/260, 704/258
US Class Current

704/258
CPC Class Codes

G06F 9/4488   Object-oriented

G10L 13/04   Details of speech synthesis...

G10L 15/197   Probabilistic grammars, e.g...

G10L 15/26   Speech to text systems G10L...

G10L 15/28   Constructional details of s...

G10L 15/30   Distributed recognition, e....

H04L 65/1101   Session protocols

Middleware layer between speech related applications and engines

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

44 Citations

3 Claims

Specification

Solutions

Use Cases

Quick Links

Middleware layer between speech related applications and engines

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

44 Citations

3 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links