Context dependent phoneme networks for encoding speech information

US 6,182,038 B1
Filed: 12/01/1997
Issued: 01/30/2001
Est. Priority Date: 12/01/1997
Status: Expired due to Term

First Claim

Patent Images

1. A method for encoding speech information comprising:

generating at a local user location, as an intermediate step in speech recognition, a context dependent phoneme network from speech in a phoneme network generator using an acoustic model that adapts to a user'"'"'s voice, wherein the context dependent phoneme network is a representation of speech input in the form of nodes and arcs, each arc representing a score of a phoneme with start and end times represented by nodes, the phoneme network enabling the speech input to be represented by the nodes and arcs thereby resulting in the speech input being packaged into an intermediate format that is independent of vocabulary, language model, user and physical environment; and

transmitting the context dependent phoneme network to one or more application programs located remotely from the local user, to enable the remote application programs to effect recognition of speech in each application program using a vocabulary and language model selected by the application program, thereby obviating the need for the local user location to perform recognition of speech tasks.

View all claims

4 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method and apparatus for generating a context dependent phoneme network as an intermediate step of encoding speech information. The context dependent phoneme network is generated from speech in a phoneme network generator (48) associated with an operating system (44). The context dependent phoneme network is then transmitted to a first application (52).

141 Citations

5 Claims

1. A method for encoding speech information comprising:
- generating at a local user location, as an intermediate step in speech recognition, a context dependent phoneme network from speech in a phoneme network generator using an acoustic model that adapts to a user'"'"'s voice, wherein the context dependent phoneme network is a representation of speech input in the form of nodes and arcs, each arc representing a score of a phoneme with start and end times represented by nodes, the phoneme network enabling the speech input to be represented by the nodes and arcs thereby resulting in the speech input being packaged into an intermediate format that is independent of vocabulary, language model, user and physical environment; and
  
  transmitting the context dependent phoneme network to one or more application programs located remotely from the local user, to enable the remote application programs to effect recognition of speech in each application program using a vocabulary and language model selected by the application program, thereby obviating the need for the local user location to perform recognition of speech tasks.
- View Dependent Claims (2)
- - 2. The method according to claim 1 further comprising extracting, at a first application, information needed from the context dependent phoneme network using vocabulary and language models of the first application in order to operate the first application.

3. A data storage medium comprising instructions and data which, when loaded into a first general purpose microprocessor having an operating system cause the first general purpose microprocessor to comprise:
- a phoneme network generator located at a local user location generating a context dependent phoneme network having an output defining the context dependent phoneme network, wherein the context dependent phoneme network enables the speech input to be represented in the form of nodes and arcs, where each arc represents a score of a phoneme with start and end times represented by nodes, thereby resulting in the speech input being packaged in an intermediate format; and
  
  a plurality of application programs located remotely from the local user location adapted to receive the output of the phoneme network generator and extract information needed from the output using vocabulary and language models of the plurality of application programs thereby eliminating information from being extracted at the local user location, the phoneme network generator and the plurality of application programs being independently associated with the operating system.
- View Dependent Claims (4)
- - 4. The data storage medium according to claim 3 wherein the data storage medium comprises a first part having stored thereon the phoneme network generator and a second part having stored thereon the plurality of applications.

5. A method for encoding speech information comprising:
- generating at a local user location a context dependent phoneme network from speech in a phoneme network generator associated with an operating system, wherein the context dependent phoneme network is a representation of speech input in the form of nodes and arcs, where each arc represents a score of a phoneme with start and end times represented by nodes, thereby packaging the speech input in an intermediate format;
  
  transmitting the context dependent phoneme network to a plurality of applications located remotely from the local user location via the operating system; and
  
  extracting, at the remotely located plurality of applications, information needed from the context dependent phoneme network using vocabulary and language models of the plurality of applications in order to operate the plurality of applications.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Google Technology Holdings LLC (Alphabet Inc.)
Original Assignee
Motorola, Inc. (Motorola Solutions, Inc.)
Inventors
Austin, Stephen, Balakrishnan, Sreeram
Primary Examiner(s)
Hudspeth, David R..
Assistant Examiner(s)
Lerner, Martin

Application Number

US08/980,954
Time in Patent Office

1,156 Days
Field of Search

704/231, 704/232, 704/233, 704/244, 704/246, 704/251, 704/254, 704/255, 704/256, 704/257, 704/270, 704/275, 704/250, 379/88.01, 379/88.04
US Class Current

704/250
CPC Class Codes

G06F 16/24   Querying

G10L 15/187   Phonemic context, e.g. pron...

G10L 15/30   Distributed recognition, e....

Context dependent phoneme networks for encoding speech information

First Claim

4 Assignments

0 Petitions

Accused Products

Abstract

141 Citations

5 Claims

Specification

Solutions

Use Cases

Quick Links

Context dependent phoneme networks for encoding speech information

First Claim

4 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

141 Citations

5 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links