Network and language models for use in a speech recognition system

US 6,668,243 B1
Filed: 08/02/2001
Issued: 12/23/2003
Est. Priority Date: 11/25/1998
Status: Expired due to Fees

First Claim

Patent Images

1. A language model structure for use in a speech recognition system employing a tree-structured network model, the language model comprising identifiers with associated language model probabilities, the language model being structured such that identifiers associated with each word and contained therein are arranged such that each node of the network model with which the language model is associated spans a continuous range of identifiers and associated language model probabilities in the language model structure.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A language model structure for use in a speech recognition system employs a tree-structured network model. The language model is structured such that identifiers associated with each word and contained therein are arranged such that each node of the network model with which the language model is associated spans a continuous range of identifiers. A method of transferring tokens through a tree-structured network in a speech recognition process is also provided.

46 Citations

View as Search Results

11 Claims

1. A language model structure for use in a speech recognition system employing a tree-structured network model, the language model comprising identifiers with associated language model probabilities, the language model being structured such that identifiers associated with each word and contained therein are arranged such that each node of the network model with which the language model is associated spans a continuous range of identifiers and associated language model probabilities in the language model structure.
- View Dependent Claims (2)
- - 2. A speech recognition system including a language model according to claim 1.

3. A tree-structured network for use in a speech recognition system, the tree-structured network comprising:
- a first tree-structured section representing the first phone of each word having two or more phones;
  
  a second tree-structured section representing within word phones, wherein within word phones includes any phone between the first phone and the last phone of a word;
  
  a third tree-structured section representing the last or only phone of each word;
  
  a fourth tree-structured section representing inter-word silences; and
  
  , a number of null nodes for joining each tree-structured section to the following tree-structured section.
- View Dependent Claims (4)
- - 4. A speech recognition system including a tree-structured network according to claim 3.

5. A method of transferring tokens through a tree-structured network in a speech recognition process, each token including a likelihood which indicates the probability of a respective path through the network representing a respective word to be recognised, and wherein each token further includes a history of previously recognised words, the method comprising the steps of:
- i) combining tokens at each state of the network to form a set of tokens, the set including a main token having the highest likelihood and one or more relative tokens;
  
  ii) converting the likelihood of each relative token into a relative likelihood that is set relative to the likelihood of the main token;
  
  iii) for each set of tokens, merging tokens having the same history;
  
  iv) transferring the set of tokens to subsequent nodes in the network;
  
  v) updating the likelihood of at least the main token of each set of tokens; and
  
  vi) repeating steps i) to v) at each respective node.
- View Dependent Claims (6)
- - 6. The method according to claim 5, wherein the step of merging tokens comprises:

7. A speech recognition system having a network of nodes comprising:
- a set of first-phone nodes representing the first phones of words;
  
  a tree-structured section representing within word phones, wherein within word phones includes any phone between the first phone and the last phone of a word; and
  
  a number of null nodes for joining the set of first-phone nodes to the tree-structured section.
- View Dependent Claims (8, 9, 10, 11)
- - 8. The speech recognition system of claim 7 wherein the network of nodes further comprises:
9. The speech recognition system of claim 8 wherein the network of nodes further comprises:
- a set of inter-word silence nodes; and
  
  a set of null nodes for connecting nodes in the set of last-phone nodes to the inter-word silence nodes.
10. The speech recognition system of claim 9 wherein the network of nodes further comprises:
- a set of null nodes for connecting the inter-word silence nodes to the nodes in the set of first-phone nodes.
11. The speech recognition system of claim 7 wherein each first-phone node represents a tri-phone.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Microsoft Technology Licensing LLC (Microsoft Corporation)
Original Assignee
Microsoft Corporation
Inventors
Odell, Julian J.
Primary Examiner(s)
ABEBE, DANIEL DEMELASH

Application Number

US09/856,802
Time in Patent Office

873 Days
Field of Search

704/231, 704/239, 704/240, 704/232, 704/242, 704/243, 704/244, 704/245, 704/251, 704/255, 704/256, 704/259
US Class Current

704/243
CPC Class Codes

G10L 15/08   Speech classification or se...

G10L 15/187   Phonemic context, e.g. pron...

G10L 15/197   Probabilistic grammars, e.g...

Network and language models for use in a speech recognition system

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

46 Citations

11 Claims

Specification

Solutions

Use Cases

Quick Links

Network and language models for use in a speech recognition system

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

46 Citations

11 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links