Large vocabulary connected speech recognition system and method of language representation using evolutional grammar to represent context free grammars

US 5,719,997 A
Filed: 08/20/1996
Issued: 02/17/1998
Est. Priority Date: 01/21/1994
Status: Expired due to Term

Abstract

A method of recognizing speech input selectively creates and maintains grammar representations of the speech input in essentially real time. Speech input frames are received by a speech recognition system. Grammar representations are created for each speech frame and a probability score is derived for the representations indicating the probability of the accuracy of the representations to the speech input. Representations having a probability score below a predetermined threshold are not maintained. Those grammar representations having probability scores above the predetermined threshold are maintained. As more speech frames are received by the system, additional grammar representations are created and the probability scores are updated. When the entire speech input has been received, the chain of grammar representations having the highest probability score is identified as the speech input.
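The pruning loop described in the abstract can be sketched roughly as follows. This is a minimal illustration under simplifying assumptions, not the patented implementation: the names `recognize`, `score_word`, and `threshold` are hypothetical, scores are treated as log-probabilities, and one word hypothesis is created per frame, which a real recognizer would not do.

```python
def recognize(frames, vocabulary, score_word, threshold):
    """Toy frame-by-frame search with threshold pruning.

    frames      -- sequence of opaque speech-frame observations
    vocabulary  -- iterable of candidate words (hypothetical)
    score_word  -- callable(frame, word) -> log-probability (stand-in scorer)
    threshold   -- hypotheses scoring at or below this value are dropped
    """
    # Each hypothesis is (chain_of_words, cumulative_log_score).
    active = [((), 0.0)]
    for frame in frames:
        extended = []
        for chain, score in active:
            for word in vocabulary:
                new_score = score + score_word(frame, word)
                # Only representations scoring above the threshold are maintained.
                if new_score > threshold:
                    extended.append((chain + (word,), new_score))
        active = extended
        if not active:
            return None  # every hypothesis was pruned
    # The surviving chain with the highest score is taken as the result.
    return max(active, key=lambda hypothesis: hypothesis[1])
```

Because pruning happens after every frame, the number of live hypotheses stays bounded by how many clear the threshold rather than growing with every possible word sequence.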

12 Claims

1. A speech recognition system for recognizing one or more speech inputs, said recognition system having no initial grammar, the system comprising:
- means for creating a grammar start state to initialize a grammar represented by a grammar network, which is comprised of arcs interconnecting nodes including a start node, predetermined initial word scores being assigned to the nodes, said predetermined initial word scores including a first predetermined initial word score assigned to the start node, and a second different predetermined initial word score assigned to at least one of the other nodes,
- means for dynamically creating word representations from the grammar start state for the speech inputs as the inputs are received, each of the word representations being represented by a respective one of the arcs in the grammar network,
- means for maintaining a score for each word representation created,
- means for propagating word scores meeting a threshold level through the grammar network,
- means for updating the word scores at each node other than the start node to maintain only active word representations having word scores above the threshold level,
- means for chaining word scores together which exceed the threshold level, and
- means for determining the chain of word scores which represents the speech input.
- Dependent claims (2, 3, 4, 5, 6)
  - 2. The speech recognition system of claim 1 wherein said word representations are Ephemeral Hidden Markov Models (EHMM).
  - 3. The speech recognition system of claim 2 wherein said EHMMs are dynamically created and destroyed as determined by the word score for a particular EHMM.
  - 4. The speech recognition system of claim 1 wherein said updating means periodically compares said updated scores to said threshold levels.
  - 5. The speech recognition system of claim 1 further comprising traceback means for tracing back through each string having word scores above the threshold level to determine the string having the highest score.
  - 6. The system of claim 1 wherein said second different initial predetermined word score is assigned to each one of the other nodes.
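The traceback of claim 5 can be illustrated with back-pointers. The sketch below is an assumption about one plausible data layout, not the patent's own structure: `Entry`, `prev`, and `traceback` are hypothetical names, and each entry is assumed to record the cumulative score of the chain ending at that word.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Entry:
    word: str
    score: float          # cumulative score of the chain ending at this word
    prev: Optional[int]   # index of the preceding entry, None at the start


def traceback(entries: List[Entry], final_indices: List[int]) -> List[str]:
    """Return the words of the highest-scoring surviving chain."""
    if not final_indices:
        return []
    # Pick the best chain end among those that survived pruning.
    best = max(final_indices, key=lambda i: entries[i].score)
    words = []
    idx: Optional[int] = best
    # Follow back-pointers from the end of the string to its start.
    while idx is not None:
        words.append(entries[idx].word)
        idx = entries[idx].prev
    words.reverse()
    return words
```

Storing only a back-pointer per entry keeps memory proportional to the number of surviving word hypotheses, since pruned chains never need to be revisited.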

7. A method of recognizing speech input signals comprising the steps of:
- a) creating a grammar start state to initialize a grammar represented by a grammar network, which is comprised of arcs interconnecting nodes including a start node, predetermined initial word scores being assigned to the nodes, said predetermined initial word scores including a first predetermined initial word score assigned to the start node, and a second different predetermined initial word score assigned to at least one of the other nodes,
- b) dynamically creating word representations for said speech signals from the grammar start state, each of the word representations being represented by a respective one of the arcs in the grammar network,
- c) computing a word score for each word representation,
- d) comparing said word scores to a threshold level,
- e) chaining together those word representations having word scores above the threshold level to form phrase strings,
- f) determining which word scores are below the threshold value,
- g) destroying those word representations having word scores below the threshold level,
- h) updating the word score for each active word representation,
- i) computing, at a respective one of the nodes, phrase scores for each word string comprising word representations having word scores above the threshold level,
- j) repeating steps b)-i) until said entire speech signal has been inputted, and
- k) identifying the phrase string having the highest phrase score as the recognized speech input.
- Dependent claims (8, 9, 10, 11)
  - 8. The method according to claim 7, wherein said word representations are created from a context free grammar.
  - 9. The method according to claim 8 wherein said step of creating word representations further comprises the step of:
    - evolutionally creating finite state grammar representations from the context free grammar representations.
  - 10. The method according to claim 7 wherein said step of identifying the phrase string having the highest score further comprises the steps of:
    - tracing back through the string having the highest score from the end of the string to a start position.
  - 11. The method of claim 7 wherein said second different initial predetermined word score is assigned to each one of the other nodes.
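One way to read claims 8 and 9 is that finite-state arcs are generated from the context free grammar only as the search reaches them, rather than compiling the whole network up front. The sketch below illustrates that reading under stated assumptions: the toy `GRAMMAR`, the functions `next_arcs` and `accepts`, and the representation of a state as a tuple of pending symbols are all hypothetical, and the grammar is assumed to have no left recursion and no empty productions.

```python
# Hypothetical toy grammar: nonterminal -> list of productions.
GRAMMAR = {
    "S": [("call", "NAME"), ("dial", "DIGITS")],
    "NAME": [("home",), ("office",)],
    "DIGITS": [("DIGIT",), ("DIGIT", "DIGITS")],
    "DIGIT": [("one",), ("two",)],
}


def next_arcs(state, grammar=GRAMMAR):
    """Expand a tuple of pending symbols only as far as the next word.

    Returns a list of (word, successor_state) pairs, i.e. the outgoing
    arcs of this state in the on-demand finite-state network.
    """
    if not state:
        return []
    head, rest = state[0], state[1:]
    if head not in grammar:
        # Terminal symbol: a single arc labelled with the word.
        return [(head, rest)]
    arcs = []
    for production in grammar[head]:
        # Nonterminal: substitute each production and keep expanding.
        arcs.extend(next_arcs(production + rest, grammar))
    return arcs


def accepts(words, start=("S",), grammar=GRAMMAR):
    """Walk the network lazily; only states actually reached are created."""
    states = {start}
    for w in words:
        states = {
            nxt
            for s in states
            for word, nxt in next_arcs(s, grammar)
            if word == w
        }
        if not states:
            return False
    # An empty tuple means no symbols are pending, so the phrase is complete.
    return () in states
```

For example, `accepts(["dial", "one", "two"])` only ever expands the `DIGITS` branch; the `NAME` productions are never touched once "dial" has been matched.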

12. A method of recognizing speech input by a speech recognition system comprising the steps of:
- receiving a sequence of speech frames;
- creating a grammar representation for each speech frame, each grammar representation having a source node and an end node, the creating step including the step of assigning predetermined initial probability scores to respective source nodes of grammar representations of the speech frames, a predetermined initial probability score assigned to the source node of the grammar representation of the first frame in the sequence being different from that assigned to the source node of the grammar representation of at least another frame in the sequence;
- deriving a probability score for each grammar representation at the end node thereof indicating the probability of the accuracy of the representation to the speech input;
- selectively maintaining grammar representations having a probability score above a predetermined threshold; and
- chaining together representations having the highest probability score for each speech frame.
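The independent claims each assign one predetermined initial score to the start (or first source) node and a different one to other nodes, then propagate scores that clear a threshold. The sketch below shows one conventional way to do this in the log domain. The specific values (0.0 for the start node, negative infinity elsewhere) and the names `init_scores` and `propagate` are assumptions for illustration; the claims do not state particular values.

```python
import math


def init_scores(nodes, start, start_score=0.0, other_score=-math.inf):
    """Give the start node one initial score and every other node another."""
    return {n: (start_score if n == start else other_score) for n in nodes}


def propagate(scores, arcs, arc_score, threshold):
    """One update step over arcs given as (source, word, destination).

    Scores are pushed only from nodes above the threshold, and each
    destination keeps the best candidate that itself clears the threshold.
    """
    updated = dict(scores)
    for src, word, dst in arcs:
        if scores[src] <= threshold:
            continue  # inactive source node: nothing to propagate
        candidate = scores[src] + arc_score(word)
        if candidate > threshold and candidate > updated[dst]:
            updated[dst] = candidate
    return updated
```

With these defaults only the start node is active at first, so activity spreads outward one arc per step and nodes that are never reached stay at negative infinity.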

Current Assignee
Lucent Technologies, Inc. (Nokia Corporation)
Original Assignee
Lucent Technologies, Inc. (Nokia Corporation)
Inventors
Brown, Michael Kenneth; Glinski, Stephen Charles
Primary Examiner(s)
MacDonald, Allen R.
Assistant Examiner(s)
Dorvil, Richemond

Application Number

US08/697,152
Time in Patent Office

546 Days
Field of Search

395/2.65, 395/2.64, 395/2.66, 395/2.53, 395/2.54, 395/2.52, 395/2.6, 395/2.49, 395/2.61, 395/2.5, 364/419.01, 364/419.02, 364/419.08, 364/419.11
US Class Current

704/257
CPC Class Codes

G06F 18/29   Graphical models, e.g. Baye...

G06V 30/347   Sampling; Contour coding; S...

G06V 30/373   using a special pattern or ...

G10L 15/18   using natural language mode...

G10L 15/1815   Semantic context, e.g. disa...

G10L 15/193   Formal grammars, e.g. finit...
