Search optimization system and method for continuous speech recognition

US 6,397,179 B2
Filed: 11/04/1998
Issued: 05/28/2002
Est. Priority Date: 12/24/1997
Status: Expired due to Fees

First Claim

Patent Images

1. A method for continuous speech recognition, comprising:

receiving an input signal derived from a spoken utterance of a set of words;

providing a search network indicative of a plurality of recognizable words, the search network comprising a plurality of interconnected words, a given interconnection being established at least in part based on a connected word grammar, providing semantic information;

detecting in the search network connected word grammars bounded by semantically null words;

generating a modified search network by performing a process comprising;

collapsing each list of semantically null words into a unique single-input single-output search network portion, and identifying stop nodes in the search network, the modified search network being indicative of a plurality of non-semantically-null words; and

processing the input signal at least in part based on the modified search network to derive a list of N-best salient words that potentially match at least one word of the spoken utterance.

View all claims

6 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A system and method for continuous speech recognition (CSR) is optimized to reduce processing time for connected word grammars bounded by semantically null words. The savings, which reduce processing time both during the forward and the backward passes of the search, as well as during rescoring, are achieved by performing only the minimal amount of computation required to produce an exact N-best list of semantically meaningful words (N-best list of salient words). This departs from the standard Spoken Language System modeling which any notion of meaning is handled by the Natural Language Understanding (NLU) component. By expanding the task of the recognizer component from a simple acoustic match to allow semantic information to be fed to the recognizer, significant processing time savings are achieved, and make it possible to run an increased number of speech recognition channels in parallel for improved performance, which may enhance users perception of value and quality of service.

109 Citations

9 Claims

1. A method for continuous speech recognition, comprising:
- receiving an input signal derived from a spoken utterance of a set of words;
  
  providing a search network indicative of a plurality of recognizable words, the search network comprising a plurality of interconnected words, a given interconnection being established at least in part based on a connected word grammar, providing semantic information;
  
  detecting in the search network connected word grammars bounded by semantically null words;
  
  generating a modified search network by performing a process comprising;
  
  collapsing each list of semantically null words into a unique single-input single-output search network portion, and identifying stop nodes in the search network, the modified search network being indicative of a plurality of non-semantically-null words; and
  
  processing the input signal at least in part based on the modified search network to derive a list of N-best salient words that potentially match at least one word of the spoken utterance.
- View Dependent Claims (2, 3, 4)
- - 2. The method of claim 1, wherein the stop nodes comprise forward stop nodes and backward stop nodes.
  - 3. The method of claim 2, wherein the step of processing the input signal comprises a search characterized by a forward pass and a backward pass, wherein the method further comprises the steps of:
4. The method of claim 3, wherein scoring comprises Viterbi scoring.

5. Software on a machine readable medium for performing a method for continuous speech recognition, the method comprising the steps of:
- providing an input signal derived from a spoken utterance of a set of words;
  
  providing a search network indicative of a plurality of recognizable words, the search network comprising a plurality of interconnected words, a given interconnection being established at least in part based on a connected word grammar, providing semantic information;
  
  detecting in the search network connected word grammars bounded by semantically null words;
  
  generating a modified search network by performing a process comprising;
  
  collapsing each list of semantically null words into a unique single-input single-output search network portion, and identifying stop nodes in the search network, the modified search network being indicative of a plurality of non-semantically-null words; and
  
  processing the input signal at least in part based on the modified search network to derive a list of N-best salient words that potentially match at least one word of the spoken utterance.

6. A system for continuous speech recognition, comprising:
- an input for receiving an input signal derived from a spoken utterance of a set of words;
  
  means for providing a search network indicative of a plurality of recognizable words, the search network comprising a plurality of interconnected words, a given interconnection being established at least in part based on a connected word grammar, means for providing, semantic information;
  
  means for detecting in the search network connected word grammars bounded by semantically null words;
  
  means for generating a modified search network by performing a process comprising;
  
  collapsing each list of semantically null words into a unique single-input single-output search network portion, and identifying stop nodes in the search network, the modified search network being indicative of a plurality of non-semantically-null words; and
  
  means for processing the input signal at least in pair based on the modified search network to derive a list of N-best salient words that potentially match at least one word of the spoken utterance.
- View Dependent Claims (7, 8, 9)
- - 7. The system of claim 6, wherein the stop nodes include forward stop nodes and backward stop nodes.
  - 8. The system of claim 7, wherein the means for processing the input signal is operative for implementing a search characterized by a forward pass and a backward pass;
9. The system of claim 8, wherein scoring comprises Viterbi scoring.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Popkin Family Assets LLC (Intellectual Ventures LLC)
Original Assignee
Nortel Networks Limited (Nortel Networks Corporation)
Inventors
Stubley, Peter R., Robillard, Serge, Crespo, Jean-Francois
Primary Examiner(s)
Dorvil, Richemond
Assistant Examiner(s)
Nolan, Daniel A.

Application Number

US09/185,529
Publication Number

US 20010041978A1
Time in Patent Office

1,301 Days
Field of Search

704/231,239,240,243,251,252,242,257
US Class Current

704/242
CPC Class Codes

G10L 15/1815 Semantic context, e.g. disa...

G10L 2015/085 Methods for reducing search...

Search optimization system and method for continuous speech recognition

First Claim

6 Assignments

0 Petitions

Accused Products

Abstract

109 Citations

9 Claims

Specification

Solutions

Use Cases

Quick Links

Search optimization system and method for continuous speech recognition

First Claim

6 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

109 Citations

9 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links