Speech recognition apparatus and methods

US 5,228,087 A
Filed: 07/07/1992
Issued: 07/13/1993
Est. Priority Date: 04/12/1989
Status: Expired due to Term

Abstract

Speech recognition is carried out by performing a first analysis of a speech signal using a Hidden Semi Markov Model and an asymmetric time warping algorithm. A second analysis is also performed using Multi-Layer Perceptron techniques in conjunction with a neural net. The first analysis is used by the second to identify word boundaries. Where the first analysis provides an indication of the word spoken above a certain level of confidence, an output representative of the word spoken may be generated solely in response to the first analysis, the second analysis being utilized when the level of confidence falls. The output controls a function of an aircraft and provides feedback to the speaker of the words spoken.
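
The confidence-gated hand-off described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions: the names `recognise`, `first_pass`, `second_pass` and the default `threshold` of 0.8 are hypothetical and do not come from the patent, and the two analysers are passed in as plain callables rather than implemented as a Hidden Semi Markov Model or a multi-layer perceptron.

```python
from typing import Callable, List, Sequence, Tuple, TypeVar

Segment = TypeVar("Segment")

# Hypothetical interface: an analyser maps one word-length segment of the
# speech signal to (recognised word, confidence between 0 and 1).
Analyser = Callable[[Segment], Tuple[str, float]]


def recognise(
    segments: Sequence[Segment],
    first_pass: Analyser,
    second_pass: Analyser,
    threshold: float = 0.8,  # assumed value; the patent only says "a certain level"
) -> List[str]:
    """Pick one word per segment, preferring the first analysis when it is confident."""
    words = []
    for segment in segments:
        word, confidence = first_pass(segment)
        if confidence <= threshold:
            # The first analysis is not confident enough, so the second
            # (neural net) analysis supplies the word for this segment.
            word, _ = second_pass(segment)
        words.append(word)
    return words
```

The segments are assumed to have been cut at the word boundaries found by the first analysis, which is why the second pass receives whole words.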

9 Claims

1. A method of word recognition in continuous speech comprising the steps of:
- deriving a speech signal;
- initially performing a first analysis of the speech signal by a Markov or other technique not involving neural net techniques to identify boundaries between different words and to separate the entire speech signal into discrete words;
- providing a first signal in accordance with the first analysis;
- comparing the first signal from the first analysis with a stored vocabulary of a multiplicity of words to provide a second signal that is a first indication of the words spoken;
- supplying the entire first signal provided by the first analysis to means for performing a second analysis different from the first analysis and utilizing neural net techniques on the entire words without any prior restriction of word candidates by the first analysis to produce a third signal representative of the words spoken; and
- providing an output signal representative of the words spoken from at least the third signal produced by the second analysis.

2. A method according to claim 1, wherein the vocabulary contains dynamic time warping templates.

3. A method according to claim 2, wherein the first analysis is performed using an asymmetric dynamic time warping algorithm.

4. A method according to claim 1, wherein the first analysis is performed utilizing a plurality of different algorithms, wherein each algorithm provides a signal indicative of the word in the vocabulary store closest to the speech signal together with an indication of the confidence that the indicated word is the word spoken, and wherein a comparison is made between the signals provided by the different algorithms.

5. A method according to claim 1, wherein the said first indication of the words spoken is provided with a measure of confidence, and wherein the said output signal is provided solely in response to said first indication when the measure of confidence is greater than a predetermined value.

6. A method according to claim 1, wherein the second analysis is performed using a multi-layer perceptron technique in conjunction with a neural net.
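
Claims 2 and 3 refer to dynamic time warping templates and an asymmetric dynamic time warping algorithm without fixing the details. As a rough illustration only, the sketch below uses one common asymmetric step pattern, in which every input frame is consumed exactly once while the template index may stay put, advance by one, or skip one. That step pattern, the scalar frames, the absolute-difference distance, and the names `asymmetric_dtw` and `closest_word` are all assumptions made for the example, not details taken from the patent.

```python
import math
from typing import Dict, Sequence


def asymmetric_dtw(frames: Sequence[float], template: Sequence[float]) -> float:
    """Accumulated distance of the best alignment of `frames` onto `template`.

    Assumed step pattern: each input frame advances by exactly one, while the
    template index moves by 0, 1 or 2. The path starts at the first template
    frame and must end at the last one.
    """
    if not frames or not template:
        raise ValueError("frames and template must be non-empty")
    m = len(template)
    # prev[j] = best cost of reaching template frame j at the previous input frame
    prev = [math.inf] * m
    prev[0] = abs(frames[0] - template[0])
    for frame in frames[1:]:
        cur = [math.inf] * m
        for j in range(m):
            # Predecessors: same template frame, one back, or two back.
            best = min(prev[max(j - 2, 0): j + 1])
            if best < math.inf:
                cur[j] = best + abs(frame - template[j])
        prev = cur
    return prev[-1]


def closest_word(frames: Sequence[float], templates: Dict[str, Sequence[float]]) -> str:
    """Return the vocabulary word whose template has the lowest warped distance."""
    return min(templates, key=lambda word: asymmetric_dtw(frames, templates[word]))
```

A real front end would compare multi-dimensional feature vectors rather than single numbers; the scalar version is used here only to keep the recurrence easy to follow.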

7. Speech recognition apparatus for recognizing words in continuous speech comprising:
- store means containing speech information about a vocabulary of words that can be recognized;
- means for deriving a speech signal;
- first analysis means for performing a first analysis of the entire speech signal by a Markov or other technique not involving neural net techniques, said first analysis identifying boundaries between all the different words in said continuous speech and providing a first signal in accordance therewith;
- means for comparing the first signal provided by the first analysis with the stored vocabulary to provide a second signal that is a first indication of the words spoken;
- second analysis means operative subsequent to the performance of said first analysis for performing a second analysis of the speech signal;
- means for supplying the entire first signal provided by said first analysis means to said second analysis means, said second analysis means utilizing neural net techniques and word boundary identification from said first analysis on the entire words without any prior restriction of word candidates by the first analysis;
- means for providing from the second analysis a second indication of the words spoken; and
- means for providing an output signal representative of the words spoken in response to at least the second indication.

8. Apparatus according to claim 7, wherein the apparatus includes a noise marking unit that performs a noise marking algorithm on the speech signals.

9. Apparatus according to claim 7, wherein the apparatus includes a syntax unit that performs syntax restriction on the stored vocabulary in accordance with the syntax of previously identified words.
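
The syntax restriction recited in claim 9 could be pictured along the following lines. This is a minimal sketch under assumptions: the grammar table `SYNTAX`, its example words, and the function `restricted_vocabulary` are hypothetical, and the patent does not say how the syntax unit represents its rules.

```python
from typing import Dict, Optional, Set, TypeVar

Template = TypeVar("Template")

# Hypothetical grammar: maps the previously identified word (None at the
# start of an utterance) to the set of words allowed to follow it.
SYNTAX: Dict[Optional[str], Set[str]] = {
    None: {"select", "radio"},
    "select": {"radio", "map"},
    "radio": {"one", "two"},
}


def restricted_vocabulary(
    vocabulary: Dict[str, Template],
    previous_word: Optional[str],
    syntax: Dict[Optional[str], Set[str]] = SYNTAX,
) -> Dict[str, Template]:
    """Return only the vocabulary entries that may follow `previous_word`."""
    allowed = syntax.get(previous_word)
    if allowed is None:
        # No rule for this word: leave the vocabulary unrestricted.
        return dict(vocabulary)
    return {word: tpl for word, tpl in vocabulary.items() if word in allowed}
```

Narrowing the stored vocabulary this way reduces the number of templates the comparison step has to consider for the next word.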

Current Assignee: GE Aviation UK (GE Aerospace)
Original Assignee: Smiths Industries Public Limited Company
Inventors: Bickerton, Ian
Primary Examiner(s): KEMENY, EMANUEL
Application Number: US07/908,806
Time in Patent Office: 371 Days
Field of Search: 381/41-43, 395/2
US Class Current: 704/232
CPC Class Codes:
G10L 15/12   using dynamic programming t...
G10L 15/142   Hidden Markov Models [HMMs]
G10L 15/16   using artificial neural net...