Noise compensation in speech recognition apparatus

US 4,897,878 A
Filed: 08/26/1985
Issued: 01/30/1990
Est. Priority Date: 08/26/1985
Status: Expired due to Term

First Claim

Patent Images

1. A method of compensating for noisy input speech in order to improve the recognition result of a speech recognition apparatus having an input for unknown speech, converting means for converting the unknown speech into time-sampled frames of speech signals representing its spectral distribution over a given range of frequencies, storing means for storing templates of known speech in the form of speech signals representing its spectral distribution over the given range of frequencies, computing means for computing the minimum mean square error of the Euclidean squared distance between the speech signals of the unknown speech compared with the speech signals of the template speech, and recognizer means for producing a recognition result based upon the minimum mean square error computed by the computing means,wherein said method of compensating for noisy input speech comprises the following steps for producing an improved minimum mean square error estimate conditioned by compensatory characteristics of the noisy input speech:

(a) computing optimal estimated distance values over the given range of frequencies for noise-free template speech, based upon comparing known speech segments, which are input in a noise-free environment and converted into corresponding templates of known speech signals t_s, with unknown speech segments, which are input in a noise-free environment and converted to unknown speech signals u_s ;

(b) computing estimated variance values corresponding to the optimal estimated distance values for a sample population of noise-free speech segments;

(c) storing said optimal estimated distance values and variance values on a look-up table associated with the template speech;

(d) computing squared distance values over the given range of frequencies for input noisy unknown speech signals u_s+n compared with signals t_s+n representing template speech to which a spectral representation of noise n in the actual input environment is added;

(e) replacing the computed squared distance values for the unknown speech signals with conditional expected distance values calculated using the optimal estimated distance values and variance values obtained from the look-up table, in order to derive noise-immune metric values for the unknown speech signals; and

(f) computing the minimum mean square error of the noise-immune metric values for the unknown speech signals compared with the noise-free template speech signals, whereby an improved recognition result is obtained.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method and apparatus for noise suppression for speech recognition systems which employs the principle of a least means square estimation which is implemented with conditional expected values. Essentially, according to this method, one computes a series of optimal estimators which estimators and their variances are then employed to implement a noise immune metric. This noise immune metric enables the system to substitute a noisy distance with an expected value which value is calculated according to combined speech and noise data which occurs in the bandpass filter domain. Thus the system can be used with any set of speech parameters and is relatively independent of a specific speech recognition apparatus structure.

Citations

8 Claims

1. A method of compensating for noisy input speech in order to improve the recognition result of a speech recognition apparatus having an input for unknown speech, converting means for converting the unknown speech into time-sampled frames of speech signals representing its spectral distribution over a given range of frequencies, storing means for storing templates of known speech in the form of speech signals representing its spectral distribution over the given range of frequencies, computing means for computing the minimum mean square error of the Euclidean squared distance between the speech signals of the unknown speech compared with the speech signals of the template speech, and recognizer means for producing a recognition result based upon the minimum mean square error computed by the computing means,wherein said method of compensating for noisy input speech comprises the following steps for producing an improved minimum mean square error estimate conditioned by compensatory characteristics of the noisy input speech:
- (a) computing optimal estimated distance values over the given range of frequencies for noise-free template speech, based upon comparing known speech segments, which are input in a noise-free environment and converted into corresponding templates of known speech signals t_s, with unknown speech segments, which are input in a noise-free environment and converted to unknown speech signals u_s ;
  
  (b) computing estimated variance values corresponding to the optimal estimated distance values for a sample population of noise-free speech segments;
  
  (c) storing said optimal estimated distance values and variance values on a look-up table associated with the template speech;
  
  (d) computing squared distance values over the given range of frequencies for input noisy unknown speech signals u_s+n compared with signals t_s+n representing template speech to which a spectral representation of noise n in the actual input environment is added;
  
  (e) replacing the computed squared distance values for the unknown speech signals with conditional expected distance values calculated using the optimal estimated distance values and variance values obtained from the look-up table, in order to derive noise-immune metric values for the unknown speech signals; and
  
  (f) computing the minimum mean square error of the noise-immune metric values for the unknown speech signals compared with the noise-free template speech signals, whereby an improved recognition result is obtained.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
- - 2. The method according to claim 1, wherein said values are provided at specific frequencies within the speech band.
  - 3. The method according to claim 2, wherein said frequencies employed are at 300, 425, 1063, 2129 and 3230 Hz.
  - 4. The method according to claim 3, wherein said values are provided at selected average signal-to-noise ratios.
  - 5. The method according to claim 4, wherein said average signal-to-noise ratios are 0 db, 10 db, and 20 db.
  - 6. The method according to claim 2, wherein said values stored are indicative of said first value at different frequencies within said speech bandwidth.
  - 7. The method according to claim 4, wherein said values stored are indicative of said first value at different signal-to-noise ratios.
  - 8. The method according to claim 1, wherein said values are replaced by mean values to provide a new expected distance equal to:
    - space="preserve" listing-type="equation">d.sup.2 =(t-u).sup.2 +σ
      
      .sub.i.sup.2 +σ
      
      .sub.u.sup.2
      where t &
      
      u are the expected values of the template and unknown and σ
      
      _t &
      
      σ
      
      _u are the variances of the estimates.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
ITT Corporation (ITT, Inc.)
Original Assignee
ITT Corporation (ITT, Inc.)
Inventors
Porter, Jack E., Boll, Steven F.
Primary Examiner(s)
Clark, David L.
Assistant Examiner(s)
Knepper, David D.

Application Number

US06/769,215
Time in Patent Office

1,618 Days
Field of Search

381/41-50, 364/513.5
US Class Current

704/233
CPC Class Codes

G10L 15/20 Speech recognition techniqu...

Noise compensation in speech recognition apparatus

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

Citations

8 Claims

Specification

Solutions

Use Cases

Quick Links

Noise compensation in speech recognition apparatus

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

8 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links