Frame comparison method for word recognition in high noise environments

US 4,918,732 A
Filed: 05/25/1989
Issued: 04/17/1990
Est. Priority Date: 01/06/1986
Status: Expired due to Fees

First Claim

Patent Images

1. A method for comparing stored speech recognition templates which are framed into time segments and channelized into at least two channels which are frequency band-limited to an input signal which has been contaminated by high levels of noise and which is framed into a time segment and channelized into at least two channels which are frequency band-limited, comprising the steps of:

determining, from the input signal, a first noise level associated with a first of the at least two channels and a second noise level associated with a second one of the at least two channels;

adding a buffering level to each of said first and second noise levels to create respective first and second buffered noise levels;

determining, from the input signal, a first signal level associated with a first of the at least two channels and a second signal level associated with a second of the at least two channels;

normalizing the level of each said first and second signal levels to create, respectively, normalized first and second signal levels;

normalizing a first channel stored speech recognition template and normalizing a second channel stored speech recognition template to create, respectively, first and second normalized template signal levels,subtracting said normalized first signal level from said normalized first template signal level to determine a first difference and subtracting said normalized second signal level from said normalized second template signal level to determine a second difference; and

generating a distance measure by at least adding together;

(a) the absolute value of said first difference if said first signal level is greater than said first buffered noise level, or said first difference if said first signal level is less than said first buffered noise level and said first difference is a positive value, or a predetermined nominal differential value if said first signal level is less than said first buffered noise level and said first difference is a negative value; and

(b) the absolute value of said second difference if said second signal level is greater than said second buffered noise level, or said second difference if said second signal level is less than said second buffered noise level and said second difference is a positive value, or a predetermined nominal differential value if said second signal level is less than said second buffered noise level and said second difference is a negative value.

View all claims

0 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method and arrangement for a speech recognition system employs channel bank information to represent speech. The method considers background noise included with the speech. The method includes determining three energy levels for each channel the first representative of background noise energy, the second representative of the input frame energy and the third representative of the word template frame energy. Values representing energy level differentials are assigned at each channel. If the second energy level is less than the first energy level, then a predetermined constant value is assigned at the particular channel. These values are combined to generate a distance measure depicting the similarity between the two frames.

49 Citations

View as Search Results

6 Claims

1. A method for comparing stored speech recognition templates which are framed into time segments and channelized into at least two channels which are frequency band-limited to an input signal which has been contaminated by high levels of noise and which is framed into a time segment and channelized into at least two channels which are frequency band-limited, comprising the steps of:
- determining, from the input signal, a first noise level associated with a first of the at least two channels and a second noise level associated with a second one of the at least two channels;
  
  adding a buffering level to each of said first and second noise levels to create respective first and second buffered noise levels;
  
  determining, from the input signal, a first signal level associated with a first of the at least two channels and a second signal level associated with a second of the at least two channels;
  
  normalizing the level of each said first and second signal levels to create, respectively, normalized first and second signal levels;
  
  normalizing a first channel stored speech recognition template and normalizing a second channel stored speech recognition template to create, respectively, first and second normalized template signal levels,subtracting said normalized first signal level from said normalized first template signal level to determine a first difference and subtracting said normalized second signal level from said normalized second template signal level to determine a second difference; and
  
  generating a distance measure by at least adding together;
  
  (a) the absolute value of said first difference if said first signal level is greater than said first buffered noise level, or said first difference if said first signal level is less than said first buffered noise level and said first difference is a positive value, or a predetermined nominal differential value if said first signal level is less than said first buffered noise level and said first difference is a negative value; and
  
  (b) the absolute value of said second difference if said second signal level is greater than said second buffered noise level, or said second difference if said second signal level is less than said second buffered noise level and said second difference is a positive value, or a predetermined nominal differential value if said second signal level is less than said second buffered noise level and said second difference is a negative value.
- View Dependent Claims (2, 3)
- - 2. A method in accordance with the method of claim 1 wherein said step of generating a distance measure further comprises the steps of:
    - assigning a first binary value to a storage means associated with said first channel if said first signal level is greater than said first buffered noise level;
      
      assigning a second binary value to said storage means associated with said first channel if said first signal level is less than said first buffered noise level;
      
      if said first channel has an associated first binary assigned storage value, summing the absolute value of said first difference into said distance measure in response to said positive value determination; and
      
      if said first channel has an associated second binary assigned storage value, determining if said first difference is a positive value and adding said first difference to said distance measure; and
      
      if said first channel has an associated second binary assigned storage value, determining if said first difference is a negative value and adding a predetermined nominal differential value to said distance measure in response to said negative value determination.
  - 3. A method in accordance with the method of claim 1 wherein said step of generating a distance measure further comprises the steps of:
    - determining whether both said first and second signal levels exceed a predetermined threshold level andinhibiting said generating step if both said first and second signal levels do not exceed said predetermined threshold level.

4. A word recognition detector which compares stored speech recognition templates which are framed into time segments and channelized into at least two channels which are frequency band-limited to an input signal which has been contaminated by high levels of noise and which is framed into a time segment and channelized into at least two channels which are frequency band-limited, the word recognition detector comprising:
- means for determining, from the input signal, a first noise level associated with a first of the at least two channels and a second noise level associated with a second one of the at least two channels;
  
  means for adding a buffering level to each of said first and second noise levels to create respectively first and second buffered noise levels;
  
  means for determining, from the input signal, a first signal level associated with a first of the at least two channels and a second signal level associated with a second of the at least two channels;
  
  means for normalizing the level of each said first and second signal levels to create, respectively, normalized first and second signal levels;
  
  means for normalizing a first channel stored recognition template and means for normalizing a second channel stored speech recognition template to create, respectively, first and second normalized template signal levels;
  
  means for subtracting said normalized first signal level from said normalized first template signal level to determine a first difference and means for subtracting said normalized second signal level from said normalized second template signal level to determine a second difference; and
  
  means for generating a distance measure by adding together at least a first and a second addend, further comprising;
  
  (a) means for selecting a first addend as the absolute value of said first difference if said first signal level is greater than said first buffered noise level, or as said first difference if said first signal level is less than said first buffered noise level and if said first difference is a positive value, or as a predetermmined nominal differential value if said first signal level is less than said first buffered noise level and if said first difference is a negative value; and
  
  (b) means for selecting a second addend as the absolute value of said second difference if said second signal level is greater than said second buffered noise level, or as said second difference if said second signal level is less than said second buffered noise level and if said second difference is a positive value, or as a predetermined nominal differential value if said second signal level is less than said second buffered noise level and if said second difference is a negative value.
- View Dependent Claims (5, 6)
- - 5. A word recognition detector is accordance with claim 4 wherein said means for generating a distance measure further comprises:
    - means for assigning a first binary value to a storage means associated with said first channel if said first signal level is greater than said first buffered noise level;
      
      means for assigning a second binary value to said storage means associated with said first channel if said first signal level is less than said first buffered noise level;
      
      means for summing the absolute value of said first difference into said distance measure if said first channel has an associated first binary assigned storage value;
      
      means for determining whether said first difference is a positive value and, in response to said positive value determination, for adding said first difference to said distance measure if said first channel has an associated second binary assigned storage value; and
      
      means for determining whether said first difference is a negative value and, in response to said negative value determination, for adding a predetermined nominal differential value to said distance measure if said first channel has an associated second binary assigned storage value.
  - 6. A word recognition detector in accordance with claim 4 wherein said means for generating a distance measure further comprises:
    - means for determining whether both said first and second signal levels exceed a predetermined threshold level andmeans for inhibiting said means for generating if both said first and second signal levels do not exceed said predetermined threshold level.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Motorola, Inc. (Motorola Solutions, Inc.)
Original Assignee
Motorola, Inc. (Motorola Solutions, Inc.)
Inventors
Lindsley, Brett L., Gerson, Ira A.
Primary Examiner(s)
NOT, DEFINED
Assistant Examiner(s)
Merecki, John A.

Application Number

US07/357,688
Time in Patent Office

327 Days
Field of Search

381/41-47, 381/36-40, 381/71, 381/94, 364/513.5
US Class Current

704/233
CPC Class Codes

G10L 15/00 Speech recognition G10L17/0...

Frame comparison method for word recognition in high noise environments

First Claim

0 Assignments

0 Petitions

Accused Products

Abstract

49 Citations

6 Claims

Specification

Solutions

Use Cases

Quick Links

Frame comparison method for word recognition in high noise environments

First Claim

0 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

49 Citations

6 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links