×

Speech detection for noisy conditions

  • US 6,480,823 B1
  • Filed: 03/24/1998
  • Issued: 11/12/2002
  • Est. Priority Date: 03/24/1998
  • Status: Expired due to Fees
First Claim
Patent Images

1. A speech detection system for examining an input signal to determine whether a speech signal is present or absent, comprising:

  • a frequency band splitter for splitting said input signal into a plurality of frequency bands, each band representing a band-limited signal energy corresponding to a different range of frequencies;

    an energy comparator system for comparing the band-limited signal energy of said plurality of frequency bands with a plurality of thresholds such that each frequency band is compared with at least one threshold associated with that band;

    a speech signal state machine coupled to said energy comparator system that switches;

    (a) from a speech-absent state to a speech-present state when the band-limited signal energy of at least one of said bands is above at least one of its associated thresholds, and (b) from a speech-present state to a speech-absent state when the band-limited signal energy of at least one of said bands is below at least one of its associated thresholds;

    a histogram data structure residing in computer memory accessible to said speech detection system wherein said histogram data structure initially has a size based at least in part on the energy level of the non-speech portion of the input signal, and wherein said histogram data structure is organized by a predetermined number of histogram steps having a step size based at least in part on a mean of accumulated historical data;

    a histogram updating module operable to periodically update said histogram data structure based on a portion of the input signal having an energy level falling within the size of the histogram data structure, said histogram updating module further operable to adjust the size of said histogram data structure based on actual operating conditions wherein said histogram updating module periodically adjusts the step size to reflect a change in said mean, thereby affecting adjustment of the size of the histogram data structure based on actual operating conditions; and

    an adaptive threshold updating system that employs said histogram data structure to accumulate historical data indicative of a pre-speech silence portion of said input signal within at least one of said frequency bands such that an energy level of greatest magnitude among all energy levels of the historical data defines a noise floor, the updating system using the noise floor to adjust at least one of said plurality of thresholds used by said energy comparator, said historical data being initially limited to a non-speech portion of the input signal.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×