×

Method for utilizing validity constraints in a speech endpoint detector

  • US 6,718,302 B1
  • Filed: 01/12/2000
  • Issued: 04/06/2004
  • Est. Priority Date: 10/20/1997
  • Status: Expired due to Fees
First Claim
Patent Images

1. A system for detecting endpoints of an utterance, comprising:

  • a processor configured to manipulate speech energy corresponding to said utterance;

    a filter bank which band-passes said speech energy before providing said speech energy to, an endpoint detector that is responsive to said processor, said endpoint detector analyzing said speech energy in real time by progressively examining frames of said speech energy in sequence to determine threshold values and energy parameters, said energy parameters being short-term energy parameters corresponding to said frames of said speech energy, said short-term energy parameters being calculated using a following equation;

    DTF

    (i)
    =

    m=0M-1






    yi

    (m)


    wi

    (m)
    embedded imagewhere wi(m) is a respective weighting value, yi(m) is channel signal energy of a channel m at a frame i, and M is a total number of channels of said filter bank, said endpoint detector smoothing said short-term energy parameters by using a multiple-point median filter, said endpoint detector using a starting threshold and said short-term energy parameters to determine a starting point for a reliable island, said speech energy including at least one reliable island in which said short-term energy parameters are greater than said starting threshold and an ending threshold, said endpoint detector calculating a background noise value, said background noise value being derived from said short-term energy parameters during a background noise period, said background noise period ending at least 250 milliseconds ahead of said reliable island and having a normalized deviation that is less than a predetermined value, said endpoint detector comparing said threshold values with said energy parameters to identify a beginning point and an ending point of said utterance; and

    a validity manager, responsive to said processor, for analyzing said speech energy according to selectable criteria to thereby verify said utterance.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×