Method and apparatus for automatically recognizing input audio and/or video streams

  • US 9,715,626 B2
  • Filed: 12/02/2010
  • Issued: 07/25/2017
  • Est. Priority Date: 09/21/1999
  • Status: Expired due to Fees
First Claim

1. Audio signal recognition server apparatus adapted to receive, from a capture device, feature data that corresponds to a captured audio sample that is less than an entire reference audio work, the recognition server apparatus comprising:

    interface structure configured to receive the sample feature data from the capture device;

    a memory storing a library comprising (i) a first plurality of reference feature data sets which correspond to a first recorded reference audio work, and (ii) a second plurality of reference feature data sets which correspond to a second recorded reference audio work, each recorded reference audio work being longer than the captured audio sample; and

    server processing structure configured to:

    receive a first reference input audio signal corresponding to the first recorded reference audio work;

    separate the received first reference input audio signal into a first plurality of frequency bands which have different frequencies, a frequency bandwidth of a lower frequency band of the first plurality of frequency bands being narrower than a frequency bandwidth of a higher frequency band of the first plurality of frequency bands;

    compute the first plurality of reference feature data sets, which correspond to spectrally distinct portions of the first plurality of frequency bands of the first received reference input audio signal, this computing comprising performing envelope extraction on the first plurality of frequency bands to provide low-bandwidth amplitude measurements of each of the first plurality of frequency bands to provide the first plurality of reference feature data sets;

    store in the memory the first plurality of reference feature data sets which correspond to the first reference input audio signal;

    receive a second reference input audio signal corresponding to the second recorded reference audio work;

    separate the received second reference input audio signal into a second plurality of frequency bands which have different frequencies, a frequency bandwidth of a lower frequency band of the second plurality of frequency bands being narrower than a frequency bandwidth of a higher frequency band of the second plurality of frequency bands;

    compute the second plurality of reference feature data sets, which correspond to spectrally distinct portions of the second plurality of frequency bands of the second received reference input audio signal, this computing comprising performing envelope extraction on the second plurality of frequency bands to provide low-bandwidth amplitude measurements of each of the second plurality of frequency bands to provide the second plurality of reference feature data sets;

    store in the memory the second plurality of reference feature data sets which correspond to the second reference input audio signal;

    compare the sample feature data received by said interface structure with the stored first and second pluralities of reference feature data sets; and

    generate a recognition signal in response to the received sample feature data matching at least one reference feature data set of the stored first and second pluralities of reference feature data sets.
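
The server-side steps recited above (band separation with narrower low bands, envelope extraction, storage, comparison, and recognition) can be illustrated with a minimal Python sketch. It is not the patented implementation: the claim does not specify the filter design, the envelope method, the distance measure, or the match threshold. The sketch assumes geometrically spaced band edges, a frame-wise naive DFT whose per-band magnitude serves as the low-bandwidth amplitude measurement, and a sliding Euclidean comparison. All names (`band_edges`, `band_envelopes`, `recognize`, `tone_sequence`) and all numeric parameters are hypothetical.

```python
import cmath
import math

def band_edges(f_lo, f_hi, n_bands):
    """Geometric band edges: each lower band is narrower than the one above it."""
    ratio = (f_hi / f_lo) ** (1.0 / n_bands)
    return [f_lo * ratio ** i for i in range(n_bands + 1)]

def band_envelopes(signal, sample_rate, edges, frame_len=128):
    """Return one vector per frame holding an amplitude measurement per band."""
    bin_hz = sample_rate / frame_len
    features = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        # Naive DFT magnitudes for the positive-frequency bins of this frame.
        mags = [abs(sum(x * cmath.exp(-2j * math.pi * k * n / frame_len)
                        for n, x in enumerate(frame))) / frame_len
                for k in range(frame_len // 2)]
        # One value per band per frame: root of the summed squared magnitudes.
        features.append([
            math.sqrt(sum(m * m for k, m in enumerate(mags)
                          if lo <= k * bin_hz < hi))
            for lo, hi in zip(edges, edges[1:])])
    return features

def distance(a, b):
    """Root-mean (per frame) Euclidean distance between two feature sequences."""
    total = sum((x - y) ** 2 for fa, fb in zip(a, b) for x, y in zip(fa, fb))
    return math.sqrt(total / len(a))

def recognize(sample_feats, library, threshold):
    """Slide the sample over every reference; return the best match or None."""
    best_name, best_dist = None, float("inf")
    for name, ref_feats in library.items():
        for offset in range(len(ref_feats) - len(sample_feats) + 1):
            window = ref_feats[offset:offset + len(sample_feats)]
            d = distance(sample_feats, window)
            if d < best_dist:
                best_name, best_dist = name, d
    return best_name if best_dist <= threshold else None

def tone_sequence(freqs, sample_rate, seg_len):
    """Synthetic stand-in for a recorded work: one sine tone per segment."""
    out = []
    for f in freqs:
        start = len(out)
        out.extend(math.sin(2 * math.pi * f * (start + n) / sample_rate)
                   for n in range(seg_len))
    return out

# Hypothetical parameters chosen only to keep the example small.
SAMPLE_RATE = 4000
EDGES = band_edges(100.0, 2000.0, 4)

# Two reference works, each longer than the captured sample.
work_a = tone_sequence([150, 300, 700, 1500, 300, 150], SAMPLE_RATE, 256)
work_b = tone_sequence([1500, 700, 150, 1500, 300, 700], SAMPLE_RATE, 256)
library = {"work_a": band_envelopes(work_a, SAMPLE_RATE, EDGES),
           "work_b": band_envelopes(work_b, SAMPLE_RATE, EDGES)}

# The capture device sends features of a short excerpt of the first work.
sample = work_a[512:1024]
result = recognize(band_envelopes(sample, SAMPLE_RATE, EDGES), library,
                   threshold=0.05)
print(result)
```

In this sketch the library holds a sequence of per-frame feature vectors for each reference work, which loosely mirrors the "plurality of reference feature data sets" per work in the claim. A real system would likely need overlapping frames, windowing, and a comparison that tolerates noise and time misalignment, none of which is modelled here.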
