×

Audio fingerprinting based on audio energy characteristics

  • US 10,540,993 B2
  • Filed: 08/10/2017
  • Issued: 01/21/2020
  • Est. Priority Date: 04/08/2016
  • Status: Active Grant
First Claim
Patent Images

1. A method of audio fingerprinting comprising:

  • obtaining audio samples of a piece of audio, each of the audio samples corresponding to a specific time;

    generating frequency representations of the audio samples, the frequency representations being divided in frequency bands;

    identifying energy regions in the frequency bands, each of the energy regions being one of an increasing energy region and a decreasing energy region, an increasing energy region defined as a time region within one of the frequency bands during which audio energy increases from a start time to an end time of the time region and a decreasing energy region defined as a time region within one of the frequency bands during which audio energy decreases from a start time to an end time of the time region;

    analyzing portions of the identified energy regions appearing within time windows to generate hashes of features of the piece of audio, each hash of features corresponding to portions of the identified energy regions appearing in a respective time window, each feature defined as a numeric value that encodes information representing;

    a frequency band of an energy region appearing in the respective time window, whether the energy region appearing in the respective time window is an increasing energy region or whether the energy region appearing in the respective time window is a decreasing energy region, and a placement of the energy region appearing in the respective time window, the placement of the energy region appearing in the respective time window corresponding to one of;

    whether the energy region appearing in the respective time window starts before and ends after the respective time window,whether the energy region appearing in the respective time window starts before and ends within the respective time window,whether the energy region appearing in the respective time window starts within and ends after the respective time window, andwhether the energy region appearing in the respective time window starts within and ends within the respective time window; and

    storing each hash of features together with the specific time,wherein the frequency bands include forty four frequency bands whose bandwidth decrease logarithmically from a first frequency band that starts at 200 Hz to a forty fourth frequency band that ends at 3300 Hz.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×