Intervalgram representation of audio for melody recognition

  • US 8,158,870 B2
  • Filed: 06/29/2010
  • Issued: 04/17/2012
  • Est. Priority Date: 06/29/2010
  • Status: Active Grant
First Claim

1. A computer-implemented method for matching audio clips, the method executed by a computer system, and comprising:

    receiving an audio chromagram representing an input audio clip, the audio chromagram comprising a sequence of vectors, each vector in the audio chromagram corresponding to a different time point of the input audio clip, and each vector representing a distribution of audio pitches at the corresponding time point of the input audio clip;

    selecting a sampling of different reference time points within the audio chromagram;

    for each of the selected reference time points, generating a chroma block having a plurality of vectors, each of the plurality of vectors in the chroma block corresponding to a different time sub-period of the input audio clip, and each vector representing a weighted average of distributions of audio pitches within the corresponding time sub-period;

    for each of the selected reference time points, generating a reference vector representing a reference distribution of pitches for the selected reference time point in the audio chromagram;

    applying a circular cross-correlation of the vectors of each chroma block against the reference vector to produce a sequence of intervalgram blocks for the input audio clip, wherein the sequence of intervalgram blocks comprises an intervalgram representation for the input audio clip, each intervalgram block associated with a different time period within the input audio clip, and each intervalgram block representing a distribution of pitch intervals occurring between different sub-periods within the time period;

    comparing the intervalgram representation for the input audio clip to stored intervalgram representations corresponding to reference audio clips in a reference database;

    selecting a reference audio clip from the reference database having an intervalgram representation best matching the intervalgram representation for the input audio clip; and

    generating a recognition result indicative of the selected reference audio clip.
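
The steps recited in the claim can be illustrated with a minimal sketch. It is not the patented implementation: the uniform averaging weights, the choice of the frame at the reference point as the reference vector, the sub-period layout (`sub_len`, `n_sub`), the squared-difference distance, and every function name below are assumptions made for illustration. The toy helper `one_hot_chroma` stands in for a real chromagram.

```python
from typing import Dict, List, Sequence

Vector = List[float]
Block = List[Vector]          # one intervalgram block: one vector per sub-period
Intervalgram = List[Block]    # one block per selected reference time point


def circular_xcorr(v: Sequence[float], ref: Sequence[float]) -> Vector:
    """out[k] = sum_i ref[i] * v[(i + k) % n].

    A peak at k means v resembles ref shifted up by k pitch bins.
    """
    n = len(ref)
    return [sum(ref[i] * v[(i + k) % n] for i in range(n)) for k in range(n)]


def average(frames: Sequence[Sequence[float]]) -> Vector:
    """Uniformly weighted average of chroma frames (weighting is an assumption)."""
    return [sum(col) / len(frames) for col in zip(*frames)]


def intervalgram(chroma: Sequence[Sequence[float]],
                 ref_points: Sequence[int],
                 sub_len: int = 2,
                 n_sub: int = 2) -> Intervalgram:
    """Build one intervalgram block per reference time point."""
    blocks: Intervalgram = []
    for t in ref_points:
        ref = chroma[t]  # assumption: reference vector is the frame at t
        block: Block = []
        for s in range(n_sub):
            # Chroma-block vector: average over one time sub-period after t.
            frames = chroma[t + s * sub_len: t + (s + 1) * sub_len]
            if frames:
                block.append(circular_xcorr(average(frames), ref))
        blocks.append(block)
    return blocks


def distance(a: Intervalgram, b: Intervalgram) -> float:
    """Sum of squared differences; assumes both intervalgrams share a shape."""
    return sum((x - y) ** 2
               for block_a, block_b in zip(a, b)
               for va, vb in zip(block_a, block_b)
               for x, y in zip(va, vb))


def best_match(query: Intervalgram, database: Dict[str, Intervalgram]) -> str:
    """Return the name of the stored intervalgram closest to the query."""
    return min(database, key=lambda name: distance(query, database[name]))


def one_hot_chroma(pitch_classes: Sequence[int], n_bins: int = 12) -> List[Vector]:
    """Toy chromagram: one active pitch class per time point."""
    return [[1.0 if b == p % n_bins else 0.0 for b in range(n_bins)]
            for p in pitch_classes]


points = [0, 2, 4]
database = {
    "song_a": intervalgram(one_hot_chroma([0, 0, 4, 4, 7, 7, 0, 0]), points),
    "song_b": intervalgram(one_hot_chroma([0, 0, 2, 2, 5, 5, 0, 0]), points),
}
# The query is song_a transposed up by three semitones.
query = intervalgram(one_hot_chroma([3, 3, 7, 7, 10, 10, 3, 3]), points)
result = best_match(query, database)
```

Because each block is correlated against a reference vector taken from the same clip, transposing the whole clip shifts both operands by the same amount and leaves the intervalgram unchanged. In this toy setup the transposed query therefore has zero distance to `song_a` and selects it as the match.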
