System and method for tracking sound pitch across an audio signal using harmonic envelope
First Claim
1. A system configured to track pitch in an audio signal, the system comprising:
an electronic storage storing computer program modules; and
one or more processors configured to execute the computer program modules, the computer program modules being configured to:
receive the audio signal obtained from a user input device;
obtain a first transformation of the audio signal in a first time period, wherein the first transformation represents the audio signal as a function of frequency in the first time period;
obtain a first pitch corresponding to a first sound in the first time period of the audio signal;
determine a first envelope vector of the first time period from the first transformation in a multi-dimensional space, wherein each dimension of the multi-dimensional space corresponds to one of a plurality of harmonics of a pitch and the first envelope vector of the first time period is defined by a first set of coordinates corresponding to intensity coefficients at a plurality of harmonics of the first pitch in the first transformation;
obtain a second transformation of the audio signal in a second time period, wherein the second time period is different from the first time period and the second transformation represents the audio signal as a function of frequency in the second time period;
obtain a second pitch corresponding to a second sound in the second time period of the audio signal;
determine a second envelope vector of the second time period from the second transformation in the multi-dimensional space, wherein the second envelope vector of the second time period is defined by a second set of coordinates corresponding to intensity coefficients at a plurality of harmonics of the second pitch in the second transformation;
determine a first correlation between the first envelope vector of the first time period and the second envelope vector of the second time period;
obtain a third pitch corresponding to a third sound in the second time period of the audio signal;
determine a third envelope vector of the second time period from the second transformation in the multi-dimensional space, wherein the third envelope vector of the second time period is defined by a third set of coordinates corresponding to intensity coefficients at a plurality of harmonics of the third pitch in the second transformation;
determine a second correlation between the first envelope vector of the first time period and the third envelope vector of the second time period; and
determine, using the first correlation and the second correlation, that the first sound in the first time period of the audio signal and the second sound in the second time period of the audio signal are portions of a same harmonic sound.
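The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The function names (`envelope_vector`, `correlation`, `synth_spectrum`), the nearest-bin sampling of the transformation, the choice of cosine similarity as the correlation, and the rule that the more strongly correlated candidate continues the first sound are all assumptions made for illustration, and the spectra are synthetic.

```python
import math

def envelope_vector(spectrum, bin_hz, pitch_hz, n_harmonics=5):
    """Read spectral magnitude at harmonics 1..n of pitch_hz (nearest bin)."""
    vec = []
    for k in range(1, n_harmonics + 1):
        idx = round(k * pitch_hz / bin_hz)
        vec.append(spectrum[idx] if idx < len(spectrum) else 0.0)
    return vec

def correlation(a, b):
    """Cosine similarity between two envelope vectors (an assumed measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def synth_spectrum(sounds, bin_hz, n_bins):
    """Build a toy magnitude spectrum from (pitch_hz, harmonic_amplitudes) pairs."""
    spec = [0.0] * n_bins
    for pitch_hz, amps in sounds:
        for k, amp in enumerate(amps, start=1):
            idx = round(k * pitch_hz / bin_hz)
            if idx < n_bins:
                spec[idx] += amp
    return spec

BIN_HZ, N_BINS = 10.0, 400                 # hypothetical transform resolution
voice = [1.0, 0.6, 0.3, 0.15, 0.05]        # harmonic envelope of the tracked sound
other = [0.2, 0.9, 0.1, 0.7, 0.4]          # harmonic envelope of a competing sound

# First time period: one sound at 200 Hz.
first_transform = synth_spectrum([(200.0, voice)], BIN_HZ, N_BINS)
# Second time period: the same sound, now at 210 Hz, plus a second sound at 330 Hz.
second_transform = synth_spectrum([(210.0, voice), (330.0, other)], BIN_HZ, N_BINS)

first_env = envelope_vector(first_transform, BIN_HZ, 200.0)
second_env = envelope_vector(second_transform, BIN_HZ, 210.0)
third_env = envelope_vector(second_transform, BIN_HZ, 330.0)

first_corr = correlation(first_env, second_env)
second_corr = correlation(first_env, third_env)

# Assumed decision rule: the candidate whose envelope matches better continues the sound.
same_sound = first_corr > second_corr
print(first_corr, second_corr, same_sound)
```

Because each coordinate is indexed by harmonic number rather than by absolute frequency, the envelope of the 200 Hz sound can be compared directly with the envelope at 210 Hz even though the pitch has moved.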
Abstract
A system and method may be configured to analyze audio information derived from an audio signal. The system and method may track sound pitch across the audio signal. The tracking of pitch across the audio signal may take into account change in pitch by determining at individual time sample windows in the signal duration an estimated pitch and a representation of harmonic envelope at the estimated pitch. The estimated pitch and the representation of harmonic envelope may then be implemented to determine an estimated pitch for another time sample window in the signal duration with an enhanced accuracy and/or precision.
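As a rough illustration of how an estimated pitch and harmonic envelope from one time sample window might inform the estimate in the next, the following Python sketch follows a single sound through a sequence of windows. The `track_pitch` helper, the cosine-similarity measure, the "pick the best-matching candidate" rule, and all numeric values are hypothetical and are not taken from the specification.

```python
import math

def correlation(a, b):
    """Cosine similarity between two harmonic envelopes (an assumed measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def track_pitch(windows):
    """Follow one sound through windows of (pitch_hz, envelope) candidates."""
    pitch, prev_env = windows[0][0]        # seed with the first window's estimate
    track = [pitch]
    for candidates in windows[1:]:
        # Keep the candidate whose envelope best matches the previous window's.
        pitch, prev_env = max(candidates, key=lambda c: correlation(prev_env, c[1]))
        track.append(pitch)
    return track

# Invented candidates: each window lists (estimated pitch in Hz, harmonic envelope).
windows = [
    [(200.0, [1.0, 0.6, 0.3, 0.15])],
    [(330.0, [0.2, 0.9, 0.1, 0.7]), (206.0, [0.9, 0.6, 0.25, 0.1])],
    [(212.0, [0.95, 0.5, 0.3, 0.1]), (335.0, [0.25, 0.8, 0.15, 0.6])],
]

track = track_pitch(windows)
print(track)
```

In this toy data the tracker follows the slowly rising pitch near 200 Hz and ignores the competing sound near 330 Hz, because the latter's envelope has a different shape.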
24 Claims
1. A system configured to track pitch in an audio signal, as recited in full under First Claim above.
Dependent claims: 2, 3, 4, 5, 6, 7, 8.

9. A method for tracking pitch in an audio signal, the method comprising:
receiving the audio signal obtained from a user input device;
obtaining a first transformation of the audio signal in a first time period, wherein the first transformation represents the audio signal as a function of frequency in the first time period;
obtaining a first pitch corresponding to a first sound in the first time period of the audio signal;
determining a first envelope vector of the first time period from the first transformation in a multi-dimensional space, wherein each dimension of the multi-dimensional space corresponds to one of a plurality of harmonics of a pitch and the first envelope vector of the first time period is defined by a first set of coordinates corresponding to intensity coefficients at a plurality of harmonics of the first pitch in the first transformation;
obtaining a second transformation of the audio signal in a second time period, wherein the second time period is different from the first time period and the second transformation represents the audio signal as a function of frequency in the second time period;
obtaining a second pitch corresponding to a second sound in the second time period of the audio signal;
determining a second envelope vector of the second time period from the second transformation in the multi-dimensional space, wherein the second envelope vector of the second time period is defined by a second set of coordinates corresponding to intensity coefficients at a plurality of harmonics of the second pitch in the second transformation;
determining a first correlation between the first envelope vector of the first time period and the second envelope vector of the second time period;
obtaining a third pitch corresponding to a third sound in the second time period of the audio signal;
determining a third envelope vector of the second time period from the second transformation in the multi-dimensional space, wherein the third envelope vector of the second time period is defined by a third set of coordinates corresponding to intensity coefficients at a plurality of harmonics of the third pitch in the second transformation;
determining a second correlation between the first envelope vector of the first time period and the third envelope vector of the second time period; and
determining, using the first correlation and the second correlation, that the first sound in the first time period of the audio signal and the second sound in the second time period of the audio signal are portions of a same harmonic sound.
Dependent claims: 10, 11, 12, 13, 14, 15, 16.

17. A non-transitory computer readable storage medium having data stored therein representing computer program modules executable by a computer, the computer program modules including instructions to track pitch in an audio signal, the storage medium comprising:
instructions for receiving the audio signal obtained from a user input device;
instructions for obtaining a first transformation of the audio signal in a first time period, wherein the first transformation represents the first portion of the audio signal as a function of frequency in the first time period;
instructions for obtaining a first pitch corresponding to a first sound in the first time period of the audio signal;
instructions for determining a first envelope vector of the first time period from the first transformation in a multi-dimensional space, wherein each dimension of the multi-dimensional space corresponds to one of a plurality of harmonics of a pitch and the first envelope vector of the first time period is defined by a first set of coordinates corresponding to intensity coefficients at a plurality of harmonics of the first pitch in the first transformation;
instructions for obtaining a second transformation of the audio signal in a second time period, wherein the second time period is different from the first time period and the second transformation represents the second portion of the audio signal as a function of frequency in the second time period;
instructions for obtaining a second pitch corresponding to a second sound in the second time period of the audio signal;
instructions for determining a second envelope vector of the second time period from the second transformation in the multi-dimensional space, wherein the second envelope vector of the second time period is defined by a second set of coordinates corresponding to intensity coefficients at a plurality of harmonics of the second pitch in the second transformation;
instructions for determining a first correlation between the first envelope vector of the first time period and the second envelope vector of the second time period;
instructions for obtaining a third pitch corresponding to a third sound in the second time period of the audio signal;
instructions for determining a third envelope vector of the second time period from the second transformation in the multi-dimensional space, wherein the third envelope vector of the second time period is defined by a third set of coordinates corresponding to intensity coefficients at a plurality of harmonics of the third pitch in the second transformation;
instructions for determining a second correlation between the first envelope vector of the first time period and the third envelope vector of the second time period; and
instructions for determining, using the first correlation and the second correlation, that the first sound in the first time period of the audio signal and the second sound in the second time period of the audio signal are portions of a same harmonic sound.
Dependent claims: 18, 19, 20, 21, 22, 23, 24.
Specification