SYSTEM AND METHOD FOR TRACKING SOUND PITCH ACROSS AN AUDIO SIGNAL USING HARMONIC ENVELOPE
Abstract
A system and method may be configured to analyze audio information derived from an audio signal. The system and method may track sound pitch across the audio signal. The tracking of pitch across the audio signal may take into account change in pitch by determining at individual time sample windows in the signal duration an estimated pitch and a representation of harmonic envelope at the estimated pitch. The estimated pitch and the representation of harmonic envelope may then be implemented to determine an estimated pitch for another time sample window in the signal duration with an enhanced accuracy and/or precision.
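As a rough illustration of the "representation of harmonic envelope" mentioned in the abstract, the sketch below samples a magnitude spectrum at the harmonics of a candidate pitch. The function name `envelope_vector`, the uniform bin spacing `bin_hz`, the nearest-bin lookup, and the default of four harmonics are assumptions made for this example only; the source does not specify them.

```python
def envelope_vector(magnitudes, bin_hz, pitch_hz, num_harmonics=4):
    """Return spectrum magnitudes at the first harmonics of pitch_hz.

    Assumes `magnitudes` holds one value per uniformly spaced frequency bin,
    with bin i centered at i * bin_hz (an assumption of this sketch).
    """
    envelope = []
    for k in range(1, num_harmonics + 1):
        index = round(k * pitch_hz / bin_hz)  # nearest bin to the k-th harmonic
        # Harmonics above the analyzed band contribute zero.
        envelope.append(magnitudes[index] if index < len(magnitudes) else 0.0)
    return envelope


# Toy spectrum: 20 bins spaced 50 Hz apart, with energy at 100, 200, 300, 400 Hz.
spectrum = [0.0] * 20
for bin_index, value in [(2, 1.0), (4, 0.5), (6, 0.25), (8, 0.125)]:
    spectrum[bin_index] = value

print(envelope_vector(spectrum, 50.0, 100.0))  # envelope at the true pitch
print(envelope_vector(spectrum, 50.0, 150.0))  # envelope at a wrong candidate
```

Each coordinate of the returned list corresponds to one harmonic, so two windows can be compared coordinate by coordinate even when their pitches differ.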
27 Citations
21 Claims
1. A system configured to analyze audio information, the system comprising:
one or more processors configured to execute computer program modules, the modules comprising:
an audio information module configured to obtain audio information derived from an audio signal representing one or more sounds, wherein the audio information includes audio information that corresponds to an audio signal during a first time sample window, and wherein such information includes transformed audio information that specifies a magnitude of an intensity coefficient related to an intensity of the audio signal as a function of frequency during the first time sample window;
an envelope vector module configured to determine, as a function of pitch in the first time sample window, an envelope vector having coordinates, wherein the envelope vector module is configured to determine the envelope vector for a given pitch in the first time sample window based on the values for the intensity coefficient at harmonic frequencies of the given pitch in the first time sample window;
an envelope correlation module configured to obtain an envelope vector for a sound represented by the audio signal during a second time sample window, and to determine, for the first time sample window, a correlation metric as a function of pitch, wherein the correlation metric for a given pitch in the first time sample window indicates a level of correlation between the envelope vector for the second time sample window and the envelope vector for the given pitch in the first time sample window; and
a pitch estimation module configured to determine an estimated pitch for the first time sample window based on the determination of the correlation metric for the first time sample window.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A method of analyzing audio information, the method comprising:
obtaining audio information derived from an audio signal representing one or more sounds, wherein the audio information includes audio information that corresponds to an audio signal during a first time sample window, and wherein such information includes transformed audio information that specifies a magnitude of an intensity coefficient related to an intensity of the audio signal as a function of frequency during the first time sample window;
determining, as a function of pitch in the first time sample window, an envelope vector having coordinates, wherein determination of the coordinates of the envelope vector for a given pitch in the first time sample window is based on the values for the intensity coefficient at harmonic frequencies of the given pitch in the first time sample window;
obtaining an envelope vector for a sound represented by the audio signal during a second time sample window;
determining, for the first time sample window, a correlation metric as a function of pitch, wherein the correlation metric for a given pitch in the first time sample window indicates a level of correlation between the envelope vector for the second time sample window and the envelope vector for the given pitch in the first time sample window; and
determining an estimated pitch for the first time sample window based on the determination of the correlation metric for the first time sample window.
Dependent claims: 9, 10, 11, 12, 13
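The steps of the method in claim 8 can be sketched in a few lines. This is only one possible reading: the claim requires "a level of correlation" without naming a formula, so cosine similarity is swapped in here as an assumption. The helper names, the nearest-bin harmonic lookup, and the rule of picking the candidate with the highest metric are likewise hypothetical.

```python
import math


def harmonic_envelope(magnitudes, bin_hz, pitch_hz, num_harmonics=4):
    """Magnitudes at the harmonics of pitch_hz (nearest-bin lookup, assumed)."""
    envelope = []
    for k in range(1, num_harmonics + 1):
        index = round(k * pitch_hz / bin_hz)
        envelope.append(magnitudes[index] if index < len(magnitudes) else 0.0)
    return envelope


def cosine_similarity(a, b):
    """Stand-in correlation metric; returns 0.0 when either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0.0 else 0.0


def estimate_pitch(magnitudes, bin_hz, reference_envelope, candidate_pitches):
    """Correlation metric as a function of pitch, plus the best-scoring pitch."""
    metric = {
        pitch: cosine_similarity(
            reference_envelope, harmonic_envelope(magnitudes, bin_hz, pitch)
        )
        for pitch in candidate_pitches
    }
    return max(metric, key=metric.get), metric


# First window: toy spectrum (10 Hz bins) with harmonics of 110 Hz.
first_window = [0.0] * 64
for bin_index, value in [(11, 0.9), (22, 0.5), (33, 0.2), (44, 0.1)]:
    first_window[bin_index] = value

# Envelope vector already obtained for the second (reference) window.
second_window_envelope = [1.0, 0.5, 0.25, 0.1]

best, metric = estimate_pitch(
    first_window, 10.0, second_window_envelope, [100.0, 110.0, 120.0]
)
print(best, metric)
```

In this toy input only the 110 Hz candidate lands on bins that contain energy, so its envelope is the only one that correlates with the reference envelope.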
15. A system configured to analyze audio information, the system comprising:
one or more processors configured to execute computer program modules, the modules comprising:
an audio information module configured to obtain audio information derived from an audio signal representing one or more sounds, wherein the audio information includes audio information that corresponds to an audio signal during a set of discrete time sample windows, wherein such information includes for the individual time sample windows (i) transformed audio information that specifies a magnitude of an intensity coefficient related to an intensity of the audio signal as a function of frequency during an individual time sample window, and (ii) pitch likelihood information that specifies, as a function of pitch, a pitch likelihood metric for an individual time sample window;
a processing window module configured to define a processing time window within the signal duration, the processing time window including a plurality of time sample windows;
a pitch estimation module configured to determine, for the individual time sample windows in the processing time window, an estimated pitch;
an envelope vector module configured to determine, for the individual time sample windows, an envelope vector as a function of pitch such that the envelope vector for a given pitch in a given time sample window is determined based on the values for the intensity coefficient at harmonic frequencies of the given pitch in the given time sample window;
a primary window module configured to identify a primary time sample window within the processing time window, and to identify a primary pitch within the primary time sample window, wherein the primary pitch is considered to be the estimated pitch for the primary time sample window;
wherein the pitch estimation module is further configured to determine estimated pitch for the time sample windows in the processing time window other than the primary time sample window by iterating through the processing time window from the primary time sample window toward one or both of the boundaries of the processing time window and determining the estimated pitch for a given time sample window based on (i) the pitch likelihood metric specified by the transformed audio information for the given time sample window, and (ii) the envelope vector for an estimated pitch in a time sample window adjacent to the given time sample window.
Dependent claims: 14, 16, 17, 18, 19, 20, 21
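The iteration described in claim 15 can be illustrated with a minimal sketch. Several choices below are assumptions, not taken from the source: the primary window is picked as the one with the highest peak likelihood, the likelihood and envelope correlation are combined by multiplication, and cosine similarity stands in for the correlation. The inputs `likelihoods` and `envelopes` are hypothetical per-window dictionaries keyed by candidate pitch.

```python
import math


def cosine_similarity(a, b):
    """Stand-in correlation between two envelope vectors (assumed metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0.0 else 0.0


def track_pitch(likelihoods, envelopes):
    """Estimate one pitch per window, working outward from a primary window.

    likelihoods[t][p]: pitch likelihood metric for pitch p in window t.
    envelopes[t][p]:   envelope vector for pitch p in window t.
    """
    count = len(likelihoods)
    # Hypothetical rule: the primary window has the strongest likelihood peak.
    primary = max(range(count), key=lambda t: max(likelihoods[t].values()))
    estimates = [None] * count
    estimates[primary] = max(likelihoods[primary], key=likelihoods[primary].get)

    def estimate(t, neighbor):
        # Envelope at the pitch already estimated for the adjacent window.
        reference = envelopes[neighbor][estimates[neighbor]]

        def score(pitch):
            return likelihoods[t][pitch] * cosine_similarity(
                reference, envelopes[t][pitch]
            )

        estimates[t] = max(likelihoods[t], key=score)

    for t in range(primary + 1, count):   # toward the later boundary
        estimate(t, t - 1)
    for t in range(primary - 1, -1, -1):  # toward the earlier boundary
        estimate(t, t + 1)
    return estimates


likelihoods = [
    {100.0: 0.5, 200.0: 0.6},  # likelihood alone would favor 200 Hz here
    {100.0: 0.9, 200.0: 0.3},  # strongest peak: becomes the primary window
    {100.0: 0.7, 200.0: 0.2},
]
envelopes = [
    {100.0: [0.9, 0.5, 0.2], 200.0: [0.0, 0.2, 1.0]},
    {100.0: [1.0, 0.5, 0.2], 200.0: [0.5, 0.0, 0.0]},
    {100.0: [1.0, 0.4, 0.2], 200.0: [0.1, 0.1, 0.1]},
]
print(track_pitch(likelihoods, envelopes))
```

In the toy data, the first window's likelihood slightly prefers 200 Hz, but its 200 Hz envelope correlates poorly with the primary window's envelope, so the combined score selects 100 Hz instead.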
Specification