Sound alignment using timing information
First Claim
1. A method implemented by one or more computing devices, the method comprising:
identifying features of first sound data generated from a first sound signal using a feature module, the features including bases that describe spectral characteristics of the first sound data and weights that describe temporal features of the first sound data;
identifying timing information of the first sound data using a timing module, the timing information being a cross-correlation of the weights for different frames of the first sound data;
estimating parameters of the features and the timing information of the first sound data;
processing second sound data generated from a second sound signal to identify second features and second timing information of the second sound data that are within the estimated parameters of the first sound data;
extracting the identified features of the first sound data;
inserting the extracted identified features of the first sound data into the second sound data based on the second features and second timing information of the second sound data, the inserting effective to provide altered second sound data; and
producing the altered second sound data with the extracted identified features of the first sound data.
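The first two steps of the claim can be illustrated with a minimal sketch. It assumes non-negative matrix factorization (NMF) with multiplicative updates as the way to obtain bases and weights; the claim itself does not name a factorization technique, so this is a stand-in for illustration only. The function names (`nmf`, `frame_cross_correlation`, `reconstruction_error`) and the toy spectrogram are hypothetical.

```python
import random

def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]

def transpose(m):
    return [list(col) for col in zip(*m)]

def scale(m, num, den, eps):
    """Element-wise multiplicative update: m * num / den."""
    return [[v * n / (d + eps) for v, n, d in zip(mr, nr, dr)]
            for mr, nr, dr in zip(m, num, den)]

def nmf(spectrogram, rank, iters=200, seed=0, eps=1e-9):
    """Factor a non-negative (bins x frames) matrix into bases and weights.

    Assumption: NMF stands in for the claimed feature module.
    bases   (bins x rank)   describe spectral characteristics,
    weights (rank x frames) describe how active each basis is per frame.
    """
    rng = random.Random(seed)
    n_bins, n_frames = len(spectrogram), len(spectrogram[0])
    bases = [[rng.random() + eps for _ in range(rank)] for _ in range(n_bins)]
    weights = [[rng.random() + eps for _ in range(n_frames)] for _ in range(rank)]
    for _ in range(iters):
        bt = transpose(bases)
        weights = scale(weights, matmul(bt, spectrogram),
                        matmul(matmul(bt, bases), weights), eps)
        wt = transpose(weights)
        bases = scale(bases, matmul(spectrogram, wt),
                      matmul(bases, matmul(weights, wt)), eps)
    return bases, weights

def frame_cross_correlation(weights):
    """Timing information: correlate the weight vectors of every pair of frames."""
    frames = transpose(weights)  # one weight vector per frame
    return [[sum(x * y for x, y in zip(f1, f2)) for f2 in frames] for f1 in frames]

def reconstruction_error(spectrogram, bases, weights):
    approx = matmul(bases, weights)
    return sum((v - a) ** 2
               for vr, ar in zip(spectrogram, approx) for v, a in zip(vr, ar))

# Toy 4-bin x 6-frame magnitude spectrogram (made-up values):
# one source occupies the low bins in frames 0-2, another the high bins in frames 3-5.
spectrogram = [
    [4.0, 4.0, 4.0, 0.0, 0.0, 0.0],
    [2.0, 2.0, 2.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 3.0, 3.0, 3.0],
    [0.0, 0.0, 0.0, 1.0, 1.0, 1.0],
]
bases, weights = nmf(spectrogram, rank=2)
timing = frame_cross_correlation(weights)
```

Here `timing[i][j]` is large when frames `i` and `j` activate the same bases, which is one plausible reading of "a cross-correlation of the weights for different frames".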
Abstract
Sound alignment techniques that employ timing information are described. In one or more implementations, features and timing information of sound data generated from a first sound signal are identified and used to identify features of sound data generated from a second sound signal. The identified features may then be utilized to align portions of the sound data from the first and second sound signals to each other.
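The abstract does not say how the alignment itself is carried out. As a rough sketch only, the example below assumes the simplest case: a single global frame offset between two weight (activation) sequences, found by maximizing their cross-correlation. The helper names (`lag_score`, `best_lag`, `shift`) and the sample sequences are hypothetical, and a real implementation may well need time-varying alignment rather than one fixed lag.

```python
def lag_score(ref, other, lag):
    """Cross-correlation of two weight sequences at a given frame lag."""
    return sum(ref[i] * other[i + lag]
               for i in range(len(ref)) if 0 <= i + lag < len(other))

def best_lag(ref, other, max_lag):
    """Return the lag in [-max_lag, max_lag] with the highest correlation."""
    return max(range(-max_lag, max_lag + 1),
               key=lambda lag: lag_score(ref, other, lag))

def shift(seq, lag, length):
    """Re-index seq by lag frames, padding with zeros outside its range."""
    return [seq[i + lag] if 0 <= i + lag < len(seq) else 0.0
            for i in range(length)]

# Made-up per-frame weights for the same basis in two recordings;
# the second recording carries the same event two frames later.
ref_weights = [0.0, 0.0, 1.0, 3.0, 1.0, 0.0, 0.0, 0.0]
other_weights = [0.0, 0.0, 0.0, 0.0, 1.0, 3.0, 1.0, 0.0]

lag = best_lag(ref_weights, other_weights, max_lag=4)
aligned = shift(other_weights, lag, len(ref_weights))
```

With these inputs the estimated lag is 2 frames, and `aligned` matches `ref_weights` frame for frame.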
214 Citations
20 Claims
1. A method implemented by one or more computing devices, as recited in full under First Claim above.
Dependent claims: 2, 3, 10, 11, 12, 13, 15, 16
4. A system comprising:
at least one extraction module implemented at least partially in hardware and configured to process sound data generated from a first and a second sound signal and identify features and timing information common to the first and second sound signals, the identification of features and timing information based on an estimated set of parameters for the features and timing information of the sound data generated from the first sound signal;
the features including bases that describe spectral characteristics of the sound data and weights that describe temporal features of the sound data;
the timing information being a cross-correlation of the weights for different frames of the sound data; and
one or more modules implemented at least partially in hardware and configured to extract the identified features of the sound data from the first sound signal and insert the identified features of the sound data from the first sound signal into the second sound signal to produce altered sound data from the second sound signal with the identified features of the first sound signal.
Dependent claims: 5, 6, 14, 17, 18
7. One or more computer-readable and non-transitory storage media having instructions stored thereon that, responsive to execution on a computing device, causes the computing device to perform operations comprising:
identifying features and timing information of sound data of a first sound signal, the identified features including bases that describe spectral characteristics of the sound data and weights that describe temporal features of the sound data and the timing information is computed as a cross-correlation of the weights for different frames in the sound data generated from the first sound signal;
estimating parameters for the features and timing information of the sound data of the first sound signal;
processing sound data generated from a second sound signal to identify second features and second timing information that are within the estimated parameters from the sound data generated from the first sound signal;
extracting the identified features of the sound data from the first sound signal;
inserting the extracted identified features of the first sound signal into the second sound signal based on the second features and second timing information of the second sound signal, the inserting effective to provide altered second sound data; and
producing the altered second sound signal with the extracted identified features of the first sound signal.
Dependent claims: 8, 9, 19, 20
Specification