APPARATUS, SYSTEMS AND METHODS FOR PROVIDING CLOUD BASED BLIND SOURCE SEPARATION SERVICES
First Claim
1. A method for processing at least one signal acquired using an acoustic sensor, the at least one signal having contributions from a plurality of acoustic sources, the method comprising using one or more processors performing steps of:
- accessing an indication of a current block size, the current block size defining a size of a portion of the at least one signal to be analyzed to separate from the at least one signal one or more contributions from a first acoustic source of the plurality of acoustic sources;
analyzing a first portion of the at least one signal, the first portion being of the current block size, by;
computing one or more first characteristics from data of the first portion, andusing the computed one or more first characteristics, or derivatives thereof, in performing iterations of a nonnegative tensor factorization (NTF) model for the plurality of acoustic sources for the data of the first portion to separate, from at least the first portion of the at least one acquired signal, one or more first contributions from the first acoustic source; and
analyzing a second portion of the at least one signal, the second portion being of the current block size and being temporaly shifted with respect to the first portion, by;
computing one or more second characteristics from data of the second portion, andusing the computed one or more second characteristics, or derivatives thereof, in performing iterations of the NTF model for the data of the second portion to separate, from at least the second portion of the at least one acquired signal, one or more second contributions from the first acoustic source.
1 Assignment
0 Petitions
Accused Products
Abstract
Use of spoken input for user devices, e.g. smartphones, can be challenging due to presence of other sound sources. Blind source separation (BSS) techniques aim to separate a sound generated by a particular source of interest from a mixture of different sounds. Various BSS techniques disclosed herein are based on recognition that providing additional information that is considered within iterations of a nonnegative tensor factorization (NTF) model improves accuracy and efficiency of source separation. Examples of such information include direction estimates or neural network models trained to recognize a particular sound of interest. Furthermore, identifying and processing incremental changes to an NTF model, rather than re-processing the entire model each time data changes, provides an efficient and fast manner for performing source separation on large sets of quickly changing data. Carrying out at least parts of BSS techniques in a cloud allows flexible utilization of local and remote sources.
-
Citations
149 Claims
-
1. A method for processing at least one signal acquired using an acoustic sensor, the at least one signal having contributions from a plurality of acoustic sources, the method comprising using one or more processors performing steps of:
-
accessing an indication of a current block size, the current block size defining a size of a portion of the at least one signal to be analyzed to separate from the at least one signal one or more contributions from a first acoustic source of the plurality of acoustic sources; analyzing a first portion of the at least one signal, the first portion being of the current block size, by; computing one or more first characteristics from data of the first portion, and using the computed one or more first characteristics, or derivatives thereof, in performing iterations of a nonnegative tensor factorization (NTF) model for the plurality of acoustic sources for the data of the first portion to separate, from at least the first portion of the at least one acquired signal, one or more first contributions from the first acoustic source; and analyzing a second portion of the at least one signal, the second portion being of the current block size and being temporaly shifted with respect to the first portion, by; computing one or more second characteristics from data of the second portion, and using the computed one or more second characteristics, or derivatives thereof, in performing iterations of the NTF model for the data of the second portion to separate, from at least the second portion of the at least one acquired signal, one or more second contributions from the first acoustic source. - View Dependent Claims (2, 3, 4, 143, 144, 145, 146, 147, 148, 149)
-
-
5. -73. (canceled)
-
74. A method for processing at least one signal acquired using a corresponding acoustic sensor, the signal having contributions from a plurality of different acoustic sources, the method comprising using one or more processors performing steps of steps of:
-
computing time-dependent spectral characteristics from the at least one acquired signal, the spectral characteristics comprising a plurality of components; applying a neural network model to the time-dependent spectral characteristics, the neural network model configured to compute property estimates of a property, each component of a first subset of the components having a corresponding one or more property estimates of the property; performing iterations of a nonnegative tensor factorization (NTF) model for the plurality of acoustic sources, the iterations comprising (a) combining values of a plurality of parameters of the NTF model with the computed property estimates to separate from the at least one acquired signal one or more contributions from a first acoustic source of the plurality of acoustic sources. - View Dependent Claims (75, 76)
-
-
77. -85. (canceled)
-
86. A method for processing at least one signal acquired using a corresponding acoustic sensor, the signal having contributions from a plurality of different acoustic sources, the method comprising using one or more processors performing steps of steps of:
-
computing time-dependent spectral characteristics from the at least one acquired signal, the spectral characteristics comprising a plurality of components; accessing at least a first model configured to predict contributions from a first acoustic source of the plurality of acoustic sources; and performing iterations of a nonnegative tensor factorization (NTF) model for the plurality of acoustic sources, the iterations comprising running the first model to separate from the at least one acquired signal one or more contributions from the first acoustic source. - View Dependent Claims (87, 88)
-
-
89. -134. (canceled)
-
135. A system for processing at least one signal acquired using an acoustic sensor, the at least one signal having contributions from a plurality of acoustic sources, the system comprising:
-
at least one memory configured to store computer executable instructions; and at least one processor coupled to or comprising the at least one memory and configured, when executing the instructions, to carry out a method comprising; accessing an indication of a current block size, the current block size defining a size of a portion of the at least one signal to be analyzed to separate from the at least one signal one or more contributions from a first acoustic source of the plurality of acoustic sources; analyzing a first portion of the at least one signal, the first portion being of the current block size, by; computing one or more first characteristics from data of the first portion, and using the computed one or more first characteristics, or derivatives thereof, in performing iterations of a nonnegative tensor factorization (NTF) model for the plurality of acoustic sources for the data of the first portion to separate, from at least the first portion of the at least one acquired signal, one or more first contributions from the first acoustic source; and analyzing a second portion of the at least one signal, the second portion being of the current block size and being temporaly shifted with respect to the first portion, by; computing one or more second characteristics from data of the second portion, and using the computed one or more second characteristics, or derivatives thereof, in performing iterations of the NTF model for the data of the second portion to separate, from at least the second portion of the at least one acquired signal, one or more second contributions from the first acoustic source. - View Dependent Claims (136, 137)
-
-
138. -142. (canceled)
Specification