Multi-sourced noise suppression
First Claim
1. A method for multi-sourced noise suppression, the method comprising:
- assigning weights to audio streams, the audio streams being provided substantially synchronously by a plurality of audio devices, the weights depending on quality of the audio streams, wherein the assigning weights includes generating an acoustic activity map by locating, identifying and mapping target sounds and noise sources in at least one of a single room and multi-room environment, so as to create a multidimensional acoustic view of the environment;
based on the weights, performing noise suppression processing on the audio streams to generate a cleaned voice signal;
providing the cleaned voice signal from the noise suppression processing to at least one remote device for further processing; and
based on the acoustic activity map, selecting an optimal one of the plurality of audio devices to communicate with the user.
4 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for multi-sourced noise suppression are provided. An example system may receive streams of audio data including a voice signal and noise, the voice signal including a spoken word. The streams of audio data are provided by distributed audio devices. The system can assign weights to the audio streams based at least partially on quality of the audio streams. The weights of audio streams can be determined based on signal-to-noise ratios (SNRs). The system may further process, based on the weights, the audio stream to generate cleaned speech. Each audio device comprises microphone(s) and can be associated with the Internet of Things (IoT), such that the audio devices are Internet of Things devices. The processing can include noise suppression and reduction and echo cancellation. The cleaned speech can be provided to a remote device for further processing which may include Automatic Speech Recognition (ASR).
-
Citations
20 Claims
-
1. A method for multi-sourced noise suppression, the method comprising:
-
assigning weights to audio streams, the audio streams being provided substantially synchronously by a plurality of audio devices, the weights depending on quality of the audio streams, wherein the assigning weights includes generating an acoustic activity map by locating, identifying and mapping target sounds and noise sources in at least one of a single room and multi-room environment, so as to create a multidimensional acoustic view of the environment; based on the weights, performing noise suppression processing on the audio streams to generate a cleaned voice signal; providing the cleaned voice signal from the noise suppression processing to at least one remote device for further processing; and based on the acoustic activity map, selecting an optimal one of the plurality of audio devices to communicate with the user. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 18)
-
-
15. A system for multi-sourced audio processing, the system comprising:
-
a processor; and a memory communicatively coupled with the processor, the memory storing instructions, which, when executed by the processor, perform a method comprising; assigning weights to audio streams, the audio streams being provided substantially synchronously by a plurality of audio devices, the weights depending on quality of the audio streams; based on the weights, performing noise suppression processing on the audio streams to generate a cleaned voice signal, and providing the cleaned voice signal from the noise suppression processing to a remote device for further processing, wherein each of the audio devices includes at least one microphone and wherein the plurality of audio devices are physically separate from each other but connected in a dynamic network of connected devices, such that the audio devices are connected as part of an Internet of Things environment. - View Dependent Claims (16, 20)
-
-
17. A non-transitory computer-readable storage medium having embodied thereon instructions, which, when executed by at least one processor, perform steps of a method, the method comprising:
-
assigning weights to audio streams, the audio streams being provided substantially synchronously by a plurality of audio devices, the weights depending on quality of the audio streams, wherein the assigning weights includes generating an acoustic activity map by locating, identifying and mapping target sounds and noise sources in at least one of a single room and multi-room environment, so as to create a multidimensional acoustic view of the environment; based on the weights, performing noise suppression processing on the audio streams to generate a cleaned voice signal; providing the cleaned voice signal from the noise suppression processing to at least one remote device for further processing; and based on the acoustic activity map, selecting an optimal one of the plurality of audio devices to communicate with the user. - View Dependent Claims (19)
-
Specification