Tailoring beamforming techniques to environments
First Claim
1. An apparatus comprising:
one or more processors;
a microphone array; and
one or more computer-readable media storing computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform acts comprising:
generating, based at least in part on sound captured by the microphone array, a plurality of audio signals, wherein each of the plurality of audio signals corresponds to a respective microphone of the microphone array;
processing, by a beamforming component configured with one or more beamforming coefficients, at least a first audio signal of the plurality of audio signals to generate a first processed audio signal, wherein the first processed audio signal corresponds to a first portion of the sound received from a first direction;
processing, by the beamforming component configured with the one or more beamforming coefficients, at least a second audio signal of the plurality of audio signals to generate a second processed audio signal, wherein the second processed audio signal corresponds to a second portion of the sound received from a second direction;
selecting a direction of interest based at least in part on:
an amount of energy associated with a portion of the first processed audio signal;
an amount of energy associated with a portion of the second processed audio signal; and
directional data indicating at least one of a number of times speech has been identified from the first direction in previously processed audio signals or a number of times speech has been identified from the second direction in the previously processed audio signals; and
selecting, based at least in part on the direction of interest, the first processed audio signal.
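The selection step of the claim can be illustrated with a minimal sketch. The claim does not say how energy and directional data are combined, so everything below is an assumption made for illustration: the function names `signal_energy` and `select_direction`, the sum-of-squares energy measure, the normalization into shares, and the `history_weight` blend factor are all hypothetical.

```python
from math import fsum


def signal_energy(samples):
    """Sum of squared sample values for a portion of a processed signal."""
    return fsum(s * s for s in samples)


def select_direction(processed, speech_counts, history_weight=0.5):
    """Pick the direction with the best blend of energy and speech history.

    processed: dict mapping a direction label to a list of samples
    speech_counts: dict mapping a direction label to the number of times
        speech was previously identified from that direction
    history_weight: hypothetical blend factor in [0, 1] (an assumption)
    """
    energies = {d: signal_energy(x) for d, x in processed.items()}
    # Guard against division by zero when all signals or counts are empty.
    total_energy = fsum(energies.values()) or 1.0
    total_count = sum(speech_counts.get(d, 0) for d in processed) or 1

    def score(direction):
        energy_share = energies[direction] / total_energy
        history_share = speech_counts.get(direction, 0) / total_count
        return (1 - history_weight) * energy_share + history_weight * history_share

    return max(processed, key=score)


# Two processed (beamformed) signals; the second is slightly more energetic,
# but speech has been identified far more often from the first direction.
processed = {
    "first": [0.5, -0.4, 0.6, -0.5],
    "second": [0.6, -0.5, 0.6, -0.5],
}
speech_counts = {"first": 9, "second": 1}

direction = select_direction(processed, speech_counts)
selected_signal = processed[direction]
```

In this toy input the history term outweighs the small energy difference, so the first processed audio signal is selected; with `history_weight=0.0` the choice would rest on energy alone.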
Abstract
Techniques are described for tailoring beamforming to an environment so that processing resources may be devoted to the portion of an audio signal corresponding to the lobe of a beampattern that is most likely to contain user speech. The techniques take into account both the acoustic characteristics of the environment and heuristics regarding lobes that have previously been found to include user speech.
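One way to picture the heuristics regarding lobes that have previously included user speech is a per-lobe counter. The sketch below is illustrative only: the class name `LobeHistory`, the lobe labels, and the add-one smoothing used in `prior` are assumptions, not details taken from the patent.

```python
from collections import Counter


class LobeHistory:
    """Counts how often speech was identified in each lobe (direction)."""

    def __init__(self, lobes):
        self.lobes = list(lobes)
        self.counts = Counter()

    def record_speech(self, lobe):
        """Call whenever speech is identified in audio from `lobe`."""
        if lobe not in self.lobes:
            raise ValueError(f"unknown lobe: {lobe!r}")
        self.counts[lobe] += 1

    def prior(self, lobe):
        """Smoothed share of past detections (add-one smoothing is an assumption)."""
        total = sum(self.counts[name] for name in self.lobes) + len(self.lobes)
        return (self.counts[lobe] + 1) / total

    def most_likely(self):
        """Lobe with the largest smoothed share of past speech detections."""
        return max(self.lobes, key=self.prior)


history = LobeHistory(["north", "east", "south", "west"])
for lobe in ["east", "east", "north", "east"]:
    history.record_speech(lobe)

likely_lobe = history.most_likely()
```

The smoothing keeps lobes with no recorded speech at a small nonzero share, so a device that is moved to a new position could still end up selecting them.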
27 Citations
18 Claims
6. A method comprising:
generating, based at least in part on sound captured by a plurality of microphones, a plurality of audio signals, wherein individual audio signals of the plurality of audio signals corresponds to a respective microphone of the plurality of microphones;
processing, by a beamforming component configured with one or more beamforming coefficients, at least a first audio signal of the plurality of audio signals;
generating, based at least in part on processing at least the first audio signal, a first processed audio signal corresponding to a first portion of the sound received from a first direction;
processing, by the beamforming component configured with the one or more beamforming coefficients, at least a second audio signal of the plurality of audio signals;
generating, based at least in part on processing at least the second audio signal, a second processed audio signal corresponding to a second portion of the sound received from a second direction;
selecting a direction of interest based at least in part on:
an amount of energy associated with a portion of the first processed audio signal;
an amount of energy associated with a portion of the second processed audio signal; and
directional data indicating at least one of a number of times speech has been identified from the first direction in previously processed audio signals or a number of times speech has been identified from the second direction in the previously processed audio signals; and
selecting, based at least in part on selecting the direction of interest, the first processed audio signal.
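The processing steps of the method can be pictured with a delay-and-sum beamformer, a common textbook technique used here as a stand-in; the claim does not state which beamforming algorithm is used. In this sketch the "beamforming coefficients" are assumed to be integer sample delays, and the function name `delay_and_sum` and the two-microphone pulse data are hypothetical.

```python
def delay_and_sum(mic_signals, delays):
    """Shift each microphone signal by its integer delay, then average.

    mic_signals: list of equal-length sample lists, one per microphone
    delays: non-negative integer sample delays, one per microphone
        (standing in for the beamforming coefficients; an assumption)
    """
    n = len(mic_signals[0])
    out = []
    for t in range(n):
        acc = 0.0
        for signal, delay in zip(mic_signals, delays):
            idx = t - delay
            if 0 <= idx < n:  # samples shifted in from outside count as zero
                acc += signal[idx]
        out.append(acc / len(mic_signals))
    return out


# A pulse reaches microphone 0 one sample before microphone 1.
mic_signals = [
    [0.0, 0.0, 1.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 1.0, 0.0, 0.0],
]

# Delays steered toward the source align the pulses; the opposite steering
# spreads them apart and lowers the energy of the processed signal.
first_processed = delay_and_sum(mic_signals, [1, 0])
second_processed = delay_and_sum(mic_signals, [0, 1])

first_energy = sum(s * s for s in first_processed)
second_energy = sum(s * s for s in second_processed)
```

Here the first processed signal carries twice the energy of the second, which is the kind of per-direction energy difference the selection step can draw on.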
12. One or more non-transitory computer-readable media storing computer-executable instructions that, when executed by one or more processors, cause the one or more processors to perform acts comprising:
generating, based at least in part on sound captured by a plurality of microphones within an environment, a plurality of audio signals, wherein individual audio signals of the plurality of audio signals corresponds to a respective microphone of the plurality of microphones;
processing, by a beamforming component configured with one or more beamforming coefficients, a first audio signal of the plurality of audio signals;
generating, based at least in part on processing the first audio signal, a first processed audio signal corresponding to a first portion of the sound received from a first direction within the environment;
processing, by the beamforming component configured with the one or more beamforming coefficients, a second audio signal of the plurality of audio signals;
generating, based at least in part on processing the second audio signal, a second processed audio signal corresponding to a second portion of the sound received from a second direction within the environment;
selecting a direction within the environment based at least in part on:
an amount of energy associated with a portion of the first processed audio signal;
an amount of energy associated with a portion of the second processed audio signal; and
directional data indicating at least one of a number of times speech has been identified from the first direction in previously processed audio signals or a number of times speech has been identified from the second direction in the previously processed audio signals; and
selecting, based at least in part on selecting the direction within the environment, the first processed audio signal.
Specification