Three-dimensional beam forming with a microphone array
First Claim
1. A networked microphone device comprising:
- one or more processors;
a microphone array; and
tangible, non-transitory computer-readable media comprising instructions encoded therein, wherein the instructions, when executed by the one or more processors, cause the networked microphone device to perform functions comprising;
continuously monitoring each received-sound beam in a set of received-sound beams generated by the networked microphone device, wherein each received-sound beam generated by the networked microphone device corresponds to a separate three-dimensional polar direction relative to the microphone array;
determining, for each received-sound beam, whether the received-sound beam comprises speech content;
ranking received-sound beams determined to comprise speech content; and
selecting one received-sound beam having the highest ranking for voice command processing.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for three-dimensional beamforming disclosed herein include, among other features (i) generating a set of received-sound beams by applying a plurality of sets of beamforming coefficients stored in a tangible memory of the network device to sound received via a microphone array of the network device, wherein each received-sound beam corresponds to a separate direction relative to the microphone array, (ii) identifying a subset of the received-sound beams comprising speech content, (iii) for each received-sound beam in the subset of the received-sound beams comprising speech content, determining whether the speech content comprises a wake word, (iv) selecting one final received-sound beam from the received-sound beams in the subset of the received-sound beams determined to comprise a wake word; and (v) causing the selected one final received-sound beam to be processed to identify a voice command.
589 Citations
20 Claims
-
1. A networked microphone device comprising:
-
one or more processors; a microphone array; and tangible, non-transitory computer-readable media comprising instructions encoded therein, wherein the instructions, when executed by the one or more processors, cause the networked microphone device to perform functions comprising; continuously monitoring each received-sound beam in a set of received-sound beams generated by the networked microphone device, wherein each received-sound beam generated by the networked microphone device corresponds to a separate three-dimensional polar direction relative to the microphone array; determining, for each received-sound beam, whether the received-sound beam comprises speech content; ranking received-sound beams determined to comprise speech content; and selecting one received-sound beam having the highest ranking for voice command processing. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. Tangible, non-transitory computer-readable medium comprising instructions encoded therein, wherein the instructions, when executed, cause a networked microphone device to perform functions comprising:
-
continuously monitoring each received-sound beam in a set of received-sound beams generated by the networked microphone device, wherein each received-sound beam generated by the networked microphone device corresponds to a separate three-dimensional polar direction relative to a microphone array of the networked microphone device; determining, for each received-sound beam, whether the received-sound beam comprises speech content; ranking received-sound beams determined to comprise speech content; and selecting one received-sound beam having the highest ranking for voice command processing. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A method performed by a networked microphone device, the method comprising:
-
continuously monitoring each received-sound beam in a set of received-sound beams generated by the networked microphone device, wherein each received-sound beam generated by the networked microphone device corresponds to a separate three-dimensional polar direction relative to a microphone array of the networked microphone device; determining, for each received-sound beam, whether the received-sound beam comprises speech content; ranking received-sound beams determined to comprise speech content; and selecting one received-sound beam having the highest ranking for voice command processing. - View Dependent Claims (18, 19, 20)
-
Specification