Method for improving near-end voice activity detection in talker localization system utilizing beamforming technology
Abstract
A method for detecting voice activity comprises receiving audio signals on a plurality of channels and processing the audio signals on the channels to improve the signal-to-noise ratio thereof. The processed audio signals on each channel are then fed to associated voice activity detection algorithms and further processed. A voice or silence determination is then rendered based on at least the output of the voice activity detection algorithms. A voice activity detector is also provided.
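The flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the delay-and-sum beamformer, the fixed energy threshold, and the OR-style decision logic are stand-ins chosen for brevity, and every name below (`delay_and_sum`, `energy_vad`, `detect_voice`, `steering_delays`, `threshold`) is hypothetical.

```python
def delay_and_sum(channels, delays):
    """Shift each channel by an integer sample delay and average them.

    Stand-in for the SNR-improving step: signals arriving from the
    steered direction add coherently, others partly cancel.
    """
    n = len(channels[0])
    out = [0.0] * n
    for channel, delay in zip(channels, delays):
        for i in range(n):
            j = i + delay
            if 0 <= j < n:  # samples shifted past the frame edge are dropped
                out[i] += channel[j]
    return [value / len(channels) for value in out]


def energy_vad(frame, threshold):
    """Assumed per-beam detector: True when mean-square energy exceeds threshold."""
    energy = sum(value * value for value in frame) / len(frame)
    return energy > threshold


def detect_voice(channels, steering_delays, threshold):
    """Run one detector per beam and combine the outputs.

    steering_delays holds one tuple of per-channel delays per look direction.
    The final decision here is a simple OR over the per-beam decisions
    (an assumption; the source only says the decision is based on at
    least the detector outputs).
    """
    beams = [delay_and_sum(channels, delays) for delays in steering_delays]
    decisions = [energy_vad(beam, threshold) for beam in beams]
    return any(decisions), decisions
```

Calling `detect_voice` with one frame per microphone returns the overall voice/silence flag together with the per-beam decisions, which a localization stage could use to see which look direction responded.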
13 Claims
1. A method for detecting voice activity comprising the steps of:
receiving audio signals on a plurality of channels;
processing the audio signals on the channels to improve the signal-to-noise ratio thereof;
feeding the processed audio signals on each channel to an associated voice activity detection algorithm and further processing the audio signals via said voice activity detection algorithms; and
rendering a voice or silence determination based on at least the output of said voice activity detection algorithms.
7. A voice activity detector comprising:
an array of beamformers, each beamformer in said array having a different look direction and receiving audio signals on multiple channels, each beamformer processing said audio signals to improve the signal-to-noise ratio thereof;
an array of voice activity detector modules, each voice activity detector module being associated with a respective one of said beamformers and processing the output of said associated beamformer; and
logic receiving the output of said voice activity detector modules and generating output signifying the presence or absence of voice in said audio signals.
Specification