Voice activity detector for half-duplex audio communication system
First Claim
1. A method of locating a noise floor for qualifying a signal, comprising the steps of:
- establishing a noise function based on;
a relationship between an approximate peak level of the signal and a current level of the signal, andvariation measures determined for a plurality of time intervals;
repeatedly updating the noise function to produce a current state of the noise function; and
using the current state of the noise function to locate the noise floor.
0 Assignments
0 Petitions
Accused Products
Abstract
A method of detecting voice in an audio signal comprises the steps of determining an average peak value representing an envelope of the audio signal, determining a running instance of audio signal standard deviation, which corresponds to one of a number of overlapping time intervals, and updating a power density function (PDF) by adding instances of noise to the PDF if the average peak of the audio signal exceeds the current level of the audio signal by a certain amount and if the current standard deviation value fails below a threshold for a predetermined time interval. A noise floor is located based on the mean value of the PDF, and, if the audio signal sustains a power level exceeding the noise floor, voice activity is determined to be present in the audio signal. The PDF is updated by a low confidence factor if all of the standard deviation values calculated during a certain period of time are below the threshold value and by a high confidence factor if all standard deviation values within a certain longer period of time period are below the threshold value.
37 Citations
20 Claims
-
1. A method of locating a noise floor for qualifying a signal, comprising the steps of:
-
establishing a noise function based on; a relationship between an approximate peak level of the signal and a current level of the signal, and variation measures determined for a plurality of time intervals; repeatedly updating the noise function to produce a current state of the noise function; and using the current state of the noise function to locate the noise floor. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method of detecting speech in an audio signal, comprising the steps of:
-
determining an average peak of the audio signal; determining variation measures for the audio signal, each of the variation measures corresponding to one of a plurality of time intervals; updating a power density function (PDF) to establish a current state of the PDF according to a relationship between the average peak and a current level of the audio signal and based on the variation measures; locating a noise floor based on the current state of the PDF; and if a predetermined relationship exists between the current level of the audio signal and the noise floor, determining that speech is represented in the audio signal. - View Dependent Claims (7, 8, 9, 10, 11, 12, 13)
-
-
14. An apparatus for determining whether voice is present in an audio signal, comprising:
-
a peak calculator determining a peak of the audio signal; a variation measure generator determining variation measures of the audio signal, each of the variation measures corresponding to one of a plurality of time intervals; updating logic coupled to receive the peak and the variation measures, the updating logic updating a power density function (PDF) to establish a current state of the PDF according to a relationship between the peak and a current level of the audio signal and based on the variation measures; a noise floor locator coupled to receive the current state of the PDF, the noise floor locator locating a noise floor based on the current state of the PDF; and decision logic coupled to receive the noise floor and the audio signal, the decision logic determining that voice is represented in the audio signal when a predetermined relationship exists between the current level of the audio signal and the noise floor. - View Dependent Claims (15, 16, 17, 18)
-
-
19. A computer system having capability for duplex audio communication with a remote site, the system comprising:
-
a processor controlling the computer system; an input device coupled to the processor and coupled to input audio information to be transmitted to the remote site; an output device coupled to the processor and coupled to output audio information received from the remote site; and a voice activity detector coupled to the input device and the output device, the voice activity detector detecting voice represented in an audio signal received by the computer system or to be transmitted by the computer system, the voice activity detector including; peak logic determining an average peak of the audio signal; a variation measure generator determining variation measures of the audio signal, each of the variation measures corresponding to one of a plurality of time intervals; updating logic coupled to receive the variation measures and the average peak, the updating logic updating a power density function (PDF) to establish a current state of the PDF according to a relationship between the average peak and a current level of the audio signal and according to the variation. measures; noise logic locating a noise floor based on the current state of the PDF; and decision logic determining that voice is represented in the audio signal when a predetermined relationship exists between the current level of the audio signal and the noise floor. - View Dependent Claims (20)
-
Specification