Voice activity detector for half-duplex audio communication system
First Claim
1. A method of locating a noise floor for qualifying a signal, comprising the steps of:
- establishing a noise power density function (NPDF), based on;
a relationship between an approximate peak level of the signal and a current level of the signal, anda plurality of standard deviation values of the signal, each of the standard deviation values corresponding to one of a plurality of time intervals;
repeatedly updating the NPDF to produce a current state of the NPDF; and
using the current state of the NPDF to locate the noise floor.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of detecting voice in an audio signal comprises the steps of determining an average peak value representing an envelope of the audio signal, determining a running instance of audio signal standard deviation, which corresponds to one of a number of overlapping time intervals, and updating a power density function (PDF) by adding instances of noise to the PDF if the average peak of the audio signal exceeds the current level of the audio signal by a certain amount and if the current standard deviation value falls below a threshold for a predetermined time interval. A noise floor is located based on the mean value of the PDF, and, if the audio signal sustains a power level exceeding the noise floor, voice activity is determined to be present in the audio signal. The PDF is updated by a low confidence factor if all of the standard deviation values calculated during a certain period of time are below the threshold value and by a high confidence factor if all standard deviation values within a certain longer period of time period are below the threshold value.
-
Citations
31 Claims
-
1. A method of locating a noise floor for qualifying a signal, comprising the steps of:
-
establishing a noise power density function (NPDF), based on; a relationship between an approximate peak level of the signal and a current level of the signal, and a plurality of standard deviation values of the signal, each of the standard deviation values corresponding to one of a plurality of time intervals; repeatedly updating the NPDF to produce a current state of the NPDF; and using the current state of the NPDF to locate the noise floor. - View Dependent Claims (2, 3, 4)
-
-
5. A method of detecting speech in an audio signal, comprising the steps of:
-
determining an average peak of the audio signal;
determining a plurality of standard deviation values of the audio signal, each of the standard deviation values corresponding to one of a plurality of time intervals;updating a power density function (PDF) to establish a current state of the PDF according to a relationship between the average peak and a current level of the audio signal and based on the standard deviation values; locating a noise floor based on the current state of the PDF; and if a predetermined relationship exists between the current level of the audio signal and the noise floor, determining that speech is represented in the audio signal. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12)
-
-
13. An apparatus for determining whether voice is present in an audio signal, comprising:
-
a peak calculator determining a peak of the audio signal; a standard deviation generator determining a plurality of standard deviation values of the audio signal, each of the standard deviation values corresponding to one of a plurality of time intervals; updating logic coupled to receive the peak and the standard deviation values, the updating logic updating a power density function (PDF) to establish a current state of the PDF according to a relationship between the peak and a current level of the audio signal and based on the standard deviation values; a noise floor locator coupled to receive the current state of the PDF, the noise floor locator locating a noise floor based on the current state of the PDF; and decision logic coupled to receive the noise floor and the audio signal, the decision logic determining that voice is represented in the audio signal when a predetermined relationship exists between the current level of the audio signal and the noise floor. - View Dependent Claims (14, 15, 16, 17)
-
-
18. An apparatus for detecting voice in an audio signal, comprising:
-
means for determining an average peak of the audio signal; means for determining a plurality of standard deviation values of the audio signal, each of the standard deviation values corresponding to one of a plurality of time intervals; means for updating a power density function (PDF) to establish a current state of the PDF according to a relationship between the average peak and a current level of the audio signal and based on the standard deviation values; means for locating a noise floor based on the current state of the PDF; and means for determining that voice is represented in the audio signal if a predetermined relationship exists between the current level of the audio signal and the noise floor. - View Dependent Claims (19, 20, 21, 22, 23, 24)
-
-
25. A computer system having capability for duplex audio communication with a remote site, the system comprising:
-
a processor controlling the computer system; an input device coupled to the processor and coupled to input audio information to be transmitted to the remote site; an output device coupled to the processor and coupled to output audio information received from the remote site; and a voice activity detector coupled to the input device and the output device, the voice activity detector detecting voice represented in an audio signal received by the computer system or to be transmitted by the computer system, the voice activity detector including; peak logic determining an average peak of the audio signal; a standard deviation generator determining a plurality of standard deviation values of the audio signal, each of the standard deviation values corresponding to one of a plurality of time intervals; updating logic coupled to receive the standard deviation values and the average peak updating a power density function (PDF) to establish a current state of the PDF according to a relationship between the average peak and a current level of the audio signal and based on the standard deviation values; noise logic locating a noise floor based on the current state of the PDF; and decision logic determining that voice is represented in the audio signal when a predetermined relationship exists between the current level of the audio signal and the noise floor. - View Dependent Claims (26)
-
-
27. A processing system having capability for duplex audio communication with a remote site, the system comprising:
-
processor means for controlling the processing system; input means for inputting audio information to be transmitted to the remote site; output means for outputting audio information received from the remote sight; and voice detection means for detecting voice in an audio signal received by the processing system or to be transmitted by the processing system, the voice detection means including; means for determining an approximate peak of the audio signal; means for determining a plurality of standard deviation values of the audio signal, each of the standard deviation values corresponding to one of a plurality of time intervals; means for updating a power density function (PDF) to establish a current state of the PDF according to a relationship between the approximate peak and a current level of the audio signal and based on the standard deviation values; means for locating a noise floor based on the current state of the PDF; and means for determining that voice is represented in the audio signal if a predetermined relationship exists between the current level of the audio signal and the noise floor. - View Dependent Claims (28, 29, 30, 31)
-
Specification