Method for making a voice activity decision
Abstract
The invention relates to a method for determining voice activity in a signal section of an audio signal. The result, i.e., whether voice activity is present in the observed signal section, depends on the spectral and temporal stationarity of that signal section and/or of prior signal sections. In a first step, the method determines whether spectral stationarity is present in the observed signal section. In a second step, the method determines whether temporal stationarity is present in that signal section. The final decision on the presence of voice activity in the observed signal section depends on the output values of both steps.
21 Claims
1. A method for determining speech activity in a signal segment of an audio signal, the method comprising:
assessing, in a first stage, whether spectral stationarity is present in the signal segment;
assessing, in a second stage, whether temporal stationarity is present in the signal segment; and
making a decision on the presence of speech activity in the signal segment based on outputs of the first and second stages.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18
19. A method for determining speech activity in a signal segment of an audio signal, the method comprising:
comparing a first evaluation of the signal segment with a first threshold value to determine whether spectral stationarity is present in the signal segment;
comparing a second evaluation of the signal segment with a second threshold value to calculate whether temporal stationarity is present in the signal segment; and
determining a presence of speech activity in the signal segment based on a comparison of the first and second evaluations of the signal segment.
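The two threshold comparisons of claim 19 can be pictured with a minimal Python sketch. Everything named here is hypothetical: `StageCheck`, `speech_present`, the direction of the comparison (a small evaluation is taken to mean stationarity), and the rule for combining the two stages are assumptions made for illustration, since the claim does not state them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StageCheck:
    """One stage: an evaluation of the segment and the threshold it is compared with."""
    evaluation: float
    threshold: float

    def stationary(self) -> bool:
        # Assumption: a small evaluation means little change, i.e. stationarity.
        return self.evaluation < self.threshold


def speech_present(spectral: StageCheck, temporal: StageCheck) -> bool:
    # Assumed combination rule (not stated in the claim): a segment that is
    # stationary in both senses is treated as background, anything else as speech.
    return not (spectral.stationary() and temporal.stationary())
```

Under these assumptions, a call such as `speech_present(StageCheck(0.9, 0.5), StageCheck(0.2, 0.5))` reports speech because the first stage does not find spectral stationarity.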
20. A method for determining speech activity in a signal segment of an audio signal, the method comprising:
dividing the signal segment into a series of frames;
calculating a spectral distance between a current frame of the signal segment and a preceding frame of the signal segment;
calculating a mean value of a voicedness of the signal segment;
comparing the spectral distance and mean value of voicedness to respective threshold values to determine if the signal segment has spectral stationarity;
determining if the signal segment has temporal stationarity based on an energy calculation of the frames; and
deciding if the signal segment contains speech activity based on the presence of spectral stationarity and temporal stationarity.
Dependent claims: 21
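The steps of claim 20 can be sketched in plain Python as follows. This is an illustration under stated assumptions, not the patented implementation: the log-spectrum distance, the autocorrelation-based voicedness measure, the max/min energy ratio, all threshold values, and the final combination rule are hypothetical choices, as are all function and parameter names.

```python
import cmath
import math


def split_frames(segment, frame_len):
    """Divide the segment into consecutive, non-overlapping frames."""
    last_start = len(segment) - frame_len
    return [segment[i:i + frame_len] for i in range(0, last_start + 1, frame_len)]


def log_spectrum(frame):
    """Log power spectrum from a direct DFT (bins 0 .. n/2)."""
    n = len(frame)
    bins = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))
        bins.append(math.log10(abs(acc) ** 2 + 1e-12))
    return bins


def spectral_distance(current, preceding):
    """Assumed measure: RMS difference between the two log power spectra."""
    a, b = log_spectrum(current), log_spectrum(preceding)
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)) / len(a))


def voicedness(frame, min_lag=2):
    """Assumed measure: peak of the normalized autocorrelation over the tested lags."""
    energy = sum(x * x for x in frame)
    if energy == 0:
        return 0.0
    n = len(frame)
    best = 0.0
    for lag in range(min_lag, n // 2):
        r = sum(frame[t] * frame[t + lag] for t in range(n - lag))
        best = max(best, r / energy)
    return best


def frame_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def detect_speech(segment, frame_len=64, dist_thr=1.0, voiced_thr=0.5, energy_ratio_thr=4.0):
    """Return True if the segment is judged to contain speech (hypothetical thresholds)."""
    frames = split_frames(segment, frame_len)
    if len(frames) < 2:
        raise ValueError("need at least two frames to compare current and preceding frame")

    # Spectral stage: distance between current and preceding frame, plus mean voicedness.
    dist = spectral_distance(frames[-1], frames[-2])
    mean_voiced = sum(voicedness(f) for f in frames) / len(frames)
    spectral_stationary = dist < dist_thr and mean_voiced < voiced_thr

    # Temporal stage: frame energies must stay within an assumed max/min ratio.
    energies = [frame_energy(f) for f in frames]
    temporal_stationary = max(energies) <= energy_ratio_thr * max(min(energies), 1e-12)

    # Assumed decision rule: stationary in both senses means no speech.
    return not (spectral_stationary and temporal_stationary)
```

With these choices, a segment made of identical low-voicedness frames is classified as background, while a segment whose last frame jumps in energy is classified as speech. A real detector would tune the thresholds on recorded material and would likely use an FFT instead of the direct DFT used here for self-containment.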
Specification