Noise-robust speech coding mode classification
First Claim
Patent Images
1. A method of noise-robust speech classification, comprising:
- inputting classification parameters to a speech classifier from external components;
generating, in the speech classifier, internal classification parameters from at least one of the input classification parameters;
setting a Normalized Auto-correlation Coefficient Function threshold, wherein setting the Normalized Auto-correlation Coefficient Function threshold comprises;
increasing a first voicing threshold for classifying a current frame as unvoiced when a signal-to-noise ratio (SNR) fails to exceed a first SNR threshold, wherein the first voicing threshold is not adjusted if the SNR is above the first SNR threshold, andincreasing an energy threshold for classifying the current frame as unvoiced when the noise estimate exceeds a noise estimate threshold, wherein the energy threshold is not adjusted if the noise estimate is below the noise estimate threshold; and
determining a speech mode classification based on a the first voicing threshold and the energy threshold.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of noise-robust speech classification is disclosed. Classification parameters are input to a speech classifier from external components. Internal classification parameters are generated in the speech classifier from at least one of the input parameters. A Normalized Auto-correlation Coefficient Function threshold is set. A parameter analyzer is selected according to a signal environment. A speech mode classification is determined based on a noise estimate of multiple frames of input speech.
25 Citations
43 Claims
-
1. A method of noise-robust speech classification, comprising:
-
inputting classification parameters to a speech classifier from external components; generating, in the speech classifier, internal classification parameters from at least one of the input classification parameters; setting a Normalized Auto-correlation Coefficient Function threshold, wherein setting the Normalized Auto-correlation Coefficient Function threshold comprises; increasing a first voicing threshold for classifying a current frame as unvoiced when a signal-to-noise ratio (SNR) fails to exceed a first SNR threshold, wherein the first voicing threshold is not adjusted if the SNR is above the first SNR threshold, and increasing an energy threshold for classifying the current frame as unvoiced when the noise estimate exceeds a noise estimate threshold, wherein the energy threshold is not adjusted if the noise estimate is below the noise estimate threshold; and determining a speech mode classification based on a the first voicing threshold and the energy threshold. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32)
-
-
33. An apparatus for noise-robust speech classification, comprising:
-
a processor; memory in electronic communication with the processor; instructions stored in the memory, the instructions being executable by the processor to; input classification parameters to a speech classifier from external components; generate, in the speech classifier, internal classification parameters from at least one of the input classification parameters; set a Normalized Auto-correlation Coefficient Function threshold, wherein the instructions executable to set the Normalized Auto-correlation Coefficient Function threshold further comprise instructions executable to; increase a first voicing threshold for classifying a current frame as unvoiced when a signal-to-noise ratio (SNR) fails to exceed a first SNR threshold, wherein the first voicing threshold is not adjusted if the SNR is above the first SNR threshold, and increase an energy threshold for classifying the current frame as unvoiced when the noise estimate exceeds a noise estimate threshold, wherein the energy threshold is not adjusted if the noise estimate is below the noise estimate threshold; and determine a speech mode classification based on the first voicing threshold and the energy threshold. - View Dependent Claims (34, 35, 36, 37, 38, 39)
-
-
40. An apparatus for noise-robust speech classification, comprising:
-
means for inputting classification parameters to a speech classifier from external components; means for generating, in the speech classifier, internal classification parameters from at least one of the input classification parameters; means for setting a Normalized Auto-correlation Coefficient Function threshold, wherein the means for setting the Normalized Auto-correlation Coefficient Function threshold comprise; means for increasing a first voicing threshold for classifying a current frame as unvoiced when a signal-to-noise ratio (SNR) fails to exceed a first SNR threshold, wherein the first voicing threshold is not adjusted if the SNR is above the first SNR threshold, and means for increasing an energy threshold for classifying the current frame as unvoiced when the noise estimate exceeds a noise estimate threshold, wherein the energy threshold is not adjusted if the noise estimate is below the noise estimate threshold; and means for determining a speech mode classification based on the first voicing threshold and the energy threshold. - View Dependent Claims (41)
-
-
42. A computer-program product for noise-robust speech classification, the computer-program product comprising a non-transitory computer-readable medium having instructions thereon, the instructions, comprising:
-
code for inputting classification parameters to a speech classifier from external components; code for generating, in the speech classifier, internal classification parameters from at least one of the input classification parameters; code for setting a Normalized Auto-correlation Coefficient Function threshold, wherein the code for setting the Normalized Auto-correlation Coefficient Function threshold comprises; code for increasing a first voicing threshold for classifying a current frame as unvoiced when the noise estimate exceeds a noise estimate threshold a signal-to-noise ratio (SNR) fails to exceed a first SNR threshold, wherein the first voicing threshold is not adjusted if the SNR is above the first SNR threshold; and code for increasing an energy threshold for classifying the current frame as unvoiced when the noise estimate exceeds a noise estimate threshold, wherein the voicing threshold and the energy threshold is not adjusted if the noise estimate is below the noise estimate threshold; and code for determining a speech mode classification based on the first voicing threshold and the energy threshold. - View Dependent Claims (43)
-
Specification