Unvoiced/voiced decision for speech processing
First Claim
1. A method for speech processing, the method comprising:
- determining by a processor, a first unvoicing parameter for a first subframe of a speech signal, wherein the first unvoicing parameter is determined using a product of (1−
Pvoicing) and (1−
Ptilt), wherein Pvoicing is a periodicity parameter and Ptilt is a spectral tilt parameter;
determining by a processor a smoothed first unvoicing parameter for the first subframe according to a smoothed second unvoicing parameter for a second subframe prior to the first subframe of the speech signal;
computing a difference between the first unvoicing parameter for the first subframe and the smoothed first unvoicing parameter for the first subframe;
determining a classification of the first subframe using the computed difference as a decision parameter, the classification indicating whether the first subframe is an unvoiced speech signal or not an unvoiced speech signal; and
performing bandwidth extension on the speech signal for the first subframe, wherein a parameter for performing the bandwidth extension when the classification indicates the first subframe is an unvoiced speech signal is different from a parameter for performing the bandwidth extension when the classification indicates the first subframe is not an unvoiced speech signal.
0 Assignments
0 Petitions
Accused Products
Abstract
A method for speech processing includes determining a first unvoicing parameter for a first subframe of a speech signal, and determining a smoothed unvoicing parameter for the first subframe according to a second unvoicing parameter of a second subframe prior to the first subframe of the speech signal. The first unvoicing parameter is determined according to a periodicity parameter and a spectral tilt parameter. The method further includes computing a difference between the first unvoicing parameter for the first subframe and the smoothed unvoicing parameter for the first subframe and determining a classification of the first subframe using the computed difference as a decision parameter. The classification indicates whether the first subframe is an unvoiced speech signal or not an unvoiced speech signal. Bandwidth extension is performed on the speech signal for the first subframe according to the classification of the first subframe.
-
Citations
18 Claims
-
1. A method for speech processing, the method comprising:
-
determining by a processor, a first unvoicing parameter for a first subframe of a speech signal, wherein the first unvoicing parameter is determined using a product of (1−
Pvoicing) and (1−
Ptilt), wherein Pvoicing is a periodicity parameter and Ptilt is a spectral tilt parameter;determining by a processor a smoothed first unvoicing parameter for the first subframe according to a smoothed second unvoicing parameter for a second subframe prior to the first subframe of the speech signal; computing a difference between the first unvoicing parameter for the first subframe and the smoothed first unvoicing parameter for the first subframe; determining a classification of the first subframe using the computed difference as a decision parameter, the classification indicating whether the first subframe is an unvoiced speech signal or not an unvoiced speech signal; and performing bandwidth extension on the speech signal for the first subframe, wherein a parameter for performing the bandwidth extension when the classification indicates the first subframe is an unvoiced speech signal is different from a parameter for performing the bandwidth extension when the classification indicates the first subframe is not an unvoiced speech signal. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. An audio access device comprising a network interface and a CODEC with a decoder, wherein the decoder receives an encoded audio signal via the network interface, and is configured to:
-
determine a first unvoicing parameter for a first subframe of a speech signal, wherein the first unvoicing parameter is determined using a product of (1−
Pvoicing) and (1−
Ptilt), wherein Pvoicing is a periodicity parameter and Ptilt is a spectral tilt parameter;determine a smoothed first unvoicing parameter for the first subframe according to a smoothed second unvoicing parameter for a second subframe prior to the first subframe of the speech signal; compute a difference between the first unvoicing parameter for the first subframe and the smoothed first unvoicing parameter for the first subframe; determine a classification of the first subframe using the computed difference as a decision parameter, the classification indicates whether the first subframe is an unvoiced speech signal or not an unvoiced speech signal; and perform bandwidth extension on the speech signal, wherein a parameter for performing the bandwidth extension when the classification indicates the first subframe is an unvoiced speech signal is different from a parameter for performing the bandwidth extension when the classification indicates the first subframe is not an unvoiced speech signal. - View Dependent Claims (9, 10)
-
-
11. A speech processing apparatus comprising:
-
a processor; and a non-transitory computer-readable storage medium storing computer instructions, that when executed by the processor, cause the processor to; determine a first unvoicing parameter for a first subframe of a speech signal, wherein the first unvoicing parameter is determined using a product of (1−
Pvoicing) and (1−
Ptilt), wherein Pvoicing is a periodicity parameter and Ptilt is a spectral tilt parameter;determine a smoothed first unvoicing parameter for the first subframe according to a smoothed second unvoicing parameter for a second subframe prior to the first subframe of the speech signal; compute a difference between the first unvoicing parameter for the first subframe and the smoothed first unvoicing parameter for the first subframe; determine a classification of the first subframe using the computed difference as a decision parameter, the classification indicates whether the first subframe is an unvoiced speech signal or not an unvoiced speech signal; and perform bandwidth extension on the speech signal for the first subframe, wherein a parameter for performing the bandwidth extension when the classification indicates the first subframe is an unvoiced speech signal is different from a parameter for performing the bandwidth extension when the classification indicates the first subframe is not an unvoiced speech signal. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A non-transitory computer readable storage medium storing instructions which, when executed by a processor, cause the processor to perform the steps of:
-
determining a first unvoicing parameter for a first subframe of a speech signal, wherein the first unvoicing parameter is determined using a product of (1−
Pvoicing) and (1−
Ptilt), according wherein Pvoicing is a periodicity parameter and Ptilt is a spectral tilt parameter;determining a smoothed first unvoicing parameter for the first subframe according to a second smoothed unvoicing parameter for a second subframe prior to the first subframe of the speech signal; computing a difference between the first unvoicing parameter for the first subframe and the smoothed first unvoicing parameter for the first subframe; determining a classification of the first subframe using the computed difference as a decision parameter, the classification indicating whether the first subframe is an unvoiced speech signal or not an unvoiced speech signal; and performing bandwidth extension on the speech signal for the first subframe, wherein a parameter for performing the bandwidth extension when the classification indicates the first subframe is an unvoiced speech signal is different from a parameter for performing the bandwidth extension when the classification indicates the first subframe is not an unvoiced speech signal. - View Dependent Claims (18)
-
Specification