Unvoiced/voiced decision for speech processing
First Claim
1. A method for speech processing, the method comprising:
- determining, by a processor, an unvoicing parameter for a first frame of a speech signal, wherein the unvoicing parameter reflects a speech characteristic of the first frame;
determining, by a processor, a smoothed unvoicing parameter for the first frame by weighting the unvoicing parameter for the first frame and a smoothed unvoicing parameter for a second frame, when the smoothed unvoicing parameter for the second frame is greater than the unvoicing parameter for the first frame, the smoothed unvoicing parameter for the second frame is weighted less heavily than the case when the smoothed unvoicing parameter for the second frame is not greater than the unvoicing parameter for the first frame;
computing a difference, by the processor, between the unvoicing parameter for the first frame and the smoothed unvoicing parameter for the first frame;
determining a classification of the first frame according to the computed difference, wherein the classification indicates whether the first frame is an unvoiced speech signal or not;
processing the first frame by the processor in accordance with the classification of the first frame; and
outputting a synthesized speech signal according to the processing of the first frame.
0 Assignments
0 Petitions
Accused Products
Abstract
A method for speech processing includes determining an unvoicing parameter for a first frame of a speech signal and determining a smoothed unvoicing parameter for the first frame by weighting the unvoicing parameter of the first frame and a smoothed unvoicing parameter of a second frame. The unvoicing parameter reflects a speech characteristic of the first frame. The smoothed unvoicing parameter of the second frame is weighted less heavily when the smoothed unvoicing parameter of the second frame is greater than the unvoicing parameter of the first frame. The method further includes computing a difference, by a processor, between the unvoicing parameter of the first frame and the smoothed unvoicing parameter of the first frame, and determining a classification of the first frame according to the computed difference. The classification includes unvoiced speech or voiced speech. The first frame is processed in accordance with the classification of the first frame.
-
Citations
21 Claims
-
1. A method for speech processing, the method comprising:
-
determining, by a processor, an unvoicing parameter for a first frame of a speech signal, wherein the unvoicing parameter reflects a speech characteristic of the first frame; determining, by a processor, a smoothed unvoicing parameter for the first frame by weighting the unvoicing parameter for the first frame and a smoothed unvoicing parameter for a second frame, when the smoothed unvoicing parameter for the second frame is greater than the unvoicing parameter for the first frame, the smoothed unvoicing parameter for the second frame is weighted less heavily than the case when the smoothed unvoicing parameter for the second frame is not greater than the unvoicing parameter for the first frame; computing a difference, by the processor, between the unvoicing parameter for the first frame and the smoothed unvoicing parameter for the first frame; determining a classification of the first frame according to the computed difference, wherein the classification indicates whether the first frame is an unvoiced speech signal or not; processing the first frame by the processor in accordance with the classification of the first frame; and outputting a synthesized speech signal according to the processing of the first frame. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A speech processing apparatus comprising:
-
a processor; and a non-transitory computer-readable storage medium storing computer instructions, that when executed by the processor, cause the processor to; determine an unvoicing parameter for a first frame of a speech signal, wherein the unvoicing parameter reflects a speech characteristic of the first frame; determine a smoothed unvoicing parameter for the first frame, wherein the smoothed unvoicing parameter is a weighted sum of the unvoicing parameter for the first frame and a smoothed unvoicing parameter for a second frame, and when the smoothed unvoicing parameter for the second frame is greater than the unvoicing parameter for the first frame, the smoothed unvoicing parameter for the second frame is weighted less heavily than the case when the smoothed unvoicing parameter for the second frame is not greater than the unvoicing parameter for the first frame; compute a difference between the unvoicing parameter for the first frame and the smoothed unvoicing parameter for the first frame; determine a classification of the first frame according to the computed difference, wherein the classification indicates whether the first frame is an unvoiced speech signal or not; process the first frame in accordance with the classification of the first frame; and output a synthesized speech signal according to the processing of the first frame. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21)
-
Specification