Pitch detection method and apparatus uses voiced/unvoiced decision in a frame other than the current frame of a speech signal
First Claim
1. A pitch detection method in an encoding method in which an input speech signal is divided on a time axis in terms of a pre-set frame and in which the frame-based speech signal is judged as to voiced/unvoiced, comprising:
- a pitch searching step of detecting a pitch information under a pre-set pitch detection condition; and
a pitch determining step of determining a pitch of the current frame of the input speech signal based on the results of voiced/unvoiced decisions of the frames of the inputted speech signal other than the current frame on the time axis.
1 Assignment
0 Petitions
Accused Products
Abstract
For realizing high-precision pitch detection even for speech signals in which half-pitch or double-pitch exhibits stronger autocorrelation than the pitch to be detected, an input speech signal is judged as to voicedness or unvoicedness and a voiced portion and an unvoiced portion of the input speech signal are encoded by a sinusoidal analytic encoding unit 114 and by a code excitation encoding unit 120, respectively, for producing respective encoded outputs. The sinusoidal analytic encoding unit 114 performs pitch search on the encoded outputs for finding the pitch information from the input speech signal and sets the high-reliability pitch information based on the detected pitch information. The results of pitch detection are determined using the high-reliability pitch information and the results of decision voicedness/unvoicedness of the frames other than the current frame.
34 Citations
7 Claims
-
1. A pitch detection method in an encoding method in which an input speech signal is divided on a time axis in terms of a pre-set frame and in which the frame-based speech signal is judged as to voiced/unvoiced, comprising:
-
a pitch searching step of detecting a pitch information under a pre-set pitch detection condition; and a pitch determining step of determining a pitch of the current frame of the input speech signal based on the results of voiced/unvoiced decisions of the frames of the inputted speech signal other than the current frame on the time axis. - View Dependent Claims (2, 3)
-
-
4. A speech signal encoding method in which an input speech signal is divided in terms of frame on a time axis and encoded on the frame basis, comprising:
-
a step of detecting a pitch of the input speech signal; a predictive encoding step for finding short-term prediction residuals of the input speech signal; a sinusoidal analysis encoding step for performing sinusoidal analysis encoding on the short-term prediction residuals found in the predictive encoding step; a waveform encoding step for waveform encoding the input speech signal; and a decision step for judging voiced/unvoiced of the input speech signal on the frame basis, wherein the pitch of the input speech signal of the current frame is determined also using the results of the voiced/unvoiced decision of the inputted speech signal of the frames other than the current frame on the time axis. - View Dependent Claims (5)
-
-
6. A speech signal encoding apparatus in which an input speech signal is divided in terms of frames on a time axis and encoded on the frame basis, comprising:
-
means for detecting a pitch of the input speech signal; predictive encoding means for finding short-term prediction residuals of the input speech signal; sinusoidal analysis encoding means for performing sinusoidal analysis encoding on the short-term prediction residuals found by said predictive encoding means; waveform encoding means for waveform encoding the input speech signal; and decision means for judging voiced/unvoiced of the input speech signal on the frame basis, wherein a pitch of the input speech signal of the current frame is determined using the results of the voiced/unvoiced decision of the inputted speech signal of the frames other than the current frame on the time axis. - View Dependent Claims (7)
-
Specification