Method and apparatus for estimating harmonic information, spectral envelope information, and degree of voicing of speech signal
First Claim
1. A method of estimating harmonic information and spectral envelope information of a speech signal, the method comprising the steps of:
- converting a received speech signal of a time domain to a speech signal of a frequency domain;
calculating a coarse pitch value of the speech signal and determining a peak search range using the coarse pitch value;
setting a plurality of peak search ranges in the speech signal, detecting peaks existing in each of the peak search ranges, determining a peak having the greatest spectral value among the detected peaks as a harmonic peak in each of the peak search ranges, and outputting the harmonic peak of each of the peak search ranges as harmonic information of the speech signal; and
generating a harmonic spectral envelope by performing interpolation of the harmonic peaks, and outputting the generated harmonic spectral envelope as spectral envelope information of the speech signal.
1 Assignment
0 Petitions
Accused Products
Abstract
A degree of voicing is extracted using the characteristic of harmonic peaks existing in a constant period by converting an input speech or audio signal to a speech signal of the frequency domain, selecting the greatest peak in a first pitch period of the converted speech signal as a harmonic peak, thereafter selecting a peak having the greatest spectral value among peaks existing in each peak search range of the speech signal as a harmonic peak, extracting harmonic spectral envelope information by performing interpolation of the selected harmonic peaks, extracting non-harmonic spectral envelope information by performing interpolation of the non-harmonic peaks, and comparing the two pieces of envelope information to each other.
40 Citations
25 Claims
-
1. A method of estimating harmonic information and spectral envelope information of a speech signal, the method comprising the steps of:
-
converting a received speech signal of a time domain to a speech signal of a frequency domain;
calculating a coarse pitch value of the speech signal and determining a peak search range using the coarse pitch value;
setting a plurality of peak search ranges in the speech signal, detecting peaks existing in each of the peak search ranges, determining a peak having the greatest spectral value among the detected peaks as a harmonic peak in each of the peak search ranges, and outputting the harmonic peak of each of the peak search ranges as harmonic information of the speech signal; and
generating a harmonic spectral envelope by performing interpolation of the harmonic peaks, and outputting the generated harmonic spectral envelope as spectral envelope information of the speech signal. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method of estimating harmonic information of a speech signal, the method comprising the steps of:
-
converting a received speech signal of a time domain to a speech signal of a frequency domain;
calculating a coarse pitch value of the speech signal and determining a peak search range using the coarse pitch value; and
setting a plurality of peak search ranges in the speech signal, detecting peaks existing in each of the peak search ranges, determining a peak having the greatest spectral value among the detected peaks as a harmonic peak in each of the peak search ranges, and outputting the harmonic peak of each of the peak search ranges as harmonic information of the speech signal.
-
-
12. A method of estimating a degree of voicing of a speech signal using spectral envelope information of the speech signal, the method comprising the steps of:
-
detecting harmonic spectral envelope information comprising harmonic peaks of the speech signal;
detecting non-harmonic spectral envelope information comprising peaks excluding the harmonic peaks among peaks of the speech signal; and
detecting a degree of voicing indicating a rate of a voiced sound included in the speech signal by comparing energy of the harmonic spectral envelope to energy of the non-harmonic spectral envelope. - View Dependent Claims (13)
-
-
14. An apparatus for estimating harmonic information and spectral envelope information of a speech signal, the apparatus comprising;
-
a frequency domain converter for converting a received speech signal of a time domain to a speech signal of a frequency domain;
a search range determiner for calculating a coarse pitch value of the speech signal output from the frequency domain converter and determining a peak search range using the coarse pitch value;
a harmonic peak detector for setting a plurality of peak search ranges in the speech signal, detecting peaks existing in each of the peak search ranges, determining a peak having the greatest spectral value among the detected peaks as a harmonic peak in each of the peak search ranges, and outputting the harmonic peak of each of the peak search ranges as harmonic information of the speech signal; and
a harmonic spectral envelope detector for generating a harmonic spectral envelope by performing interpolation of the harmonic peaks, and outputting the generated harmonic spectral envelope as spectral envelope information of the speech signal. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
-
-
25. An apparatus for estimating harmonic information of a speech signal, the apparatus comprising:
-
a frequency domain converter for converting a received speech signal of a time domain to a speech signal of a frequency domain;
a search range determiner for calculating a coarse pitch value of the speech signal output from the frequency domain converter and determining a peak search range using the coarse pitch value; and
a harmonic peak detector for setting a plurality of peak search ranges in the speech signal, detecting peaks existing in each of the peak search ranges, determining a peak having the greatest spectral value among the detected peaks as a harmonic peak in each of the peak search ranges, and outputting the harmonic peaks as harmonic information of the speech signal.
-
Specification