Method and apparatus for pitch determination of a low bit rate digital voice message
First Claim
1. A system controller comprises a pitch determiner that generates a smoothed pitch value for a current frame of a low bit rate voice message, the pitch determiner comprising:
- a pitch function generator that generates a pitch detection function (PDF) for each frame of digital samples of a voice signal;
a pitch candidate selector that selects a future frame pitch candidate from a pitch detection function (PDF); and
a subharmonic pitch corrector that determines a future frame pitch value by performing pitch subharmonic correction of the future frame pitch candidate using a roughness factor of the frequency transformed window.
1 Assignment
0 Petitions
Accused Products
Abstract
A pitch determiner (931) of a system controller (106) that generates a smoothed pitch value for a current frame of a low bit rate voice message includes a pitch function generator (955) that generates a pitch detection function (PDF) for each frame of digital samples of a voice signal, a pitch candidate selector (960) that selects a future frame pitch candidate from a pitch detection function (PDF), and a pitch adjuster (978) that generates the smoothed pitch value. The pitch adjuster includes a subharmonic pitch corrector (965) that determines a future frame pitch value by performing pitch subharmonic correction of a future frame pitch candidate using a roughness factor of the frequency transformed window.
44 Citations
17 Claims
-
1. A system controller comprises a pitch determiner that generates a smoothed pitch value for a current frame of a low bit rate voice message, the pitch determiner comprising:
-
a pitch function generator that generates a pitch detection function (PDF) for each frame of digital samples of a voice signal;
a pitch candidate selector that selects a future frame pitch candidate from a pitch detection function (PDF); and
a subharmonic pitch corrector that determines a future frame pitch value by performing pitch subharmonic correction of the future frame pitch candidate using a roughness factor of the frequency transformed window. - View Dependent Claims (2, 3)
a low frequency search function that identifies a smallest low frequency peak of the PDF using a fine tune function, and a high frequency search function that identifies a largest high frequency peak of the PDF using the fine tune function.
-
-
4. A system controller comprises a pitch function generator that generates a smoothed pitch value for a current frame of a low bit rate voice message, the pitch function generator comprising:
-
a band correlator that determines a plurality of band autocorrelations that correspond to a plurality of bands of a frequency transformed window of the digital samples, the window being related to a future frame of digital samples; and
a pitch function selector that generates the pitch detection function based on the plurality of band autocorrelations, wherein the pitch function selector is less than Q and el is greater than R, wherein Γ
n is a value of the pitch detection function,n is an index, rmax[1] is a maximum value of rn[1] for n from 1 to K, rn[1] is a first band autocorrelation value, K is a quantity of values in the frequency transformed window, Yn is a value in the frequency transformed window, el is a first band entropy value, and M, P, Q, and R are predetermined positive real values.
-
-
5. A system controller comprises a pitch candidate selector that selects a future frame pitch candidate from a pitch detection function (PDF), the pitch candidate selector comprising:
-
a fine tune function that determines a fine tune peak frequency of a relative peak of the PDF;
a low frequency search function that identifies a smallest low frequency peak of the PDF using the fine tune function;
a high frequency search function that identifies a largest high frequency peak of the PDF using the fine tune function; and
a rough pitch candidate selector that selects one of the smallest low frequency peak and the largest high frequency peak as a future frame rough pitch candidate. - View Dependent Claims (6, 7, 8, 9)
-
-
10. A system controller comprises a pitch adjuster that generates a smoothed pitch value for a current frame of digital samples of a voice signal, the pitch adjuster comprising:
-
a subharmonic pitch corrector that determines a future frame pitch value by performing pitch subharmonic correction of a future frame pitch candidate using a roughness factor of the frequency transformed window; and
a pitch smoothing function that determines a smoothed pitch value as one of an integer multiple of a current frame pitch value, the current frame pitch value, and an integer sub-multiple of the current frame pitch value. - View Dependent Claims (11, 12, 13, 14, 15)
a roughness factor function that determines the roughness factor from the magnitudes of all harmonic peaks of a magnitude spectrum and magnitudes of all harmonic peaks of a logarithmic spectrum of the frequency transformed window.
-
-
12. The pitch adjuster according to claim 11, wherein the roughness factor function uses a difference between the value of every other harmonic peak in the logarithmic magnitude spectrum and an average of the values of the two peaks adjacent thereto.
-
13. The pitch adjuster according to claim 10, wherein the roughness factor, β
- , is given by
wherein Y is the frequency transformed window and ƒ
0c[1] is the future frame pitch candidate.
- , is given by
-
14. The pitch adjuster according to claim 10, wherein the subharmonic pitch corrector further comprises:
-
a high roughness decision function that doubles the future frame pitch candidate when the roughness factor β
exceeds a first predetermined value; and
a neural decision function that determines whether to double the future frame pitch candidate using a neural network.
-
-
15. The pitch adjuster according to claim 14, wherein the output of the neural network is based on inputs comprising at least one of the roughness factor, a ratio of the mid-term pitch value to the future frame pitch candidate, and a ratio of a maximum magnitude of the pitch detection function within a narrow frequency range around a frequency that is one third the future frame pitch candidate to the magnitude of the pitch detection function at the future frame pitch candidate.
-
16. A system controller comprises a pitch determiner that generates a smoothed pitch value for a current frame of a low bit rate voice message, the pitch determiner comprising:
-
a band autocorrelator that determines a plurality of band autocorrelations that correspond to a plurality of bands of a frequency transformed window of the digital samples, the frequency transformed window corresponding to a future frame of digital samples, comprising a vector filter that generates a reverse filtered spectrum by performing a magnitude transform, a logarithmic transform, and a reverse spectral filtering of the frequency transformed window, and a spectral autocorrelator that generates the band autocorrelations by applying a spectral autocorrelation function to each band of the reverse filtered spectrum;
a pitch function generator that determines a pitch detection function using the plurality of band autocorrelations;
a pitch candidate selector that selects a future frame pitch candidate from the pitch detection function; and
a pitch adjuster that generates a smoothed pitch value from the future frame pitch candidate and the pitch detection function, comprising a subharmonic pitch corrector that determines a corrected future frame pitch value by performing pitch subharmonic correction of the future frame pitch candidate using a roughness measure of the frequency transformed window, and a pitch smoother that determines a smoothed pitch value from the corrected future frame pitch value, the current frame pitch value, and a past frame pitch value.
-
-
17. A method used in a system controller that generates a smoothed pitch value for a current frame of a low bit rate voice message, the method comprising the steps of:
-
generating a pitch detection function (PDF) for each frame of digital samples of a voice signal, comprising the step of generating the pitch detection function based on a plurality of band autocorrelations;
selecting a future frame pitch candidate from a pitch detection function (PDF), comprising the steps of identifying a smallest low frequency peak of the PDF using a fine tune function, and identifying a largest high frequency peak of the PDF using the fine tune function; and
generating the smoothed pitch value, comprising a step of determining a future frame pitch value by performing pitch subharmonic correction of a future frame pitch candidate using a roughness factor of the frequency transformed window.
-
Specification