Voiced/unvoiced estimation of an acoustic signal
First Claim
1. A method for encoding an acoustic signal, the method comprising the steps of:
- A. breaking the signal into segments, each of the segments representing one of a succession of time intervals;
B. breaking each of said segments into a plurality of frequency bands; and
C. considering in turn each of the segments as the current segment, and for each of a plurality of said frequency bands of the current segment making a voiced/unvoiced decision by a method comprising the steps of;
evaluating a voicing measure for said frequency band;
making the voiced/unvoiced decision for said frequency band based upon a comparison between the voicing measure and a threshold;
determining an energy measure of the current segment;
determining a measure of the signal energy of one or more recent prior segments;
comparing the energy measure of the current segment to the measure of the signal energy of the one or more recent prior segments; and
adjusting the threshold to make a voiced decision more likely when the energy measure of the current segment is greater than the measure of the signal energy of the one or more recent prior segments.
0 Assignments
0 Petitions
Accused Products
Abstract
The pitch estimation method is improved. Sub-integer resolution pitch values are estimated in making the initial pitch estimate; the sub-integer pitch values are preferably estimated by interpolating intermediate variables between integer values. Pitch regions are used to reduce the amount of computation required in making the initial pitch estimate. Pitch-dependent resolution is used in making the initial pitch estimate, with higher resolution being used for smaller values of pitch. The accuracy of the voiced/unvoiced decision is improved by making the decision dependent on the energy of the current segment relative to the energy of recent prior segments; if the relative energy is low, the current segment favors an unvoiced decision; if high, it favors a voiced decision. Voiced harmonics are generated using a hybrid approach; some voiced harmonics are generated in the time domain, whereas the remaining harmonics are generated in the frequency domain; this preserves much of the computational savings of the frequency domain approach, while at the same time improving speech quality. Voiced harmonics generated in the frequency domain are generated with higher frequency accuracy; the harmonics are frequency scaled, transformed into the time domain with a Discrete Fourier Transform, interpolated and then time scaled.
-
Citations
10 Claims
-
1. A method for encoding an acoustic signal, the method comprising the steps of:
-
A. breaking the signal into segments, each of the segments representing one of a succession of time intervals; B. breaking each of said segments into a plurality of frequency bands; and C. considering in turn each of the segments as the current segment, and for each of a plurality of said frequency bands of the current segment making a voiced/unvoiced decision by a method comprising the steps of; evaluating a voicing measure for said frequency band; making the voiced/unvoiced decision for said frequency band based upon a comparison between the voicing measure and a threshold; determining an energy measure of the current segment; determining a measure of the signal energy of one or more recent prior segments; comparing the energy measure of the current segment to the measure of the signal energy of the one or more recent prior segments; and adjusting the threshold to make a voiced decision more likely when the energy measure of the current segment is greater than the measure of the signal energy of the one or more recent prior segments. - View Dependent Claims (4, 5, 6)
-
-
2. A method for encoding an acoustic signal, the method comprising the steps of:
-
A. breaking the signal into segments, each of the segments representing one of a succession of time intervals; B. breaking each of said segments into a plurality of frequency bands; and C. considering in turn each of the segments as the current segment, and for each of a plurality of said frequency bands of the current segment making a voiced/unvoiced decision by a method comprising the steps of; evaluating a voicing measure for said frequency band; making the voiced/unvoiced decision for said frequency band based upon a comparison between the voicing measure and a threshold; determining an energy measure of the current segment; determining a measure of the signal energy of one or more recent prior segments; comparing the energy measure of the current segment to the measure of the signal energy of the one or more recent prior segments; and adjusting the threshold to make an unvoiced decision more likely when the energy measure of the current segment is less than the measure of the signal energy of the one or more recent prior segments. - View Dependent Claims (3)
-
-
7. A method for encoding an acoustic signal, the method comprising the steps of:
-
A. breaking the signal into segments, each of the segments representing one of a succession of time intervals; B. considering in turn each of the segments as the current segment, and making a voiced/unvoiced decision for at least a frequency band of the current segment by a method comprising the steps of; evaluating a voicing measure for said frequency band; making the voiced/unvoiced decision for said frequency band based upon a comparison between the voicing measure and a threshold; determining an energy measure of the current segment; determining a measure of the signal energy of one or more consecutive preceding segments; comparing the energy measure of the current segment to the measure of the signal energy of the consecutive preceding segments; adjusting the threshold to make a voiced decision more likely when the energy measure of the current segment is greater than the measure of the signal energy of the consecutive preceding segments. - View Dependent Claims (10)
-
-
8. A method for encoding an acoustic signal, the method comprising the steps of:
-
A. breaking the signal into segments, each of the segments representing one of a succession of time intervals; B. considering in turn each of the segments as the current segment, and making a voiced/unvoiced decision for at least a frequency band of the current segment by a method comprising the steps of; evaluating a voicing measure for said frequency band; making the voiced/unvoiced decision for said frequency band based upon a comparison between the voicing measure and a threshold; determining an energy measure of the current segment; determining a measure of the signal energy of one or more consecutive preceding segments; comparing the energy measure of the current segment to the measure of the signal energy of the consecutive preceding segments; adjusting the threshold to make a voiced decision less likely when the energy measure of the current segment is less than the measure of the signal energy of the consecutive preceding segments. - View Dependent Claims (9)
-
Specification