Method and apparatus for judging voiced and unvoiced conditions of speech signal
First Claim
1. A method of judging voiced and unvoiced conditions of a speech signal, comprising the steps of determining a ratio φ
- (τ
s)/φ
(o) between the value φ
(o) of the autocorrelation function of a speech signal at a zero delay time, and the value φ
(τ
s) of the autocorrelation function at a delay time τ
s of a sampling period, and combining said ratio with a parameter extracted from the speech signal by correlation technique and representing the degree of the periodicity of the speech signal thereby judging that the speech signal is in a voiced condition or an unvoiced condition.
1 Assignment
0 Petitions
Accused Products
Abstract
The voiced and unvoiced conditions of a speech signal are judged by combining a ratio (defined as the parcor coefficient k1) φ(τs)/φ(o) between the value φ(o) of the autocorrelation function of the speech signal at a zero delay time, and the value φ(τs) of the autocorrelation function at a delay time τs of the sampling period with a parameter extracted from the speech signal by a correlation technique and representing the degree of periodicity (Pm) of the speech signal. By comparing the result of the combination against a predetermined threshold it can be determined whether the speech signal is in a voiced condition or in an unvoiced condition.
50 Citations
24 Claims
-
1. A method of judging voiced and unvoiced conditions of a speech signal, comprising the steps of determining a ratio φ
- (τ
s)/φ
(o) between the value φ
(o) of the autocorrelation function of a speech signal at a zero delay time, and the value φ
(τ
s) of the autocorrelation function at a delay time τ
s of a sampling period, and combining said ratio with a parameter extracted from the speech signal by correlation technique and representing the degree of the periodicity of the speech signal thereby judging that the speech signal is in a voiced condition or an unvoiced condition. - View Dependent Claims (2, 3, 4, 24)
- (τ
-
5. A method of judging voiced and unvoiced conditions of a speech signal comprising the steps of determining a ratio φ
- (τ
s)/φ
(o) between the value φ
(o) of the autocorrelation function of a speech signal at a zero delay time and the value φ
(τ
s) of the autocorrelation function at a delay time τ
s of a sampling period, multiplying said ratio with a constant a to obtain a product, adding said product to the normalized value φ
(T)/φ
(o) of the autocorrelation function at a delay time T corresponding to the pitch period of the speech signal to obtain a sum, and comparing said sum with a predetermined theshold value thereby judging that the speech signal is in an unvoiced condition when said sum is smaller than said threshold value and that the speech signal is in a voiced condition in the other case.
- (τ
-
6. A method of judging voiced and unvoiced conditions of a speech signal, comprising the steps of determining a ratio φ
- (τ
s)/φ
(o) between the value φ
(o) of the autocorrelation function of a speech signal at a zero delay time of a speech signal, and the value φ
(τ
s) of the autocorrelation coefficient at a delay time τ
s of a sampling period, multiplying said ratio with the normalized value of the autocorrelation function at a delay time T corresponding to the pitch period of the speech signal to obtain a product, and comparing the product with a predetermined threshold value thereby judging that the speech signal is in an unvoiced condition when said product is smaller than said threshold value and that the speech signal is in a voiced condition in the other case.
- (τ
-
7. A method of judging voiced and unvoiced condition of a speech signal, comprising the steps of determining a ratio φ
- (τ
s)/φ
(o) between the value φ
(o) of the autocorrelation function of a speech waveform at a zero delay time, and the value φ
(τ
s) of the autocorrelation function at a delay time τ
s of a sampling period, multiplying said ratio with a constant b to obtain a product, adding said product to the normalized value W(T)/W(o) of the autocorrelation function at a delay time T corresponding to the pitch period of the residual signal obtainable by a linear predictive analysis of the speech signal to obtain a sum, and comparing said sum with a predetermined threshold value thereby judging that the speech signal is in an unvoiced condition and that the speech signal is in a voiced condition in the other case.
- (τ
-
8. A method of judging voiced and unvoiced conditions of a speech signal comprising the steps of determining a ratio φ
- (τ
s)/φ
(o) between the value φ
(o) of the autocorrelation function of a speech signal at a zero delay time, and the value φ
(τ
s) of the autocorrelation function of a sampling period, at a delay time τ
s, multiplying said ratio with the normalized value W(T)/W(o) at a delay time T corresponding to the pitch period of the autocorrelation function of the residual signal obtainable by the linear predictive analysis of the speech signal to obtain a product, and comparing said product with a predetermined threshold value thereby judging that the speech signal is in an unvoiced condition when said product is smaller than said threshold value and that the speech signal is in a voiced condition in the other case.
- (τ
-
9. A method of judging voiced and unvoiced conditions of a speech signal, comprising the steps of determining a ratio φ
- (τ
s)/φ
(o) between the value φ
(o) of the autocorrelation function of a speech signal at a zero delay time, and the value φ
(τ
s) of the autocorrelation function of a sampling period at a delay time τ
s, multiplying said ratio with a constant a to obtain a product, subtracting the value DT at a delay time T corresponding to the pitch period of the average magnitude difference function of the residual signal obtainable by the linear predictive analysis of the speech signal thus obtaining a difference, and comparing said difference with a predetermined threshold value thereby judging that the speech signal is in an unvoiced condition when said difference is larger than said threshold value and that the speech signal is in a voiced condition in the other case.
- (τ
-
10. Apparatus for judging voiced and unvoiced conditions of a speech signal, comprising means for deriving a signal representative of a ratio k1 = φ
- (τ
s)/φ
(o) between the value φ
(o) of the autocorrelation function of the speech signal at a zero delay time, and the value φ
(τ
s) of the autocorrelation function of the speech signal at a delay time τ
s of a sampling period, means for deriving a signal representative of a parameter ρ
m extracted from the speech signal by correlation technique and representing the degree of the periodicity of the speech signal, means for combining said k1 ratio signal with said ρ
m signal to derive a resultant signal and means for comparing the resultant signal to a threshold signal t determined by the maximum value of the autocorrelation coefficient of the parameter ρ
m when the ratio k1 is equal to zero to judge whether the speech signal is in a voiced condition or an unvoiced conditon.
- (τ
-
11. Apparatus for judging voiced and unvoiced conditions of a speech signal comprising means for deriving a signal representative of a ratio k1 = φ
- (τ
s)/φ
(o) between the value φ
(o) of the autocorrelation function of the speech signal at a zero delay time and the value φ
(τ
s) of the autocorrelation function of the speech signal at a delay time τ
s of a sampling period, means for multiplying said k1 signal with a constant a to obtain a product, means for adding said product to a signal ρ
m representative of the normalized value φ
(T)/φ
(o) of the autocorrelation function of the speech signal at a delay time T corresponding to the pitch period of the speech signal to obtain a sum signal, and means for comparing said sum signal with a predetermined threshold signal t determined by the maximum value of the autocorrelation coefficient of the speech signal when the ratio k1 is equal to zero to thereby judge whether the speech signal is in an unvoiced condition if said sum is smaller than said threshold value and that the speech signal is in a voiced condition if the said sum is larger than said threshold value.
- (τ
-
12. Apparatus for judging voiced and unvoiced conditions of a speech signal, comprising means for deriving a signal representative of a ratio k1 = φ
- (τ
s)/φ
(o) between the value φ
(o) of the autocorrelation function of the speech signal at a zero delay time, and the value φ
(τ
s) of the autocorrelation coefficient of the speech signal at a delay time τ
s of a sampling period, means for multiplying said k1 signal with a signal representative of a normalized value W(T)/W(o) of the autocorrelation function at a delay time T corresponding to the pitch period of the residual signal to obtain a product signal, and means for comparing the product signal with a predetermined threshold signal t determined by the maximum value of the autocorrelation coefficient of the residual signal when the ratio k1 is equal to zero to thereby judge whether the speech signal is in an unvoiced condition if said product signal is smaller than said threshold signal and that the speech signal is in a voiced condition if the product signal is larger than said threshold signal.
- (τ
-
13. Apparatus for judging voiced and unvoiced condition of a speech signal, comprising means for deriving a signal k1 representative of a ratio φ
- (τ
s)/φ
(o) between the value φ
(o) of the autocorrelation function of a speech signal at a zero delay time, and the value φ
(τ
s) of the autocorrelation function of the speech signal at a delay time τ
s of a sampling period, means for multiplying said ratio signal k1 with a constant b to obtain a product signal, adding said product signal with a signal representative of the normalized value W(T)/W(o) of the autocorrelation function at a delay time T corresponding to the pitch period of a residual signal obtained by a linear predictive analysis of the speech signal to thereby obtain a sum signal, and means for comparing said sum signal with a predetermined threshold value t determined by the maximum value of the autocorrelation coefficient of the residual signal when the ratio value k1 is equal to zero to thereby judge whether the speech signal is in an unvoiced condition or the speech signal is in a voiced condition.
- (τ
-
14. Apparatus for judging voiced and unvoiced conditions of a speech signal comprising means for deriving a signal k representative of a ratio φ
- (τ
s)/φ
(o) between the value φ
(o) of the autocorrelation function of a speech signal at a zero delay time, and the value φ
(τ
s) of the autocorrelation function of the speech signal at a delay time s of a sampling period, means for multiplying said k1 signal with a signal representative of the normalized value W(T)/W(o) of the autocorrelation function at a delay time T corresponding to the pitch period of a residual signal obtainable by linear predictive analysis of the speech signal to thereby obtain a product signal, means for comparing said product value with a predetermined threshold value t determined by the maximum value of the autocorrelation coefficient of the speech signal under conditions where the ratio value k1 equals zero to thereby judge whether the speech signal in in an unvoiced condition if said product value is smaller than said threshold value and that the speech signal is in a voiced condition if the product signal is larger than said threshold signal.
- (τ
-
15. Apparatus for judging voiced and unvoiced conditions of a speech signal, comprising means for deriving a signal k representative of a ratio φ
- (τ
s)/φ
(o) between the value φ
(o) of the autocorrelation function of a speech signal at a zero delay time, and the value φ
(τ
s) of the autocorrelation function of the speech signal at a delay time τ
s of a sampling period, means for multiplying said signal k1 with a constant a to obtain a product signal, means for subtracting said product signal from a signal representative of a parameter extracted from the speech signal by correlation technique and representing the degree of periodicity of the speech signal to derive a difference signal D (τ
) representative of the average magnitude difference function of a residual signal obtained by the linear predictive analysis of the speech signal, and means for comparing said difference signal with a predetermined threshold value t determined by the maximum value of the autocorrelation coefficient of the speech signal when the ratio k1 is equal to zero to judge whether the speech signal is in an unvoiced condition if said difference signal is larger than said threshold value and that the speech signal is in a voiced condition if said difference signal is smaller than said threshold value.
- (τ
-
16. Apparatus for judging voiced and unvoiced conditions of a speech signal comprising partial correlation coefficient analyzer means responsive to an input speech signal to be judged for deriving a ratio signal k1 = φ
- (τ
s)/φ
(o) between the value φ
(o) of the autocorrelation function of the speech signal at zero dealy time and the value φ
(τ
s) of the autocorrelation function at the speech signal at a delay time τ
s of the sampling period, pitch period detector means responsive to the autocorrelation function signal values supplied from said partial correlation coefficient analyzer means for extracting by correlation technique a normalized autocorrelation function value signal ρ
m representing the degree of periodicity of the speech signal, and voiced/unvoiced detector means responsive to the ratio signal k1 and the normalized correlation function value signal ρ
m for combining said k1 and ρ
m signals and comparing the resultant signal to a threshold signal t determined by the maximum value of the autocorrelation coefficient values of the residual or the speech signals when the ratio signal k1 = o to thereby judge whether the speech signal is in a voiced or unvoiced condition. - View Dependent Claims (17, 18, 19, 20, 21, 22, 23)
- (τ
Specification