Speech characteristic extraction method speech charateristic extraction device speech recognition method and speech recognition device
First Claim
1. A speech characteristic extraction method that extracts a speech characteristic required for speech recognition, wherein an autocorrelation function of a speech signal is determined, and a value Φ
- (0) of when a delay time of the autocorrelation function is 0, a delay time τ
1 and an amplitude φ
1 of a first peak of the autocorrelation function, and an effective duration time τ
e of the autocorrelation function are extracted from the autocorrelation function.
1 Assignment
0 Petitions
Accused Products
Abstract
Speech characteristics are obtained using a minimum of parameters, which correspond to auditory perception characteristics, without carrying out spectral analysis, by determining an ACF (autocorrelation function) of a speech signal collected by a microphone, and deriving from the ACF a value Φ (0) of when a delay time of the ACF is 0, a delay time τ1 and an amplitude φ1 of a first peak of the ACF, and an effective duration time τe of the ACF. Furthermore, it is possible to achieve highly accurate recognition that reflects human perception in actual sound fields by determining an interaural crosscorrelation function (IACF) of the speech signal, and extracting from the IACF a maximum value IACC of the IACF, a delay time τIACC of a peak of the IACF, and a width WIACC of the maximum amplitude of the IACF, and including these IACF factors, that is, spatial information of the sound field.
-
Citations
12 Claims
-
1. A speech characteristic extraction method that extracts a speech characteristic required for speech recognition, wherein an autocorrelation function of a speech signal is determined, and a value Φ
- (0) of when a delay time of the autocorrelation function is 0, a delay time τ
1 and an amplitude φ
1 of a first peak of the autocorrelation function, and an effective duration time τ
e of the autocorrelation function are extracted from the autocorrelation function. - View Dependent Claims (3, 4, 6)
- (0) of when a delay time of the autocorrelation function is 0, a delay time τ
-
2. A speech characteristic extraction device that extracts a speech characteristic required for speech recognition, comprising:
-
a microphone;
a computing means for determining an autocorrelation function of a speech signal collected by the microphone; and
an extraction means for extracting from the autocorrelation function a value Φ
(0) of when a delay time of the autocorrelation function is 0, a delay time τ
1 and an amplitude φ
1 of a first peak of the autocorrelation function, and an effective duration time τ
e of the autocorrelation function. - View Dependent Claims (5)
-
-
7. A speech characteristic extraction method that extracts a speech characteristic required for speech recognition, wherein:
an autocorrelation function and an interaural crosscorrelation function of a binaurally measured speech signal are respectively determined, and a delay time τ
1 and an amplitude φ
1 of a first peak of the autocorrelation function, an effective duration time τ
e of the autocorrelation function, a maximum value IACC of the interaural crosscorrelation function, a delay time τ
IACC of a peak of the interaural crosscorrelation function, a width WIACC of a maximum amplitude of the interaural crosscorrelation function of the speech signal, and a value Φ
(0) of when a delay time of the autocorrelation function or the interaural crosscorrelation function is 0, are extracted from the autocorrelation function and the interaural crosscorrelation function.- View Dependent Claims (9, 10, 12)
-
8. A speech characteristic extraction device that extracts a speech characteristic required for speech recognition, comprising:
-
a binaural microphone;
a computing means for respectively determining an autocorrelation function and an interaural crosscorrelation function of a speech signal collected by the microphone; and
an extraction means for extracting from the autocorrelation function and the interaural crosscorrelation function a delay time τ
1 and an amplitude φ
1 of a first peak of the autocorrelation function, an effective duration time τ
e of the autocorrelation function, a maximum value IACC of the interaural crosscorrelation function, a delay time τ
IACC of a peak of the interaural crosscorrelation function, a width WIACC of a maximum amplitude of the interaural crosscorrelation function, and a value Φ
(0) of when a delay time of the autocorrelation function, or the interaural crosscorrelation function, is 0, are extracted. - View Dependent Claims (11)
-
Specification