SYSTEM, METHOD AND PROGRAM FOR SPEECH PROCESSING
First Claim
1. A speech processing system for processing a speech signal using a computer, comprising:
- a first means for receiving a power spectrum of a speech signal and generating a log power spectrum of the power spectrum;
a second means for performing discrete cosine transformation on an output from the second means;
a third means for receiving an output from the second means to cut off cepstrum upper and lower terms of the output;
a fourth means for receiving an output from the third means to perform inverse discrete cosine transformation on the output;
a fifth means for converting an output from the fourth means so as to bring the output back to a power spectrum domain; and
a sixth means for filtering the power spectrum of the speech signal by using, as a filter, the output which is brought back to the power spectrum domain.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention relates to a system, method and program for speech recognition. In an embodiment of the invention a method for processing a speech signal consists of receiving a power spectrum of a speech signal and generating a log power spectrum signal of the power spectrum. The method further consists of performing discrete cosine transformation on the log power spectrum signal and cutting off cepstrum upper and lower terms of the discrete cosine transformed signal. The method further consists of performing inverse discrete cosine transformation on the signal from which the cepstrum upper and lower terms are cut off. The method further consists of converting the inverse discrete cosine transformed signal so as to bring the signal back to a power spectrum domain and filtering the power spectrum of the speech signal by using, as a filter, the signal which is brought back to the power spectrum domain.
-
Citations
13 Claims
-
1. A speech processing system for processing a speech signal using a computer, comprising:
-
a first means for receiving a power spectrum of a speech signal and generating a log power spectrum of the power spectrum; a second means for performing discrete cosine transformation on an output from the second means; a third means for receiving an output from the second means to cut off cepstrum upper and lower terms of the output; a fourth means for receiving an output from the third means to perform inverse discrete cosine transformation on the output; a fifth means for converting an output from the fourth means so as to bring the output back to a power spectrum domain; and a sixth means for filtering the power spectrum of the speech signal by using, as a filter, the output which is brought back to the power spectrum domain. - View Dependent Claims (2, 3)
-
-
4. A speech processing method for processing a speech signal using a computer, comprising:
-
receiving a power spectrum of a speech signal and generating a log power spectrum signal of the power spectrum; performing discrete cosine transformation on the log power spectrum signal; cutting off cepstrum upper and lower terms of the discrete cosine transformed signal; performing inverse discrete cosine transformation on the signal from which the cepstrum upper and lower terms are cut off; converting the inverse discrete cosine transformed signal so as to bring the signal back to a power spectrum domain; and filtering the power spectrum of the speech signal by using, as a filter, the signal which is brought back to the power spectrum domain. - View Dependent Claims (5, 6)
-
-
7. A computer program product comprising a computer useable medium including a computer readable program, wherein the speech processing program when executed on a computer causes the computer to perform the method steps for operating a computer to process a speech signal for speech recognition. The method comprising the steps of:
-
receiving a power spectrum of a speech signal and generating a log power spectrum signal of the power spectrum; performing discrete cosine transformation on the log power spectrum signal; cutting off cepstrum upper and lower terms of the discrete cosine transformed signal; performing inverse discrete cosine transformation on the signal from which the cepstrum upper and lower terms are cut off; converting the inverse discrete cosine transformed signal so as to bring the signal back to a power spectrum domain; and filtering the power spectrum of the speech signal by using, as a filter, the signal which is brought back to the power spectrum domain. - View Dependent Claims (8, 9)
-
-
10. A speech recognition system for performing speech recognition using a computer, comprising:
-
a first means for receiving a power spectrum of a speech signal and generating a log power spectrum of the power spectrum; a second means for performing discrete cosine transformation on an output of the log power spectrum; a third means for receiving an output from the second means to cut off cepstrum upper and lower terms of the output; a fourth means for receiving an output from the third means to perform inverse discrete cosine transformation on the output; a fifth means for converting an output from the fourth means so as to bring the output back to a power spectrum domain; and a sixth means for filtering the power spectrum of the speech signal by using, as a filter, the output which is brought back to the power spectrum domain, wherein speech recognition processing is performed by using the filtered power spectrum. - View Dependent Claims (11)
-
-
12. A speech output system for outputting a speech captured with a microphone using a computer, comprising:
-
a first means for performing A/D conversion on the speech captured with the microphone to output a digital speech signal; a second means for performing discrete Fourier transformation on the digital speech signal to output a power spectrum of the speech signal; a third means for receiving the power spectrum of the speech signal to generate a log power spectrum of the power spectrum; a fourth means for performing discrete cosine transformation on an output of the log power spectrum; a fifth means for receiving an output from the fourth means to cut off cepstrum upper and lower terms of the output; a sixth means for receiving an output from the fifth means to perform inverse discrete cosine transformation on the output; a seventh means for converting an output from the sixth means so as to bring the output back to a power spectrum domain; an eighth means for filtering the power spectrum of the speech signal by using, as a filter, the output which is brought back to the power spectrum domain; and a ninth means for performing D/A conversion on the filtered power spectrum to output an analog speech signal.
-
-
13. A speech processing system for processing a speech signal using a computer, comprising:
-
a first means for receiving a power spectrum of a speech signal and generating a log power spectrum of the power spectrum; a second means for performing discrete cosine transformation on an output from an output of the log power spectrum; a third means for receiving an output from the second means to cut off cepstrum upper and lower terms of the output; a fourth means for receiving an output from the third means to perform inverse discrete cosine transformation on the output; a fifth means for converting an output from the fourth means so as to bring the output back to a power spectrum domain; and a sixth means for filtering the power spectrum of the speech signal by using, as a filter, the output which is brought back to the power spectrum domain.
-
Specification