SPEECH PROCESSING DEVICE, SPEECH PROCESSING METHOD, AND COMPUTER PROGRAM PRODUCT
First Claim
1. A speech processing device comprising:
- an input unit configured to input a speech signal;
a marking unit configured to assign a pitch mark representing a representative point in a fundamental period to the speech signal for each fundamental period;
an extractor configured to window a part of the speech signal and extract a partial waveform that is a speech waveform of the windowed part;
a calculator configured to perform frequency analysis of the partial waveform to calculate a frequency spectrum;
an estimator configured to generate an artificial waveform that is a waveform according to an interval between the pitch marks for each harmonic component having a frequency that is a predetermined multiple of a fundamental frequency of the speech signal and configured to estimate harmonic spectral features representing characteristics of the frequency spectrum of the harmonic component from each of the artificial waveforms; and
a separator configured to separate the partial waveform into a periodic component produced from periodic vocal-fold vibration as an acoustic source and an aperiodic component produced from aperiodic acoustic sources other than the vocal-fold vibration by using the respective harmonic spectral features and the frequency spectrum of the partial waveform.
1 Assignment
0 Petitions
Accused Products
Abstract
According to one embodiment, in a speech processing device, an extractor windows a part of the speech signal and extracts a partial waveform. A calculator performs frequency analysis of the partial waveform to calculate a frequency spectrum. An estimator generates an artificial waveform that is a waveform according to an interval between the pitch marks for each harmonic component having a frequency that is a predetermined multiple of a fundamental frequency of the speech signal and estimates harmonic spectral features representing characteristics of the frequency spectrum of the harmonic component from each of the artificial waveforms. A separator separates the partial waveform into a periodic component produced from periodic vocal-fold vibration as an acoustic source and an aperiodic component produced from aperiodic acoustic sources other than the vocal-fold vibration by using the respective harmonic spectral features and the frequency spectrum of the partial waveform.
20 Citations
12 Claims
-
1. A speech processing device comprising:
-
an input unit configured to input a speech signal; a marking unit configured to assign a pitch mark representing a representative point in a fundamental period to the speech signal for each fundamental period; an extractor configured to window a part of the speech signal and extract a partial waveform that is a speech waveform of the windowed part; a calculator configured to perform frequency analysis of the partial waveform to calculate a frequency spectrum; an estimator configured to generate an artificial waveform that is a waveform according to an interval between the pitch marks for each harmonic component having a frequency that is a predetermined multiple of a fundamental frequency of the speech signal and configured to estimate harmonic spectral features representing characteristics of the frequency spectrum of the harmonic component from each of the artificial waveforms; and a separator configured to separate the partial waveform into a periodic component produced from periodic vocal-fold vibration as an acoustic source and an aperiodic component produced from aperiodic acoustic sources other than the vocal-fold vibration by using the respective harmonic spectral features and the frequency spectrum of the partial waveform. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 10)
-
-
9. The speech processing device according to claim wherein
the analysis window used for windowing by the extractor is a Hanning window having a window width of 2 to 10 times a fundamental period.
-
11. A speech processing method comprising:
-
inputting a speech signal; assigning a pitch mark representing a representative point in a fundamental period to the speech signal for each fundamental period; windowing a part of the speech signal and extract a partial waveform that is a speech waveform of the windowed part; performing frequency analysis of the partial waveform to calculate a frequency spectrum; generating an artificial waveform that is a waveform according to an interval between the pitch marks for each harmonic component having a frequency that is a predetermined multiple of a fundamental frequency of the speech signal; estimating harmonic spectral features representing characteristics of the frequency spectrum of the harmonic component from each of the artificial waveforms; and separating the partial waveform into a periodic component produced from periodic vocal-fold vibration as an acoustic source and an aperiodic component produced from aperiodic acoustic sources other than the vocal-fold vibration by using the respective harmonic spectral features and the frequency spectrum of the partial waveform.
-
-
12. A computer program product comprising a computer-readable medium having programmed instructions, wherein the instructions, when executed by a computer, cause the computer to execute:
-
inputting a speech signal; assigning a pitch mark representing a representative point in a fundamental period to the speech signal for each fundamental period; windowing a part of the speech signal and extract a partial waveform that is a speech waveform of the windowed part; performing frequency analysis of the partial waveform to calculate a frequency spectrum; generating an artificial waveform that is a waveform according to an interval between the pitch marks for each harmonic component having a frequency that is a predetermined multiple of a fundamental frequency of the speech signal; estimating harmonic spectral features representing characteristics of the frequency spectrum of the harmonic component from each of the artificial waveforms; and separating the partial waveform into a periodic component produced from periodic vocal-fold vibration as an acoustic source and an aperiodic component produced from aperiodic acoustic sources other than the vocal-fold vibration by using the respective harmonic spectral features and the frequency spectrum of the partial waveform.
-
Specification