Speech signal compression device, speech signal compression method, and program
First Claim
1. A speech signal compression device comprising:
- division-according-to-phoneme means for acquiring a speech signal indicating a speech waveform to be compressed, and dividing the speech signal waveform for individual phonemes;
a filter for filtering the divided speech signal to extract a pitch signal;
phase adjustment means for separating the speech signal into sections based on the pitch signal extracted by the filter and adjusting, for each of the sections, phase based on correlation relation among the separated speech signal and the pitch signal;
sampling means for determining, for each of the sections for which the phase has been adjusted by the phase adjustment means, the sampling length based on the phase and generating a sampling signal by performing sampling in accordance with the sampling length;
speech signal processing means for processing the sampling signal to be a pitch waveform signal based on the result of the adjustments by the phase adjustment means and the value of the sampling length;
sub-band data generation means for generating sub-band data indicating change with time of spectral distribution of each of the phonemes based on the pitch waveform signal; and
compression-according-to-phoneme means for performing data compression of the sub-band data in accordance with a predetermined condition specified for a phoneme indicated by the sub-band data;
wherein the compression-according-to-phoneme means performs data compression of sub-band data by changing the sub-band data in such a manner as to delete a predetermined spectral component from the sub-band data.
5 Assignments
0 Petitions
Accused Products
Abstract
The present invention provides a speech signal compression device which allows a storage capacity of data representing speech to be efficiently compressed. In the present invention, a computer C1 operates with respect to speech data to be compressed into speech data for each phoneme on the basis of phoneme labeling data, to unify the time length of a unit pitch section for each of the divided speech data into the same value, thereby creating a pitch waveform and creating a sub-band data representing variation in time of spectrum components of the pitch waveform signal. Also, this sub-band data is compressed so as to match a condition designated by a table for compression, and the compressed data is further encoded in entropy to output the entropy coded data.
-
Citations
4 Claims
-
1. A speech signal compression device comprising:
-
division-according-to-phoneme means for acquiring a speech signal indicating a speech waveform to be compressed, and dividing the speech signal waveform for individual phonemes; a filter for filtering the divided speech signal to extract a pitch signal; phase adjustment means for separating the speech signal into sections based on the pitch signal extracted by the filter and adjusting, for each of the sections, phase based on correlation relation among the separated speech signal and the pitch signal; sampling means for determining, for each of the sections for which the phase has been adjusted by the phase adjustment means, the sampling length based on the phase and generating a sampling signal by performing sampling in accordance with the sampling length; speech signal processing means for processing the sampling signal to be a pitch waveform signal based on the result of the adjustments by the phase adjustment means and the value of the sampling length; sub-band data generation means for generating sub-band data indicating change with time of spectral distribution of each of the phonemes based on the pitch waveform signal; and compression-according-to-phoneme means for performing data compression of the sub-band data in accordance with a predetermined condition specified for a phoneme indicated by the sub-band data; wherein the compression-according-to-phoneme means performs data compression of sub-band data by changing the sub-band data in such a manner as to delete a predetermined spectral component from the sub-band data. - View Dependent Claims (2, 3, 4)
-
Specification