Phoneme-delta based speech compression
First Claim
Patent Images
1. A method, comprising:
- receiving original speech data;
compressing the original speech data based on a phoneme stream, detected from the original speech data, and a delta stream, extracted based on the difference between a speech signal stream, generated using the phoneme stream with respect to a voice font, and the original speech data, to generate compressed speech data;
sending the compressed speech data;
receiving the compressed speech data; and
decompressing the compressed speech data based on a decompressed phoneme stream and a decompressed delta stream to generate recovered speech data.
1 Assignment
0 Petitions
Accused Products
Abstract
An arrangement is provided for compressing speech data. Speech data is compressed based on a phoneme stream, detected from the speech data, and a delta stream, determined based on the difference between the speech data and a speech signal stream, generated using the phoneme stream with respect to a voice font. The compressed speech data is decompressed into a decompressed phoneme stream and a decompressed delta stream from which the speech data is recovered.
-
Citations
32 Claims
-
1. A method, comprising:
-
receiving original speech data;
compressing the original speech data based on a phoneme stream, detected from the original speech data, and a delta stream, extracted based on the difference between a speech signal stream, generated using the phoneme stream with respect to a voice font, and the original speech data, to generate compressed speech data;
sending the compressed speech data;
receiving the compressed speech data; and
decompressing the compressed speech data based on a decompressed phoneme stream and a decompressed delta stream to generate recovered speech data. - View Dependent Claims (2, 3)
extracting the phoneme stream from the original speech data;
compressing the phoneme stream to generate phoneme compression;
generating the delta stream based on the difference between the speech signal stream generated using the phoneme stream with respect to the voice font and the original speech data;
compressing the delta stream to generate delta compression; and
integrating the phoneme compression and the delta compression to generate the compressed speech data.
-
-
3. The method according to claim 2, wherein the decompressing the compressed speech data comprises:
-
decomposing the compressed speech data into the phoneme compression and the delta compression;
decompressing the phoneme compression to generate a decompressed phoneme stream;
decompressing the delta compression to generate a decompressed delta stream; and
generating the recovered speech data based on the decompressed phoneme stream and the decompressed delta stream.
-
-
4. A method for phoneme-delta based speech compression, comprising:
-
receiving original speech data;
compressing a phoneme stream, extracted from the original speech data, to generate phoneme compression;
compressing a delta stream, extracted based on the difference between a speech signal stream, generated based on the phoneme stream with respect to a voice font, and the original speech data, to generate delta compression; and
integrating the phoneme compression and the delta compression to generate compressed speech data. - View Dependent Claims (5, 6)
extracting a plurality of phonemes from the original speech data to generate the phoneme stream; and
compressing the phoneme stream.
-
-
6. The method according to claim 4, wherein the compressing the delta stream comprises:
-
generating the speech signal stream based on the phoneme stream with respect to the voice font;
generating the delta stream based on the difference between the speech signal stream and the original speech data; and
compressing the delta stream.
-
-
7. A method for phoneme-delta based speech decompression, comprising:
-
receiving compressed speech data that is compressed based on a phoneme compression and a delta compression;
decompressing the phoneme compression to generate a phoneme based speech signal stream;
decompressing the delta compression to generate a decompressed delta stream; and
generating recovered speech data by integrating the phoneme based speech signal stream with the decompressed delta stream. - View Dependent Claims (8)
decompressing the phoneme compression to generate a decompressed phoneme stream; and
synthesizing the phoneme based speech signal stream based on the decompressed phoneme stream with respect to a voice font.
-
-
9. A method for use of phoneme-delta based speech compression and decompression, comprising:
-
generating original speech data;
performing phoneme-delta based speech compression on the original speech data to generate compressed speech data;
sending the compressed speech data;
receiving the compressed speech data;
performing phoneme-delta based speech decompression on the received compressed speech data to generate a recovered speech data. - View Dependent Claims (10)
storing the compressed speech data, received by the receiving;
analyzing the compressed speech data, received by the receiving;
playing back the compressed speech data;
storing the recovered speech data;
analyzing the recovered speech data; and
playing back the recovered speech data.
-
-
11. A system, comprising:
-
a phoneme-delta based speech compression mechanism for compressing original speech data based on a phoneme stream, detected from the original speech data, and a delta stream, extracted based on the difference between a speech signal stream, generated using the phoneme stream with respect to a voice font, and the original speech data, to generate compressed speech data comprising phoneme compression and delta compression; and
a phoneme-delta based speech decompression mechanism for decompressing the compressed speech data with the phoneme compression and the delta compression to generate a recovered speech data. - View Dependent Claims (12)
the phoneme-delta based speech compression mechanism comprises;
a phoneme based compression channel that compresses the original speech data according to the phoneme stream to generate the phoneme compression;
a delta based compression channel that compresses the original speech data according to the delta stream to generate the delta compression; and
an integration mechanism for integrating the phoneme compression with the delta compression to generate the compressed speech data, the phoneme-delta based speech decompression mechanism comprises;
a phoneme based decompression channel that decompresses the phoneme compression to produce a decompressed phoneme stream based on which a phoneme based speech stream is generated with respect to the voice font;
a delta based decompression channel that decompresses the delta compression to generate the delta stream; and
a reconstruction mechanism for constructing the recovered speech data based on the phoneme based speech stream and the delta stream.
-
-
13. A system for phoneme-delta based speech compression, comprising:
-
a phoneme based speech compression channel for compressing original speech data according to a phoneme stream, detected from the original speech data, to generate a phoneme compression;
a delta based compression channel for compressing the original speech data according to a delta stream, determined according to the difference between a speech signal stream, generated based on the phoneme stream with respect to a voice font, and the original speech data, to generate a delta compression; and
an integration mechanism for integrating the phoneme compression with the delta compression to generate compressed speech data. - View Dependent Claims (14, 15, 16)
a phoneme recognizer for detecting the phoneme stream from the original speech data;
a phoneme-to-speech engine for synthesizing the speech signal stream using the phoneme stream with respect to the voice font; and
a phoneme compressor for compressing the phoneme stream to generate the phoneme compression.
-
-
15. The system according to claim 14, wherein the delta based compression channel comprises:
-
a delta detection mechanism for extracting the delta stream based on the difference between the original speech data and the speech signal stream; and
a delta compressor for compressing the delta stream to generate the delta compression.
-
-
16. The system according to claim 15, the delta compressor comprises:
-
a delta stream filter for filtering the delta stream to generate a filtered delta stream; and
an audio signal compression mechanism for compressing the filtered delta stream to generate the delta compression.
-
-
17. A system for phoneme-delta based speech decompression, comprising:
-
a decomposition mechanism for decomposing a phoneme-delta based compressed speech data into a phoneme compression and a delta compression;
a phoneme based decompression channel that decompresses the phoneme compression to produce a phoneme based speech stream generated with respect to a voice font;
a delta based decompression channel with a delta based decompressor for decompressing the delta compression to generate a delta stream; and
a reconstruction mechanism for constructing recovered speech data based on the phoneme based speech stream and the delta stream. - View Dependent Claims (18)
a phoneme decompressor for decompressing the phoneme compression to generate a decompressed phoneme stream; and
a phoneme-to-speech engine for synthesizing the phoneme based speech stream based on the decompressed phoneme stream with respect to the voice font.
-
-
19. A system, comprising:
-
a speech data generation source for generating original speech data and for sending compressed speech data encoded using a phoneme-delta based speech compression scheme, the compressed speech data being generated based on a phoneme stream and a delta stream, both detected based on the original speech data;
a speech data receiving destination for use of speech data recovered from the compressed speech data. - View Dependent Claims (20)
the speech data generation source comprises: a speech data generation mechanism for generating the original speech data; and
a phoneme-delta based speech compression mechanism for compressing the original speech data based on a phoneme stream and a delta stream to generate the compressed speech data. the speech data receiving destination comprises;
a phoneme-delta based speech decompression mechanism for decompressing the compressed speech data to generate the recovered speech data;
a speech data application mechanism for utilizing the compressed speech data and the recovered speech data.
-
-
21. A computer-readable medium encoded with a program in a receiving network end point, the program, when executed, causing:
-
receiving a plurality of packets, sent from an initiating network end point, with a corresponding plurality of destination spacings between pairs of adjacent received packets;
deriving an average destination spacing based on the destination spacings; and
sending the plurality of destination spacings and the average destination spacing. - View Dependent Claims (22)
receiving an average actual source spacing and an inter-departure jitter measure, sent from the initiating network end point; and
estimating the jitter between the initiating network end point and the receiving network end point and an associated confidence measure based on the average actual source spacing, the inter-departure jitter measure, the destination spacings, and the average destination spacing.
-
-
23. A computer-readable medium encoded with a program, the program, when executed, causing:
-
receiving original speech data;
compressing the original speech data based on a phoneme stream, detected from the original speech data, and a delta stream, extracted based on the difference between a speech signal stream, generated using the phoneme stream with respect to a voice font, and the original speech data, to generate compressed speech data;
sending the compressed speech data;
receiving the compressed speech data; and
decompressing the compressed speech data based on a decompressed phoneme stream and a decompressed delta stream to generate recovered speech data. - View Dependent Claims (24, 25)
extracting the phoneme stream from the original speech data;
compressing the phoneme stream to generate phoneme compression;
generating the delta stream based on the difference between the speech signal stream generated using the phoneme stream with respect to the voice font and the original speech data;
compressing the delta stream to generate delta compression; and
integrating the phoneme compression and the delta compression to generate the compressed speech data.
-
-
25. The medium according to claim 23, wherein the decompressing the compressed speech data comprises:
-
decomposing the compressed speech data into the phoneme compression and the delta compression;
decompressing the phoneme compression to generate a decompressed phoneme stream;
decompressing the delta compression to generate a decompressed delta stream; and
generating the recovered speech data based on the decompressed phoneme stream and the decompressed delta stream.
-
-
26. A computer-readable medium encoded with a program for phoneme-delta based speech compression, the program, when executed, causing:
-
receiving original speech data;
compressing a phoneme stream, extracted from the original speech data, to generate phoneme compression;
compressing a delta stream, extracted based on the difference between a speech signal stream, generated based on the phoneme stream with respect to a voice font, and the original speech data, to generate delta compression; and
integrating the phoneme compression and the delta compression to generate compressed speech data. - View Dependent Claims (27, 28)
extracting a plurality of phonemes from the original speech data to generate the phoneme stream; and
compressing the phoneme stream.
-
-
28. The medium according to claim 26, wherein the compressing the delta stream comprises:
-
generating the speech signal stream based on the phoneme stream with respect to the voice font;
generating the delta stream based on the difference between the speech signal stream and the original speech data; and
compressing the delta stream.
-
-
29. A computer-readable medium encoded with a program for phoneme-delta based speech decompression, the program, when executed, causing:
-
receiving compressed speech data that is compressed based on a phoneme compression and a delta compression;
decompressing the phoneme compression to generate a phoneme based speech signal stream;
decompressing the delta compression to generate a decompressed delta stream; and
generating recovered speech data by integrating the phoneme based speech signal stream with the decompressed delta stream. - View Dependent Claims (30)
decompressing the phoneme compression to generate a decompressed phoneme stream; and
synthesizing the phoneme based speech signal stream based on the decompressed phoneme stream with respect to a voice font.
-
-
31. A computer-readable medium encoded with a program for use of phoneme-delta based speech compression and decompression, the program, when executed, causing:
-
generating original speech data;
performing phoneme-delta based speech compression on the original speech data to generate compressed speech data;
sending the compressed speech data;
receiving the compressed speech data;
performing phoneme-delta based speech decompression on the received compressed speech data to generate a recovered speech data. - View Dependent Claims (32)
storing the compressed speech data, received by the receiving;
analyzing the compressed speech data, received by the receiving;
playing back the compressed speech data;
storing the recovered speech data;
analyzing the recovered speech data; and
playing back the recovered speech data.
-
Specification