Compression of stored waveforms for artificial speech
First Claim
1. In a real-time text-to-speech conversion system in which waveforms are encoded for storage in terms of digital samples, the frequency of occurrence of the various possible sample levels being non-uniform, the improvement comprising encoding corresponding samples of a pair of successive waveforms for storage by a Huffman coding in which the sample values themselves are encoded for the first waveform of said pair, and the value of the difference between each sample and the corresponding sample of said first waveform is encoded for the second waveform of said pair.
3 Assignments
0 Petitions
Accused Products
Abstract
In a digital text-to-speech conversion system of the type usually contained in all-software form on a floppy disk, memory requirements for the storage of digitized waveform samples are reduced while speech quality is improved, by providing compression techniques and anti-distortion techniques which interact to provide clear speech at widely varying speeds with a minimum of memory. These techniques include using Huffman coding of first- or second-order differences, encoding only differences between successive waveforms where feasible, using a demi-diphone organization of the speech to allow use of the same instruction lists for several sounds, selectively deleting or repeating waveforms in the concatenation to vary speed without affecting pitch, and encoding waveforms linearly or anti-logarithmically for storage while converting the stored linear or anti-logarithmic codes to logarithmic codes such as μ-law codes upon retrieval.
189 Citations
3 Claims
- 1. In a real-time text-to-speech conversion system in which waveforms are encoded for storage in terms of digital samples, the frequency of occurrence of the various possible sample levels being non-uniform, the improvement comprising encoding corresponding samples of a pair of successive waveforms for storage by a Huffman coding in which the sample values themselves are encoded for the first waveform of said pair, and the value of the difference between each sample and the corresponding sample of said first waveform is encoded for the second waveform of said pair.
Specification