Voice compression system and method
First Claim
1. A system for performing voice compression, comprising:
- voice interface means including a voice input device for receiving analog voice signals from a user;
conversion means for converting the analog voice signals into discrete samples of digital voice data and collecting the discrete samples into segments;
means for dividing the segments into subsegments and for producing therefrom a current voice subsegment;
pitch prediction means for determining the long term predicted gain of the current voice subsegment by comparing the current voice subsegment to reconstructed voice samples to produce a pitch predictor gain and a lag component;
pitch filter means for pitch filtering the current voice subsegment based on the pitch predictor gain and the lag component and for producing therefrom long term residual samples;
means for determining the peak amplitude of the long term residual samples;
means for scaling the long term residual samples based on the peak amplitude to produce normalized long term residual samples;
means including a code book stored in a memory for comparing the normalized long term residual samples to stored distinct normalized long term residual samples stored in the memory and for locating a single memory address of a closest match between the normalized long term residual samples and the stored distinct normalized long term residual samples;
means for producing normalized reconstructed long term residual samples from the stored distinct normalized long term residual samples at the single memory address of the closest match;
means for producing a reconstructed long term residual samples from the normalized reconstructed long term residually samples;
means for inverse pitch filtering the reconstructed long term residual samples based upon the pitch predictor gain and the lag component and for producing therefrom the reconstructed voice samples; and
output means for providing the distinct memory address, the pitch predictor gain, the lag component and the peak amplitude for each voice subsegment.
0 Assignments
0 Petitions
Accused Products
Abstract
A system for performing voice compression which includes, a conversion means for converting analog voice signals into discrete samples of digital voice data and collecting the discrete samples into segments, with a means for dividing the segments into subsegments and for producing therefrom a current voice subsegment. A pitch prediction means is used for determining the long term predicted gain of the current voice subsegment by comparing the current voice subsegment to reconstructed voice samples to produce a pitch predictor gain and a lag component and a pitch filter means is used for pitch filtering the current voice subsegment based on the pitch predictor gain and the lag component which then produces long term residual samples. The system includes a means for scaling the long term residual samples based on the peak amplitude of the long term residual samples to produce normalized long term residual samples and a means including a code book stored in a memory for locating a single memory address of a closest match between the normalized long term residual samples and the stored distinct normalized long term residual samples. The system includes an output means for providing the distinct memory address, the pitch predictor gain, the lag component and the peak amplitude for each voice subsegment and a means to reconstruct these samples based on these parameters.
172 Citations
2 Claims
-
1. A system for performing voice compression, comprising:
-
voice interface means including a voice input device for receiving analog voice signals from a user; conversion means for converting the analog voice signals into discrete samples of digital voice data and collecting the discrete samples into segments; means for dividing the segments into subsegments and for producing therefrom a current voice subsegment; pitch prediction means for determining the long term predicted gain of the current voice subsegment by comparing the current voice subsegment to reconstructed voice samples to produce a pitch predictor gain and a lag component; pitch filter means for pitch filtering the current voice subsegment based on the pitch predictor gain and the lag component and for producing therefrom long term residual samples; means for determining the peak amplitude of the long term residual samples; means for scaling the long term residual samples based on the peak amplitude to produce normalized long term residual samples; means including a code book stored in a memory for comparing the normalized long term residual samples to stored distinct normalized long term residual samples stored in the memory and for locating a single memory address of a closest match between the normalized long term residual samples and the stored distinct normalized long term residual samples; means for producing normalized reconstructed long term residual samples from the stored distinct normalized long term residual samples at the single memory address of the closest match; means for producing a reconstructed long term residual samples from the normalized reconstructed long term residually samples; means for inverse pitch filtering the reconstructed long term residual samples based upon the pitch predictor gain and the lag component and for producing therefrom the reconstructed voice samples; and output means for providing the distinct memory address, the pitch predictor gain, the lag component and the peak amplitude for each voice subsegment.
-
-
2. A method for compression speech information, comprising the steps of:
-
receiving an analog speech signal; digitizing the speech signal into a discrete samples of distal voice data and collecting the discrete samples into segments; dividing the segments into subsegments and producing at least one current subsegment; comparing the current subsegment to a reconstructed voice sample; determining the pitch predictor gain of the current subsegment; determining the lag component of the current subsegment; pitch filtering the current subsegment using the pitch predictor gain and the lag component and producing long term residual samples; detecting the amplitude of the long term predicted residual samples; scaling the long term predicted residual signal to produce a normalized long term predicted residual samples; comparing the normalized long term predicted residual samples to stored distinct normalized long term predicted residual samples stored in a memory, and for locating a single memory address of the closest match between the distinct brutalized long term predicted residual samples and the stored distinct normalized long term predicted residual samples; producing normalized reconstructed long term residual samples from the stored distinct normalized long term residual samples at the, single memory address of the closest match; producing reconstructed long term residual samples from the normalized reconstructed long term residual samples; and inverse pitch filtering the reconstructed long term residual samples based upon the pitch predictor gain and the lag component and producing therefrom the reconstructed voice samples.
-
Specification