Voice compression system and method

US 5,592,586 A
Filed: 08/11/1994
Issued: 01/07/1997
Est. Priority Date: 01/08/1993
Status: Expired due to Fees

First Claim

Patent Images

1. A system for performing voice compression, comprising:

voice interface means including a voice input device for receiving analog voice signals from a user;

conversion means for converting the analog voice signals into discrete samples of digital voice data and collecting the discrete samples into segments;

means for dividing the segments into subsegments and for producing therefrom a current voice subsegment;

pitch prediction means for determining the long term predicted gain of the current voice subsegment by comparing the current voice subsegment to reconstructed voice samples to produce a pitch predictor gain and a lag component;

pitch filter means for pitch filtering the current voice subsegment based on the pitch predictor gain and the lag component and for producing therefrom long term residual samples;

means for determining the peak amplitude of the long term residual samples;

means for scaling the long term residual samples based on the peak amplitude to produce normalized long term residual samples;

means including a code book stored in a memory for comparing the normalized long term residual samples to stored distinct normalized long term residual samples stored in the memory and for locating a single memory address of a closest match between the normalized long term residual samples and the stored distinct normalized long term residual samples;

means for producing normalized reconstructed long term residual samples from the stored distinct normalized long term residual samples at the single memory address of the closest match;

means for producing a reconstructed long term residual samples from the normalized reconstructed long term residually samples;

means for inverse pitch filtering the reconstructed long term residual samples based upon the pitch predictor gain and the lag component and for producing therefrom the reconstructed voice samples; and

output means for providing the distinct memory address, the pitch predictor gain, the lag component and the peak amplitude for each voice subsegment.

View all claims

0 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A system for performing voice compression which includes, a conversion means for converting analog voice signals into discrete samples of digital voice data and collecting the discrete samples into segments, with a means for dividing the segments into subsegments and for producing therefrom a current voice subsegment. A pitch prediction means is used for determining the long term predicted gain of the current voice subsegment by comparing the current voice subsegment to reconstructed voice samples to produce a pitch predictor gain and a lag component and a pitch filter means is used for pitch filtering the current voice subsegment based on the pitch predictor gain and the lag component which then produces long term residual samples. The system includes a means for scaling the long term residual samples based on the peak amplitude of the long term residual samples to produce normalized long term residual samples and a means including a code book stored in a memory for locating a single memory address of a closest match between the normalized long term residual samples and the stored distinct normalized long term residual samples. The system includes an output means for providing the distinct memory address, the pitch predictor gain, the lag component and the peak amplitude for each voice subsegment and a means to reconstruct these samples based on these parameters.

172 Citations

2 Claims

1. A system for performing voice compression, comprising:
- voice interface means including a voice input device for receiving analog voice signals from a user;
  
  conversion means for converting the analog voice signals into discrete samples of digital voice data and collecting the discrete samples into segments;
  
  means for dividing the segments into subsegments and for producing therefrom a current voice subsegment;
  
  pitch prediction means for determining the long term predicted gain of the current voice subsegment by comparing the current voice subsegment to reconstructed voice samples to produce a pitch predictor gain and a lag component;
  
  pitch filter means for pitch filtering the current voice subsegment based on the pitch predictor gain and the lag component and for producing therefrom long term residual samples;
  
  means for determining the peak amplitude of the long term residual samples;
  
  means for scaling the long term residual samples based on the peak amplitude to produce normalized long term residual samples;
  
  means including a code book stored in a memory for comparing the normalized long term residual samples to stored distinct normalized long term residual samples stored in the memory and for locating a single memory address of a closest match between the normalized long term residual samples and the stored distinct normalized long term residual samples;
  
  means for producing normalized reconstructed long term residual samples from the stored distinct normalized long term residual samples at the single memory address of the closest match;
  
  means for producing a reconstructed long term residual samples from the normalized reconstructed long term residually samples;
  
  means for inverse pitch filtering the reconstructed long term residual samples based upon the pitch predictor gain and the lag component and for producing therefrom the reconstructed voice samples; and
  
  output means for providing the distinct memory address, the pitch predictor gain, the lag component and the peak amplitude for each voice subsegment.

2. A method for compression speech information, comprising the steps of:
- receiving an analog speech signal;
  
  digitizing the speech signal into a discrete samples of distal voice data and collecting the discrete samples into segments;
  
  dividing the segments into subsegments and producing at least one current subsegment;
  
  comparing the current subsegment to a reconstructed voice sample;
  
  determining the pitch predictor gain of the current subsegment;
  
  determining the lag component of the current subsegment;
  
  pitch filtering the current subsegment using the pitch predictor gain and the lag component and producing long term residual samples;
  
  detecting the amplitude of the long term predicted residual samples;
  
  scaling the long term predicted residual signal to produce a normalized long term predicted residual samples;
  
  comparing the normalized long term predicted residual samples to stored distinct normalized long term predicted residual samples stored in a memory, and for locating a single memory address of the closest match between the distinct brutalized long term predicted residual samples and the stored distinct normalized long term predicted residual samples;
  
  producing normalized reconstructed long term residual samples from the stored distinct normalized long term residual samples at the, single memory address of the closest match;
  
  producing reconstructed long term residual samples from the normalized reconstructed long term residual samples; and
  
  inverse pitch filtering the reconstructed long term residual samples based upon the pitch predictor gain and the lag component and producing therefrom the reconstructed voice samples.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Multi-Tech Systems Incorporated
Original Assignee
Multi-Tech Systems Incorporated
Inventors
Young, Steve, Maitra, Sidhartha, Thanawala, Ashish
Primary Examiner(s)
MacDonald, Allen R.
Assistant Examiner(s)
ONKA, THOMAS

Application Number

US08/289,297
Time in Patent Office

880 Days
Field of Search

395/2.28, 395/2.29, 395/2.3
US Class Current

704/220
CPC Class Codes

G10L 19/04   using predictive techniques

H04L 12/6418   Hybrid transport

H04L 2012/6424   Access arrangements

H04L 2012/6427   Subscriber Access Module; C...

H04L 2012/6459   Multiplexing, e.g. TDMA, CDMA

H04L 2012/6475   N-ISDN, Public Switched Tel...

H04L 2012/6481   Speech, voice

H04L 2012/6491   Echo cancellation

H04L 2012/6494   Silence suppression

H04L 27/3416   in which the information is...

H04L 65/1069   Session establishment or de...

H04L 65/1101   Session protocols

H04L 65/762   at the source reformatting...

H04L 65/80   Responding to QoS

H04L 69/04   Protocols for data compress...

H04L 9/40   Network security protocols

H04M 1/247   Telephone sets including us...

H04M 1/2473   Telephone terminals interfa...

H04M 1/2477   for selecting a function fr...

H04M 1/253   Telephone sets using digita...

H04M 1/57 : Arrangements for indicating...

H04M 1/6033 : for providing handsfree use...

H04M 1/6505 : storing speech in digital form

H04M 1/663 : Preventing unauthorised cal...

H04M 11/06 : Simultaneous speech and dat...

H04M 11/064 : Data transmission during pa...

H04M 11/068 : using time division multipl...

H04M 2203/4509 : Unified messaging with sing...

H04M 3/002 : Applications of echo suppre...

H04M 3/42 : Systems providing special s...

H04M 3/436 : Arrangements for screening ...

H04M 3/5307 : for recording messages comp...

H04M 3/533 : Voice mail systems

H04M 3/567 : Multimedia conference systems

H04M 7/0027 : Collaboration services wher...

H04M 7/0057 : Services where the data ser...

H04M 7/006 : Networks other than PSTN/IS...

H04M 7/1295 : Details of dual tone multip...

H04N 1/00204 : with a digital computer or ...

H04N 1/00206 : Transmitting or receiving c...

Y10S 379/908 : Multimedia

View All

Voice compression system and method

First Claim

0 Assignments

0 Petitions

Accused Products

Abstract

172 Citations

2 Claims

Specification

Solutions

Use Cases

Quick Links

Voice compression system and method

First Claim

0 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

172 Citations

2 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links