Perceptual speech coder and method

US 5,706,392 A
Filed: 06/01/1995
Issued: 01/06/1998
Est. Priority Date: 06/01/1995
Status: Expired due to Term

Abstract

Simultaneous and temporal masking of digital speech data is applied to an MBE-based speech coding technique to achieve additional, substantial compression of coded speech over existing coding techniques, while enabling synthesis of coded speech with minimal perceptual degradation relative to the human auditory system. A real-time perceptual coder and decoder is disclosed in which speech may be sampled at 10 kHz, coded at an average rate of less than 2 bits/sample, and reproduced in a manner that is perceptually transparent to a human listener. The coder compresses speech segments that are inaudible due to simultaneous or temporal masking, while audible speech segments are not compressed.
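The rate figures in the abstract imply an upper bound on the coded bit rate. The following is a minimal arithmetic sketch, not part of the patent text. The comparison against 16-bit linear PCM is an assumption added for illustration; the abstract does not state an uncompressed word size.

```python
# Figures from the abstract: 10 kHz sampling, under 2 bits/sample on average.
SAMPLE_RATE_HZ = 10_000
MAX_AVG_BITS_PER_SAMPLE = 2

# Assumption for comparison only: uncompressed 16-bit linear PCM.
PCM_BITS_PER_SAMPLE = 16


def bit_rate_bps(sample_rate_hz: int, bits_per_sample: float) -> float:
    """Bit rate in bits per second for a given sampling rate and word size."""
    return sample_rate_hz * bits_per_sample


coded_upper_bound = bit_rate_bps(SAMPLE_RATE_HZ, MAX_AVG_BITS_PER_SAMPLE)
pcm_rate = bit_rate_bps(SAMPLE_RATE_HZ, PCM_BITS_PER_SAMPLE)
min_ratio = pcm_rate / coded_upper_bound

print(f"coded rate is below {coded_upper_bound} bit/s")
print(f"ratio versus 16-bit PCM is at least {min_ratio}:1")
```

Because the abstract gives "less than 2 bits/sample" as an average, the computed rate is a ceiling rather than an exact value.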

5 Claims

1. A method for coding an analog speech signal, said method comprising the steps of:
- filtering, sampling, and digitizing said analog speech signal to produce a digital speech signal, said digital speech signal comprising a plurality of frames;
- performing frequency analysis on said digital speech signal to produce spectral output data for each of said frames, said spectral output data comprising segments, at least two of said segments being approximately 25 Hz or closer in frequency;
- performing auditory analysis on said spectral output data to identify segments of said frames that are inaudible to the human auditory system due to simultaneous or temporal masking effects; and
- coding said spectral output data into an output data stream in which said inaudible segments are compressed and audible segments are not compressed.

Dependent claims (3):
- 3. The method of claim 1, wherein said frequency analysis comprises MBE coding.
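The steps of claim 1 can be illustrated with a toy pipeline. This is a sketch under stated assumptions, not the patented method: it uses a plain DFT rather than MBE analysis, it models only simultaneous masking (not temporal), and the masking rule (`neighborhood`, `margin_db`) and the use of `None` as the "compressed" marker are hypothetical choices made for illustration. The 400-sample frame length follows from the 10 kHz sampling rate in the abstract and the 25 Hz segment spacing in the claim.

```python
import cmath
import math

SAMPLE_RATE_HZ = 10_000                          # from the abstract
SEGMENT_SPACING_HZ = 25                          # from claim 1
FRAME_LEN = SAMPLE_RATE_HZ // SEGMENT_SPACING_HZ  # 400 samples gives 25 Hz bins


def magnitude_spectrum(frame):
    """Naive DFT magnitudes for bins 0..N/2; bin spacing is fs/N."""
    n = len(frame)
    mags = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(frame))
        mags.append(abs(acc))
    return mags


def masked_bins(mags, neighborhood=4, margin_db=20.0):
    """Toy simultaneous-masking rule (an assumption, not the patent's model):
    a bin is masked when some bin within `neighborhood` bins of it is
    louder by more than `margin_db`."""
    eps = 1e-12  # avoids log10(0) on empty bins
    levels = [20 * math.log10(m + eps) for m in mags]
    masked = []
    for k, level in enumerate(levels):
        lo = max(0, k - neighborhood)
        hi = min(len(levels), k + neighborhood + 1)
        loudest = max(levels[j] for j in range(lo, hi) if j != k)
        masked.append(loudest - level > margin_db)
    return masked


def code_frame(mags, masked):
    """Keep audible bins verbatim; replace masked bins with None
    (a hypothetical stand-in for 'compressed')."""
    return [None if is_masked else m for m, is_masked in zip(mags, masked)]


# Usage: a strong 1000 Hz tone (bin 40) next to a 1025 Hz tone (bin 41)
# that is 40 dB weaker, so the two segments are 25 Hz apart.
frame = [
    math.sin(2 * math.pi * 1000 * i / SAMPLE_RATE_HZ)
    + 0.01 * math.sin(2 * math.pi * 1025 * i / SAMPLE_RATE_HZ)
    for i in range(FRAME_LEN)
]
mags = magnitude_spectrum(frame)
masked = masked_bins(mags)
coded = code_frame(mags, masked)
print("bin 40 masked:", masked[40], "| bin 41 masked:", masked[41])
```

A real coder would derive its thresholds from psychoacoustic masking data and would spend fewer bits on masked segments rather than necessarily discarding them; the sketch only shows where such a decision sits in the data flow.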

2. A coder for coding a speech signal comprising a masking segment and a masked segment approximately 25 Hz or closer in frequency to said masking segment, said coder comprising:
- storage means for storing first application software, second application software, and masking data;
- a first processor connected to said storage means for using said first application software to generate spectral data for said speech signal; and
- a second processor connected to said storage means and said first processor for using said second application software, said masking data, and said spectral data to create a coded representation of said speech signal wherein said masked segment is compressed and said masking segment is not compressed.

Dependent claims (4, 5):
- 4. The coder of claim 2 wherein one integrated circuit includes said first processor and said second processor.
- 5. The coder of claim 4 wherein said first application software includes MBE coding to generate said spectral data.
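The data flow recited in claim 2 (shared storage feeding two processing stages) can be sketched as a small structure. All names here (`Storage`, `Coder`, `toy_analysis`, `toy_coding`, the `"ratio"` key) are hypothetical, and the two toy functions are placeholders: they are not MBE analysis or a real masking model, and they only show which stage consumes which stored item.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional

Spectrum = List[float]
Coded = List[Optional[float]]


@dataclass
class Storage:
    """Stand-in for the claimed storage means."""
    analysis_software: Callable[[List[float]], Spectrum]       # "first application software"
    coding_software: Callable[[Spectrum, Dict[str, float]], Coded]  # "second application software"
    masking_data: Dict[str, float] = field(default_factory=dict)


@dataclass
class Coder:
    storage: Storage

    def first_processor(self, speech: List[float]) -> Spectrum:
        # Uses only the first application software.
        return self.storage.analysis_software(speech)

    def second_processor(self, spectrum: Spectrum) -> Coded:
        # Uses the second application software, the masking data,
        # and the spectral data produced by the first processor.
        return self.storage.coding_software(spectrum, self.storage.masking_data)

    def code(self, speech: List[float]) -> Coded:
        return self.second_processor(self.first_processor(speech))


def toy_analysis(samples: List[float]) -> Spectrum:
    """Placeholder, NOT a frequency analysis: returns absolute values."""
    return [abs(s) for s in samples]


def toy_coding(spectrum: Spectrum, masking_data: Dict[str, float]) -> Coded:
    """Placeholder rule: entries below ratio * peak are marked None."""
    threshold = masking_data["ratio"] * max(spectrum)
    return [None if value < threshold else value for value in spectrum]


coder = Coder(Storage(toy_analysis, toy_coding, {"ratio": 0.01}))
result = coder.code([1.0, -0.001, 0.5])
print(result)
```

Claim 4 places both processors on one integrated circuit; in this sketch that corresponds to both stages living in the same `Coder` object, though the claim itself is about hardware rather than software structure.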

Current Assignee: Grove Hydrogen Cells LLC (Intellectual Ventures LLC)
Original Assignee: Rutgers University
Inventors: Flanagan, James L.; Goldberg, Randy G.
Primary Examiner(s): Safourek, Benedict V.
Application Number: US08/457,517
Time in Patent Office: 950 Days
Field of Search: 395/2.17, 395/2.19, 395/2.23, 395/2.24, 395/2.35, 395/2.36, 395/2.38, 455/72, 341/87, 341/76
US Class Current: 704/200.1
CPC Class Codes: G10L 19/087 using mixed excitation mode...
