Multiple mode variable rate speech coding

US 6,691,084 B2
Filed: 12/21/1998
Issued: 02/10/2004
Est. Priority Date: 12/21/1998
Status: Expired due to Term

Abstract

A method and apparatus for the variable rate coding of a speech signal. An input speech signal is classified, and an appropriate coding mode is selected based on this classification. For each classification, the coding mode that achieves the lowest bit rate with an acceptable quality of speech reproduction is selected. Low average bit rates are achieved by employing high fidelity modes (i.e., high bit rate, broadly applicable to different types of speech) only during portions of the speech where this fidelity is required for acceptable output. Lower bit rate modes are used during portions of speech where these modes produce acceptable output. The input speech signal is classified into active and inactive regions. Active regions are further classified into voiced, unvoiced, and transient regions. Various coding modes are applied to active speech, depending upon the required level of fidelity. Coding modes may be utilized according to the strengths and weaknesses of each particular mode. The apparatus dynamically switches between these modes as the properties of the speech signal vary with time. Where appropriate, regions of speech are modeled as pseudo-random noise, resulting in a significantly lower bit rate. This coding is used in a dynamic fashion whenever unvoiced speech or background noise is detected.
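The selection logic described in the abstract can be sketched as a lookup from frame class to coding mode. This is only an illustration under stated assumptions, not the patent's implementation: the mode names and bit rates below are hypothetical placeholders, chosen to show how reserving the high-rate mode for frames that need it lowers the average rate.

```python
from enum import Enum


class FrameClass(Enum):
    INACTIVE = "inactive"
    VOICED = "voiced"
    UNVOICED = "unvoiced"
    TRANSIENT = "transient"


# Hypothetical (mode name, bits per second) pairs; not taken from the patent.
MODE_TABLE = {
    FrameClass.INACTIVE: ("noise_model", 1000),
    FrameClass.UNVOICED: ("noise_model", 2000),
    FrameClass.VOICED: ("periodic_model", 4000),
    FrameClass.TRANSIENT: ("high_fidelity", 8000),
}


def select_mode(frame_class):
    """Return the (mode name, bit rate) assigned to a frame class."""
    return MODE_TABLE[frame_class]


def average_bit_rate(frame_classes):
    """Mean bit rate over a sequence of equally long frames."""
    rates = [select_mode(c)[1] for c in frame_classes]
    return sum(rates) / len(rates)
```

With these placeholder rates, a sequence of two inactive frames, one voiced frame, and one transient frame averages 3500 bits per second, well below the 8000 of the high fidelity mode alone.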

4 Claims

1. A method for the variable rate coding of a speech signal, comprising:
- classifying the speech signal as either active or inactive, wherein classifying speech as active or inactive comprises a two energy band based thresholding scheme;
  
  classifying said active speech into one of a plurality of types of active speech, wherein said plurality of types of active speech include voiced, unvoiced, and transient speech;
  
  selecting an encoder mode based on whether the speech signal is active or inactive, and if active, based further on said type of active speech, wherein said selected encoder mode is characterized by either a coding bit rate or a coding algorithm, or by a coding bit rate and a coding algorithm; and
  
  encoding the speech signal according to said encoder mode, forming an encoded speech signal.
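Claim 1 names a two energy band based thresholding scheme without fixing its details. The following is a minimal sketch of one way such a scheme could look. The band split at 2000 Hz, the fixed thresholds, the 8000 Hz sample rate, and the rule that a frame is active when either band exceeds its threshold are all assumptions made for illustration.

```python
import math


def band_energy(frame, sample_rate, f_lo, f_hi):
    """One-sided spectral energy of `frame` in [f_lo, f_hi) Hz.

    Uses a direct DFT, which is adequate for short frames. The value is
    scaled by 1/n and is only meant to be compared against thresholds.
    """
    n = len(frame)
    energy = 0.0
    for k in range(n // 2 + 1):
        freq = k * sample_rate / n
        if f_lo <= freq < f_hi:
            re = sum(x * math.cos(2 * math.pi * k * i / n) for i, x in enumerate(frame))
            im = sum(x * math.sin(2 * math.pi * k * i / n) for i, x in enumerate(frame))
            energy += re * re + im * im
    return energy / n


def is_active(frame, sample_rate=8000, split_hz=2000.0, thresholds=(1.0, 1.0)):
    """Assumed rule: active if the low OR the high band exceeds its threshold."""
    e_low = band_energy(frame, sample_rate, 0.0, split_hz)
    # The upper edge is nudged past Nyquist so the Nyquist bin is included.
    e_high = band_energy(frame, sample_rate, split_hz, sample_rate / 2 + 1)
    return e_low > thresholds[0] or e_high > thresholds[1]
```

A practical detector would likely adapt its thresholds to the background noise level; fixed values are used here only to keep the sketch short.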

2. A method for the variable rate coding of a speech signal, comprising:
- classifying the speech signal as either active or inactive, wherein classifying speech as active or inactive comprises classifying the next M frames as active if the previous N_ho frames were classified as active;
  
  classifying said active speech into one of a plurality of types of active speech, wherein said plurality of types of active speech include voiced, unvoiced, and transient speech;
  
  selecting an encoder mode based on whether the speech signal is active or inactive, and if active, based further on said type of active speech, wherein said selected encoder mode is characterized by either a coding bit rate or a coding algorithm, or by a coding bit rate and a coding algorithm; and
  
  encoding the speech signal according to said encoder mode forming an encoded speech signal.
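The hangover rule in claim 2 (the next M frames are classified as active if the previous N_ho frames were classified as active) can be sketched as a pass over per-frame decisions. This is an illustrative reading, not the patented implementation: it assumes the "previous N_ho frames" test is applied to the raw, pre-hangover decisions, so that hangover frames do not extend themselves indefinitely. The function name `apply_hangover` is hypothetical.

```python
def apply_hangover(raw_decisions, n_ho, m):
    """Extend activity by m frames after a run of at least n_ho raw-active frames.

    raw_decisions: iterable of booleans, True meaning the frame was
    initially classified as active.
    """
    smoothed = []
    run = 0    # consecutive raw-active frames immediately before this frame
    hang = 0   # frames (including this one) still forced to active
    for raw in raw_decisions:
        if run >= n_ho:
            # The previous n_ho frames were active: force the next m frames.
            hang = m
        smoothed.append(bool(raw) or hang > 0)
        if hang > 0:
            hang -= 1
        run = run + 1 if raw else 0
    return smoothed
```

With `n_ho=2` and `m=2`, three active frames followed by silence stay active for two extra frames, while an isolated single active frame gets no extension.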

3. A variable rate coding system for coding a speech signal, comprising:
- classification means for classifying the speech signal as active or inactive based on a two energy band thresholding scheme, and if active, for classifying the active speech as one of a plurality of types of active speech; and
  
  a plurality of encoding means for encoding the speech signal as an encoded speech signal, wherein said encoding means are dynamically selected to encode the speech signal based on whether the speech signal is active or inactive, and if active, based further on said type of active speech.

4. A variable rate coding system for coding a speech signal, comprising:
- classification means for classifying the speech signal as active or inactive, wherein said classification means classifies the next M frames as active if the previous N_ho frames were classified as active, and if active, for classifying the active speech as one of a plurality of types of active speech; and
  
  a plurality of encoding means for encoding the speech signal as an encoded speech signal, wherein said encoding means are dynamically selected to encode the speech signal based on whether the speech signal is active or inactive, and if active, based further on said type of active speech.

Current Assignee: Qualcomm, Inc.
Original Assignee: Qualcomm, Inc.
Inventors: Gardner, William; Manjunath, Sharath
Primary Examiner(s): Dorvil, Richemond
Assistant Examiner(s): Nolan, Daniel A
Application Number: US09/217,341
Publication Number: US 20020099548A1
Time in Patent Office: 1,877 Days
Field of Search: 704/219, 704/220-221, 704/226-228, 704/230, 704/206, 704/214, 704/205
US Class Current: 704/221
CPC Class Codes:
G10L 19/20   using sound class specific ...
G10L 19/24   Variable rate codecs, e.g. ...
G10L 2025/783   based on threshold decision
G10L 2025/935   Mixed voiced class; Transit...
