Mode-specific method and apparatus for encoding signals containing speech

US 5,596,676 A
Filed: 10/11/1995
Issued: 01/21/1997
Est. Priority Date: 06/01/1992
Status: Expired due to Term

Abstract

A method for encoding a signal that includes a speech component is described. First and second linear prediction windows of a frame are analyzed to generate sets of filter coefficients. First and second pitch analysis windows of the frame are analyzed to generate pitch estimates. The frame is classified in one of at least two modes, e.g. voiced, unvoiced and noise modes, based, for example, on pitch stationarity, short-term level gradient or zero crossing rate. Then the frame is encoded using the filter coefficients and pitch estimates in a particular manner depending upon the mode determination for the frame, preferably employing CELP based encoding algorithms.
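
As a rough illustration of the classification measures named in the abstract, the Python sketch below computes a zero-crossing rate, a short-term level gradient, and a pitch stationarity check for one frame. The formulas, the function names, the subframe count `n_sub`, and the tolerance `max_lag_delta` are assumptions made for illustration; this excerpt does not state how the patent computes these measures.

```python
import math


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ (assumed definition)."""
    if len(frame) < 2:
        return 0.0
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def short_term_level_gradient(frame, n_sub=4):
    """Largest change in level (dB) between consecutive subframes (assumed definition)."""
    sub_len = len(frame) // n_sub
    if n_sub < 2 or sub_len == 0:
        raise ValueError("need at least two non-empty subframes")
    levels = []
    for i in range(n_sub):
        sub = frame[i * sub_len:(i + 1) * sub_len]
        energy = sum(x * x for x in sub) / sub_len
        # A small floor avoids log10(0) on silent subframes.
        levels.append(10.0 * math.log10(energy + 1e-12))
    return max(abs(b - a) for a, b in zip(levels, levels[1:]))


def pitch_is_stationary(previous_pitch, pitch, max_lag_delta=3):
    """True when the pitch lag moved by at most max_lag_delta samples (hypothetical tolerance)."""
    return abs(pitch - previous_pitch) <= max_lag_delta
```

A classifier built on these measures would compare each one against thresholds; the threshold values are not given in this excerpt and are deliberately left out here.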

24 Claims

1. A method of encoding a signal having a speech component, the signal being organized as a plurality of frames, the method comprising the steps, performed for each frame, of:
- analyzing a first linear prediction window to generate a first set of filter coefficients for a frame;
- analyzing a second linear prediction window to generate a second set of filter coefficients for the frame;
- analyzing a first pitch analysis window to generate a first pitch estimate for the frame;
- analyzing a second pitch analysis window to generate a second pitch estimate for the frame;
- determining whether the frame is one of a first mode, a second mode and a third mode, depending on measures of energy content of the frame and spectral content of the frame;
- encoding the frame, depending on the second set of filter coefficients and the first and the second pitch estimates, independently of the first set of filter coefficients, when the frame is determined to be the third mode;
- encoding the frame, depending on the first and the second sets of filter coefficients, independently of the first and the second pitch estimates, when the frame is determined to be the second mode; and
- encoding the frame, depending on the second set of filter coefficients, independently of the first set of filter coefficients and the first and the second pitch estimates, when the frame is determined to be the first mode.
- Dependent claims (2 to 12):
  - 2. The method of claim 1, wherein the determining step includes the substep of:
    - determining a mode depending on a determined mode of a previous frame.
  - 3. The method of claim 1 wherein the determining step includes the substep of:
    - determining the mode to be the first mode only when the determined mode of a previous frame is either the first mode or the second mode.
  - 4. The method of claim 1, wherein the determining step includes the substep of:
    - determining the mode to be the third mode only when the determined mode of a previous frame is either the third mode or the second mode.
  - 5. The method of claim 1 wherein the determining step further depends on measures of pitch stationarity between the frame and a previous frame.
  - 6. The method of claim 1 wherein the determining step further depends on measures of short-term level gradient within the frame.
  - 7. The method of claim 1 wherein the determining step further depends on measures of a zero-crossing rate within the frame.
  - 8. The encoding method of claim 1, wherein the first linear prediction window is contained within the frame and the second linear prediction window begins during the frame and extends into the next frame.
  - 9. The encoding method of claim 1, wherein the first pitch estimate window is contained within the frame and the second pitch estimate window begins during the frame and extends into the next frame.
  - 10. The encoding method of claim 1, wherein a frame determined to be of a third mode contains a signal with a speech component composed of primarily voiced speech.
  - 11. The encoding method of claim 1, wherein a frame determined to be of a second mode contains a signal with a speech component composed of primarily unvoiced speech.
  - 12. The encoding method of claim 1, wherein a frame determined to be of a first mode contains a signal with a low speech component.
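
The mode-dependent encoding steps of claim 1, together with the transition limits of claims 3 and 4, can be sketched in Python as follows. This is a minimal sketch, not the patented implementation: the names `Mode`, `parameters_for_mode`, and `transition_allowed` are hypothetical, the mode labels in the comments come from claims 10 to 12, and applying claims 3 and 4 together (with no restriction on entering the second mode) is an assumption.

```python
from enum import Enum


class Mode(Enum):
    FIRST = 1   # claim 12: low speech component
    SECOND = 2  # claim 11: primarily unvoiced speech
    THIRD = 3   # claim 10: primarily voiced speech


def parameters_for_mode(mode, lpc1, lpc2, pitch1, pitch2):
    """Return only the analysis results that claim 1 lets each mode depend on."""
    if mode is Mode.THIRD:
        # Second coefficient set plus both pitch estimates; first set unused.
        return {"lpc": (lpc2,), "pitch": (pitch1, pitch2)}
    if mode is Mode.SECOND:
        # Both coefficient sets; pitch estimates unused.
        return {"lpc": (lpc1, lpc2), "pitch": ()}
    # First mode: second coefficient set only.
    return {"lpc": (lpc2,), "pitch": ()}


def transition_allowed(previous, candidate):
    """Combine the limits of claims 3 and 4 (combining them is an assumption)."""
    if candidate is Mode.FIRST:
        return previous in (Mode.FIRST, Mode.SECOND)   # claim 3
    if candidate is Mode.THIRD:
        return previous in (Mode.THIRD, Mode.SECOND)   # claim 4
    return True  # the claims state no limit on entering the second mode
```

Under these two limits a frame sequence cannot jump directly between the first and third modes; it has to pass through at least one second-mode frame.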

13. An encoder for encoding a signal having a speech component, the signal being organized as a plurality of frames, comprising:
- a filter coefficient generator for analyzing a first linear prediction window to generate a first set of filter coefficients for a frame and for analyzing a second linear prediction window to generate a second set of filter coefficients for the frame;
- a pitch estimator for analyzing a first pitch analysis window to generate a first pitch estimate for the frame and analyzing a second pitch analysis window to generate a second pitch estimate for the frame;
- a mode determinator for determining whether the frame is one of a first mode, a second mode and a third mode, depending on measures of energy content of the frame and spectral content of the frame; and
- a frame encoder for encoding the frame depending on the determined mode of the frame, wherein
  - a frame determined to be of a third mode is encoded depending on the second set of filter coefficients and the first and the second pitch estimates, independently of the first set of filter coefficients,
  - a frame determined to be of a second mode is encoded depending on the first and the second sets of filter coefficients, independently of the first and the second pitch estimates, and
  - a frame determined to be of a first mode is encoded depending on the second set of filter coefficients, independently of the first set of filter coefficients and the first and the second pitch estimates.
- Dependent claims (14 to 24):
  - 14. The encoder of claim 13, wherein the mode determinator determines the mode depending on a determined mode of a previous frame.
  - 15. The encoder of claim 13, wherein the mode determinator determines the frame to be of the first mode only when the determined mode of a previous frame is either the first mode or the second mode.
  - 16. The encoder of claim 13, wherein the mode determinator determines the frame to be of the third mode only when the determined mode of a previous frame is either the third mode or the second mode.
  - 17. The encoder of claim 13 wherein the mode determinator further depends on measures of pitch stationarity between the frame and a previous frame.
  - 18. The encoder of claim 13 wherein the mode determinator further depends on measures of short-term level gradient within the frame.
  - 19. The encoder of claim 13 wherein the mode determinator further depends on measures of a zero-crossing rate within the frame.
  - 20. The encoder of claim 13, wherein the first linear prediction window is contained within the frame and the second linear prediction window begins during the frame and extends into the next frame.
  - 21. The encoder of claim 13, wherein the first pitch estimate window is contained within the frame and the second pitch estimate window begins during the frame and extends into the next frame.
  - 22. The encoder of claim 13, wherein a frame determined to be of a third mode contains a signal with a speech component composed of primarily voiced speech.
  - 23. The encoder of claim 13, wherein a frame determined to be of a second mode contains a signal with a speech component composed of primarily unvoiced speech.
  - 24. The encoder of claim 13, wherein a frame determined to be of a first mode contains a signal with a low speech component.
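
The window placement recited in claims 8, 9, 20 and 21 (a first window contained within the frame, a second window that begins during the frame and extends into the next frame) can be illustrated with a small Python helper. The function name `analysis_windows` and the default lengths (320-sample frames, 240-sample windows, 80 samples of look-ahead) are hypothetical values chosen for the example, not figures taken from the patent, and anchoring the first window at the frame start is likewise an assumption.

```python
def analysis_windows(frame_index, frame_len=320, win_len=240, lookahead=80):
    """Return (first, second) windows as half-open sample ranges (start, end).

    first  : lies entirely inside the current frame
    second : starts inside the current frame and ends in the next frame
    """
    if not (0 < win_len <= frame_len and 0 < lookahead < win_len):
        raise ValueError("window lengths do not satisfy the placement in the claims")
    frame_start = frame_index * frame_len
    frame_end = frame_start + frame_len
    # First window: assumed to start at the beginning of the frame.
    first = (frame_start, frame_start + win_len)
    # Second window: ends `lookahead` samples past the end of the frame.
    second = (frame_end + lookahead - win_len, frame_end + lookahead)
    return first, second
```

With the default values, frame 0 covers samples 0 to 319, so the first window spans samples 0 to 239 and the second spans samples 160 to 399, reaching 80 samples into frame 1.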

Current Assignee: Hughes Network Systems LLC (Echostar Corporation)
Original Assignee: Hughes Electronics Corporation (AT&T, Inc.)
Inventors: Swaminathan, Kumar; Ganesan, Kalyan; Gupta, Prabhat K.
Primary Examiner(s): MacDonald, Allen R.
Assistant Examiner(s): Grover, John M
Application Number: US08/540,637
Time in Patent Office: 468 Days
Field of Search: 395/2.17, 395/2.19, 395/2.28, 395/2.32, 395/2.71, 395/2.77
US Class Current: 704/208
CPC Class Codes:
- G10L 19/012   Comfort noise or silence co...
- G10L 19/12   the excitation function bei...
- G10L 19/26   Pre-filtering or post-filte...
- G10L 2019/0002   Codebook adaptations
- G10L 2019/0003   Backward prediction of gain
- G10L 25/09   the extracted parameters be...
- G10L 25/18   the extracted parameters be...
- G10L 25/24   the extracted parameters be...
- G10L 25/90   Pitch determination of spee...
- G10L 25/93   Discriminating between voic...
