Method and apparatus for performing packet loss or frame erasure concealment

US 9,336,783 B2
Filed: 11/26/2013
Issued: 05/10/2016
Est. Priority Date: 04/19/1999
Status: Expired due to Term



Abstract

A method for performing packet loss or Frame Erasure Concealment (FEC) for a speech coder receives encoded frames of compressed speech information transmitted from an encoder. The method determines whether an encoded frame has been lost, corrupted in transmission, or erased, synthesizes properly received frames, and decides on an overlap-add window to use in combining a portion of the synthesized speech signal with a subsequent speech signal resulting from a received and decoded packet, where the size of the overlap-add window is based on the unavailability of packets. If it is determined that an encoded frame has been lost, corrupted in transmission, or erased, the method performs an overlap-add operation on the portion of the synthesized speech signal and the subsequent speech signal, using the decided-on overlap-add window.
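The overlap-add operation described in the abstract can be illustrated with a minimal sketch. It assumes linear (triangular) cross-fade ramps, which the abstract does not specify, and the function names `overlap_add` and `window_length` are hypothetical. The rule that lengthens the window as more frames go missing, and its numeric defaults, are illustrative placeholders and are not taken from the patent.

```python
def overlap_add(fade_out, fade_in):
    """Cross-fade two equal-length sample lists with linear ramps.

    `fade_out` is the tail of the synthesized signal, `fade_in` is the
    start of the newly decoded signal (both assumptions of this sketch).
    """
    if len(fade_out) != len(fade_in):
        raise ValueError("segments must have equal length")
    n = len(fade_out)
    mixed = []
    for i in range(n):
        w_in = (i + 1) / (n + 1)   # rising weight for the incoming signal
        w_out = 1.0 - w_in         # falling weight for the outgoing signal
        mixed.append(w_out * fade_out[i] + w_in * fade_in[i])
    return mixed


def window_length(lost_frames, base=30, step=32, limit=80):
    """Hypothetical rule: a longer erasure gets a longer window, up to a cap.

    All three defaults are placeholder values chosen for illustration.
    """
    return min(base + step * max(lost_frames - 1, 0), limit)
```

Because the two weights always sum to one, a signal that is identical on both sides of the boundary passes through the cross-fade unchanged.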

38 Citations

14 Claims

1. A method for processing packets representing encoded speech of a speech signal, comprising:
- determining, by a receiver, a first packet of the packets is an expected packet, wherein an expected packet comprises a packet that is not lost, corrupted, erased or delayed;
  
  decoding, by the receiver, the first packet to create a plurality of speech samples in a buffer;
  
  delaying, by the receiver, the plurality of speech samples by a delay period;
  
  sending, by the receiver, the delayed plurality of speech samples to an output port; and
  
  when the determining further determines that a second packet of the packets is an unexpected packet, wherein an unexpected packet comprises a packet that is lost, corrupted, erased or delayed, computing an estimated pitch period, using a most recent 20 msec of the plurality of speech samples in the buffer, wherein the estimated pitch period is computed using a 2:1 decimated signal of the most recent 20 msec of the plurality of speech samples; and
  
  using the estimated pitch period to select a portion of the plurality of speech samples to generate a synthesized speech segment.
- - 2. The method of claim 1, wherein the delay period corresponds to one quarter of a longest expected pitch period.
  - 3. The method of claim 2, wherein the one quarter of the longest expected pitch period comprises 30 speech samples.
  - 4. The method of claim 1, wherein the synthesized speech segment is generated by performing an overlap add process on a boundary between the portion and an overlap add segment, wherein the overlap add segment corresponds to a most recent one quarter of the estimated pitch period of the plurality of speech samples in the buffer.
  - 5. The method of claim 1, wherein the computing of the estimated pitch period determines a rough peak of the estimated pitch period using the 2:1 decimated signal.
  - 6. The method of claim 5, wherein the computing of the estimated pitch period further performs a fine search in a vicinity of the rough peak.
  - 7. The method of claim 1, wherein the delay period comprises 3.75 msec.
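The rough-then-fine pitch search of claims 1, 5 and 6 can be sketched as follows. Several details are assumptions of this sketch and are not stated in the claims: an 8 kHz sampling rate (consistent with claims 3 and 7, since 30 samples in 3.75 msec implies 8000 samples per second), a shortest candidate pitch of 40 samples, the use of normalized cross-correlation as the similarity measure, and decimation by plain subsampling without a low-pass filter. The longest candidate of 120 samples follows from claims 2 and 3 (four times 30 samples). The names `estimate_pitch` and `_ncc` are hypothetical.

```python
import math

SAMPLE_RATE = 8000                   # assumed sampling rate in Hz
WINDOW = SAMPLE_RATE * 20 // 1000    # most recent 20 msec -> 160 samples
MIN_PITCH, MAX_PITCH = 40, 120       # MIN_PITCH is an assumed lower bound


def _ncc(hist, lag, step):
    """Correlate the last WINDOW samples with the same window shifted
    `lag` samples into the past, visiting every `step`-th sample and
    normalizing by the energy of the shifted window."""
    start = len(hist) - WINDOW
    num = energy = 0.0
    for i in range(start, len(hist), step):
        num += hist[i] * hist[i - lag]
        energy += hist[i - lag] ** 2
    return num / math.sqrt(energy) if energy > 0.0 else 0.0


def estimate_pitch(hist):
    """Return an estimated pitch period in samples."""
    if len(hist) < WINDOW + MAX_PITCH:
        raise ValueError("need at least WINDOW + MAX_PITCH samples of history")
    # Rough search: even lags only, correlation on the 2:1 subsampled signal.
    rough = max(range(MIN_PITCH, MAX_PITCH + 1, 2),
                key=lambda lag: _ncc(hist, lag, 2))
    # Fine search: full-rate correlation in the vicinity of the rough peak.
    lo = max(MIN_PITCH, rough - 1)
    hi = min(MAX_PITCH, rough + 1)
    return max(range(lo, hi + 1), key=lambda lag: _ncc(hist, lag, 1))
```

The rough pass evaluates half as many lags on half as many samples, so it costs roughly a quarter of a full-rate search; the fine pass then only needs to check the lags adjacent to the rough peak.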

8. A receiver for processing packets representing encoded speech of a speech signal, comprising:
- a lost frame detector module for determining a first packet of the packets is an expected packet, wherein an expected packet comprises a packet that is not lost, corrupted, erased or delayed;
  
  a decoder module for decoding the first packet to create a plurality of speech samples to be stored in a buffer;
  
  a delay module for delaying the plurality of speech samples by a delay period, and for sending the plurality of speech samples that is delayed to an output port; and
  
  when the lost frame detector module further determines that a second packet of the packets is an unexpected packet, wherein an unexpected packet comprises a packet that is lost, corrupted, erased or delayed, a frame erasure concealment module for computing an estimated pitch period, using a most recent 20 msec of the plurality of speech samples in the buffer, wherein the estimated pitch period is computed using a 2:1 decimated signal of the most recent 20 msec of the plurality of speech samples, and using the estimated pitch period to select a portion of the plurality of speech samples to generate a synthesized speech segment.
- - 9. The receiver of claim 8, wherein the delay period corresponds to one quarter of a longest expected pitch period.
  - 10. The receiver of claim 9, wherein the one quarter of the longest expected pitch period comprises 30 speech samples.
  - 11. The receiver of claim 8, wherein the synthesized speech segment is generated by performing an overlap add process on a boundary between the portion and an overlap add segment, wherein the overlap add segment corresponds to a most recent one quarter of the estimated pitch period of the plurality of speech samples in the buffer.
  - 12. The receiver of claim 8, wherein the estimated pitch period is computed by determining a rough peak of the estimated pitch period using the 2:1 decimated signal.
  - 13. The receiver of claim 12, wherein the estimated pitch period is computed by further performing a fine search in a vicinity of the rough peak.
  - 14. The receiver of claim 8, wherein the delay period comprises 3.75 msec.
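One way to picture how the modules of claim 8 fit together is the sketch below. It is an illustration under stated assumptions, not the patented implementation: the class name `ReceiverSketch`, the convention that `None` stands for an unexpected packet, the 390-sample history length, and the injected `decode` and `pitch_estimator` callables are all hypothetical. The 30-sample delay comes from claims 10 and 14. For brevity the sketch simply repeats the last pitch period and omits the overlap-add smoothing of claim 11.

```python
from collections import deque

DELAY = 30  # samples; 3.75 msec at an assumed 8 kHz sampling rate


class ReceiverSketch:
    def __init__(self, decode, pitch_estimator, history_len=390):
        self.decode = decode                      # decoder module (injected)
        self.pitch_estimator = pitch_estimator    # concealment helper (injected)
        self.history = deque(maxlen=history_len)  # buffer of recent samples
        self.delay_line = deque([0.0] * DELAY)    # delay module, starts silent

    def _emit(self, samples):
        """Store samples in the buffer and pass them through the delay."""
        out = []
        for s in samples:
            self.history.append(s)
            self.delay_line.append(s)
            out.append(self.delay_line.popleft())
        return out

    def receive(self, packet, frame_len=80):
        """Return the delayed output for one frame slot."""
        if packet is not None:
            # Expected packet: decode and send through the delay.
            return self._emit(self.decode(packet))
        # Unexpected packet: synthesize by repeating the last pitch period.
        hist = list(self.history)
        pitch = self.pitch_estimator(hist)
        period = hist[-pitch:]
        synth = [period[i % pitch] for i in range(frame_len)]
        return self._emit(synth)
```

A short usage example, with an identity decoder and a fixed pitch of 4 samples standing in for a real codec and pitch estimator:

```python
rx = ReceiverSketch(decode=list, pitch_estimator=lambda hist: 4)
first = rx.receive([1, 2, 3, 4] * 10)     # 30 samples of silence, then 10 decoded samples
second = rx.receive(None, frame_len=8)    # delayed real samples come out while synthesis fills the buffer
```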


Current Assignee
AT&T Intellectual Property II LP (AT&T, Inc.)
Original Assignee
AT&T Intellectual Property II LP (AT&T, Inc.)
Inventors
Kapilow, David A.
Primary Examiner(s)
ABEBE, DANIEL DEMELASH

Application Number
US14/091,185
Publication Number
US 20140088957A1
Time in Patent Office
896 Days
Field of Search
704/500
US Class Current
1/1
CPC Class Codes
G10L 19/0017   Lossless audio signal codin...
G10L 19/005   Correction of errors induce...
G10L 19/028   Noise substitution, i.e. su...
G10L 21/003   Changing voice quality, e.g...
