Packet loss concealment based on forced waveform alignment after packet loss

US 8,346,546 B2
Filed: 07/31/2007
Issued: 01/01/2013
Est. Priority Date: 08/15/2006
Status: Active Grant

Abstract

A packet loss concealment method and system is described that attempts to reduce or eliminate destructive interference that can occur when an extrapolated waveform representing a lost segment of a speech or audio signal is merged with a good segment after a packet loss. This is achieved by guiding a waveform extrapolation that is performed to replace the bad segment using a waveform available in the first good segment or segments after the packet loss. In another aspect of the invention, a selection is made between a packet loss concealment method that performs the aforementioned guided waveform extrapolation and one that does not. The selection may be made responsive to determining whether the first good segment or segments after the packet loss are available and also to whether a segment preceding the lost segment and the first good segment following the lost segment are deemed voiced.
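The destructive interference mentioned in the abstract can be illustrated with a minimal sketch that overlap-adds an extrapolated waveform with a decoded one. The function names, the linear cross-fade window, and the 40-sample pitch period are assumptions chosen for illustration; none of them comes from the patent text.

```python
import math

def overlap_add(fade_out, fade_in):
    """Cross-fade two equal-length waveforms using linear windows that sum to 1."""
    n = len(fade_out)
    out = []
    for i in range(n):
        w_in = (i + 1) / (n + 1)   # weight of the incoming (decoded) waveform
        w_out = 1.0 - w_in         # weight of the outgoing (extrapolated) waveform
        out.append(w_out * fade_out[i] + w_in * fade_in[i])
    return out

def peak(x):
    """Largest absolute sample value."""
    return max(abs(v) for v in x)

period = 40  # hypothetical pitch period, in samples
n = 40       # hypothetical overlap-add length, in samples

# Decoded waveform of the first good segment after the loss.
decoded = [math.sin(2 * math.pi * i / period) for i in range(n)]
# Extrapolation that happens to be in phase with the decoded waveform.
aligned = list(decoded)
# Extrapolation that is half a pitch period out of phase.
misaligned = [math.sin(2 * math.pi * (i + period // 2) / period) for i in range(n)]

peak_aligned = peak(overlap_add(aligned, decoded))
peak_misaligned = peak(overlap_add(misaligned, decoded))
```

With these inputs the in-phase case keeps its full amplitude, while the out-of-phase case is partly cancelled inside the overlap region. Aligning the extrapolated waveform with the decoded waveform before merging is meant to avoid that cancellation.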

18 Claims

1. A method for concealing a lost segment in a speech or audio signal that comprises a series of segments, the method comprising:
- (a) generating an extrapolated waveform based on a segment that precedes the lost segment in the series of segments and on one or more segments that follow the lost segment in the series of segments;
  
  (b) generating a replacement waveform for the lost segment based on a first portion of the extrapolated waveform; and
  
  (c) overlap-adding a second portion of the extrapolated waveform with a decoded waveform associated with the one or more segments following the lost segment in the series of segments;
  
  wherein step (a) comprises:
  
  performing a first-pass periodic waveform extrapolation using a pitch period associated with the segment that precedes the lost segment to generate a first-pass extrapolated waveform;
  
  identifying a time lag between the first-pass extrapolated waveform and the decoded waveform associated with the one or more segments that follow the lost segment;
  
  calculating a pitch contour based on the identified time lag; and
  
  performing a second-pass periodic waveform extrapolation using the pitch contour to generate the extrapolated waveform; and
  
  wherein at least one of steps (a), (b) and (c) is performed by a processor.
- Dependent claims (2, 3, 4, 5, 6, 7, 8):
  - 2. The method of claim 1, wherein identifying the time lag between the first-pass extrapolated waveform and the decoded waveform associated with the one or more segments that follow the lost segment comprises:
    - locating a peak of an energy-normalized cross-correlation function between the first-pass extrapolated waveform and the decoded waveform associated with the one or more segments that follow the lost segment.
  - 3. The method of claim 1, wherein calculating the pitch contour comprises determining an amount of pitch period change per sample.
  - 4. The method of claim 3, wherein determining the amount of pitch period change per sample comprises calculating:
  - 5. The method of claim 1, further comprising:
    - determining if the one or more segments that follow the lost segment are available; and
      
      performing steps (a), (b) and (c) responsive only to a determination that the one or more segments that follow the lost segment are available.
  - 6. The method of claim 5, further comprising:
    - performing a packet loss concealment technique that generates an extrapolated waveform based on the segment that precedes the lost segment in the series of segments but not on any segment that follows the lost segment in the series of segments responsive to a determination that the one or more segments that follow the lost segment are not available.
  - 7. The method of claim 5, further comprising:
    - determining if the segment that precedes the lost segment and the first of the one or more segments that follow the lost segment are deemed voiced segments; and
      
      performing steps (a), (b) and (c) responsive only to a determination that the one or more segments that follow the lost segment are available and that the segment that precedes the lost segment and the first of the one or more segments that follow the lost segment are deemed voiced segments.
  - 8. The method of claim 1, wherein performing the second-pass periodic waveform extrapolation using the pitch contour to generate the extrapolated waveform comprises calculating a scaling factor in accordance with:
    - c = r^(1/m), or a mathematically equivalent formula, wherein c is the scaling factor, m is a number of pitch cycles in a gap that extends from the end of the segment that precedes the lost segment to a middle of an overlap-add region in the first of the one or more segments that follow the lost segment, and r is a ratio of an average magnitude of a decoded waveform in a target matching window over an average magnitude of a waveform that is m pitch periods earlier.
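
The time-lag search of claim 2 can be sketched as a peak search over an energy-normalized cross-correlation. This is a minimal illustration under stated assumptions, not the patented implementation: the function name `find_time_lag`, the parameters `start` and `max_lag`, the search range, and the sign convention (a positive lag means the best match lies later in the first-pass extrapolation than the nominal position) are all hypothetical.

```python
import math

def find_time_lag(extrapolated, decoded, start, max_lag):
    """Return the lag in [-max_lag, max_lag] at which the energy-normalized
    cross-correlation between `decoded` and the window of `extrapolated`
    beginning at index start + lag reaches its peak."""
    n = len(decoded)
    if start - max_lag < 0 or start + max_lag + n > len(extrapolated):
        raise ValueError("search range falls outside the extrapolated waveform")
    decoded_energy = sum(v * v for v in decoded)
    best_lag, best_score = 0, -math.inf
    for lag in range(-max_lag, max_lag + 1):
        window = extrapolated[start + lag : start + lag + n]
        window_energy = sum(v * v for v in window)
        if window_energy == 0.0 or decoded_energy == 0.0:
            continue  # normalization is undefined for an all-zero window
        corr = sum(a * b for a, b in zip(window, decoded))
        score = corr / math.sqrt(window_energy * decoded_energy)
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag

period = 40   # hypothetical pitch period, in samples
start = 80    # hypothetical nominal position of the matching window
true_shift = 3

# First-pass extrapolation: a plain sinusoid standing in for voiced speech.
extrapolated = [math.sin(2 * math.pi * i / period) for i in range(200)]
# Decoded waveform that is 3 samples ahead of where the extrapolation expects it.
decoded = [math.sin(2 * math.pi * (start + true_shift + i) / period)
           for i in range(period)]

lag = find_time_lag(extrapolated, decoded, start, max_lag=8)
```

In this toy case the search recovers the 3-sample misalignment. In the claimed method, the identified lag then feeds the pitch contour calculation used by the second-pass extrapolation.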

9. A computer program product comprising a computer-readable storage unit having computer program logic recorded thereon for enabling a processor to conceal a lost segment in a speech or audio signal that comprises a series of segments, the computer program logic comprising:
- first means for enabling the processor to generate an extrapolated waveform based on a segment that precedes the lost segment in the series of segments and on one or more segments that follow the lost segment in the series of segments;
  
  second means for enabling the processor to generate a replacement waveform for the lost segment based on a first portion of the extrapolated waveform; and
  
  third means for enabling the processor to overlap-add a second portion of the extrapolated waveform with a decoded waveform associated with the one or more segments following the lost segment in the series of segments;
  
  wherein the first means comprises:
  
  means for enabling the processor to perform a first-pass periodic waveform extrapolation using a pitch period associated with the segment that precedes the lost segment to generate a first-pass extrapolated waveform;
  
  means for enabling the processor to identify a time lag between the first-pass extrapolated waveform and the decoded waveform associated with the one or more segments that follow the lost segment;
  
  means for enabling the processor to calculate a pitch contour based on the identified time lag; and
  
  means for enabling the processor to perform a second-pass periodic waveform extrapolation using the pitch contour to generate the extrapolated waveform.
- Dependent claims (10, 11, 12, 13, 14, 15, 16):
  - 10. The computer program product of claim 9, wherein the means for enabling the processor to identify the time lag between the first-pass extrapolated waveform and the decoded waveform associated with the one or more segments that follow the lost segment comprises:
    - means for enabling the processor to locate a peak of an energy-normalized cross-correlation function between the first-pass extrapolated waveform and the decoded waveform associated with the one or more segments that follow the lost segment.
  - 11. The computer program product of claim 9, wherein the means for enabling the processor to calculate the pitch contour comprises means for enabling the processor to determine an amount of pitch period change per sample.
  - 12. The computer program product of claim 11, wherein the means for enabling the processor to determine the amount of pitch period change per sample comprises means for enabling the processor to calculate:
  - 13. The computer program product of claim 9, further comprising:
    - means for enabling the processor to determine if the one or more segments that follow the lost segment in the series of segments are available; and
      
      means for enabling the processor to invoke the first means, second means and third means responsive only to a determination that the one or more segments that follow the lost segment are available.
  - 14. The computer program product of claim 13, further comprising:
    - means for enabling the processor to perform a packet loss concealment technique that generates an extrapolated waveform based on the segment that precedes the lost segment but not on any segment that follows the lost segment in the series of segments responsive to a determination that the one or more segments that follow the lost segment are not available.
  - 15. The computer program product of claim 13, further comprising:
    - means for enabling the processor to determine if the segment that precedes the lost segment and the first of the one or more segments that follow the lost segment are deemed voiced segments; and
      
      means for enabling the processor to invoke the first means, second means and third means responsive only to a determination that the one or more segments that follow the lost segment are available and that the segment that precedes the lost segment and the first of the one or more segments that follow the lost segment are deemed voiced segments.
  - 16. The computer program product of claim 9, wherein the means for enabling the processor to perform the second-pass periodic waveform extrapolation using the pitch contour to generate the extrapolated waveform comprises:
    - means for calculating a scaling factor in accordance with:
      
      c = r^(1/m), or a mathematically equivalent formula, wherein c is the scaling factor, m is a number of pitch cycles in a gap that extends from the end of the segment that precedes the lost segment to a middle of an overlap-add region in the first of the one or more segments that follow the lost segment, and r is a ratio of an average magnitude of a decoded waveform in a target matching window over an average magnitude of a waveform that is m pitch periods earlier.
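
The scaling factor in claims 8 and 16 can be worked through numerically. The sketch below reads the claimed formula as the m-th root of r, so that applying c once per pitch cycle over m cycles gives a total gain of r. The helper names and the sample values are hypothetical.

```python
def average_magnitude(x):
    """Mean absolute sample value of a waveform."""
    return sum(abs(v) for v in x) / len(x)

def scaling_factor(r, m):
    """Per-pitch-cycle scaling factor c = r ** (1 / m)."""
    if r <= 0 or m <= 0:
        raise ValueError("r and m must be positive")
    return r ** (1.0 / m)

# Hypothetical data: the decoded waveform in the target matching window is
# four times quieter than the waveform m = 2 pitch periods earlier.
earlier = [1.0, -1.0, 1.0, -1.0]
target = [0.25, -0.25, 0.25, -0.25]
m = 2

r = average_magnitude(target) / average_magnitude(earlier)  # 0.25
c = scaling_factor(r, m)                                    # 0.5
```

Scaling by c = 0.5 in each of the two pitch cycles brings the extrapolated magnitude from 1.0 down to 0.25 by the time it meets the decoded waveform, rather than changing level abruptly at the overlap-add region.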

17. A method for concealing a lost segment in a speech or audio signal that comprises a series of segments, the method comprising:
- determining if one or more segments that follow the lost segment in the series of segments are available;
  
  if one or more segments that follow the lost segment in the series of segments are available, determining if the segment that precedes the lost segment and the first of the one or more segments that follow the lost segments are deemed voiced segments;
  
  performing packet loss concealment using periodic waveform extrapolation based on a segment that precedes the lost segment in the series of segments and on the one or more segments that follow the lost segment responsive to a determination that the one or more segments that follow the lost segment are available and to a determination that the segment that precedes the lost segment and the first of the one or more segments that follow the lost segment are deemed voiced segments; and
  
  performing packet loss concealment using waveform extrapolation based on the segment that precedes the lost segment but not on any segments that follow the lost segment responsive to a determination that the one or more segments that follow the lost segment are not available or to a determination that either the segment that precedes the lost segment or the first of the one or more segments that follow the lost segment is not deemed a voiced segment;
  
  wherein at least one of the determining or performing steps is performed by a processor.

18. A computer program product comprising a computer-readable storage unit having computer program logic recorded thereon for enabling a processor to conceal a lost segment in a speech or audio signal that comprises a series of segments, the computer program logic comprising:
- first means for enabling the processor to determine if one or more segments that follow the lost segment in the series of segments are available;
  
  second means for enabling the processor to determine if the segment that precedes the lost segment and the first of the one or more segments that follow the lost segments are deemed voiced segments if one or more segments that follow the lost segment in the series of segments are available;
  
  third means for enabling the processor to perform packet loss concealment using periodic waveform extrapolation based on a segment that precedes the lost segment in the series of segments and on the one or more segments that follow the lost segment responsive to a determination that the one or more segments that follow the lost segment are available and to a determination that the segment that precedes the lost segment and the first of the one or more segments that follow the lost segment are deemed voiced segments; and
  
  fourth means for enabling the processor to perform packet loss concealment using waveform extrapolation based on the segment that precedes the lost segment but not on any segments that follow the lost segment responsive to a determination that the one or more segments that follow the lost segment are not available or to a determination that either the segment that precedes the lost segment or the first of the one or more segments that follow the lost segment is not deemed a voiced segment.
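
The selection logic recited in claims 17 and 18 reduces to a small decision function. The sketch below is an illustration only: the enum, the function name, and the boolean inputs are hypothetical, and the patent claims do not specify here how availability or voicing is determined.

```python
from enum import Enum

class PlcMode(Enum):
    GUIDED = "periodic extrapolation using preceding and following segments"
    UNGUIDED = "extrapolation using the preceding segment only"

def select_plc_mode(future_available, prev_voiced, next_voiced):
    """Choose the concealment method for a lost segment.

    Guided extrapolation is selected only when the following segment(s) are
    available and both the preceding segment and the first following segment
    are deemed voiced; every other case falls back to the unguided method.
    """
    if future_available and prev_voiced and next_voiced:
        return PlcMode.GUIDED
    return PlcMode.UNGUIDED

mode = select_plc_mode(future_available=True, prev_voiced=True, next_voiced=False)
```

Here `mode` is `PlcMode.UNGUIDED`, because the first segment after the loss is not deemed voiced.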

Current Assignee: Avago Technologies International Sales Pte Limited (Broadcom, Inc.)
Original Assignee: Broadcom Corporation (Broadcom, Inc.)
Inventors: Chen, Juin-Hwey
Primary Examiner(s): Abebe, Daniel D
Application Number: US11/831,835
Publication Number: US 20080046235A1
Time in Patent Office: 1,981 Days
Field of Search: 704/207, 704/208, 704/218, 704/228
US Class Current: 704/228
CPC Class Codes: G10L 19/005 Correction of errors induce...