Apparatus and method for audio frame loss recovery

US 9,053,699 B2
Filed: 07/10/2012
Issued: 06/09/2015
Est. Priority Date: 07/10/2012
Status: Active Grant

Abstract

A method and apparatus provide for audio frame recovery by identifying a sequence of lost frames of coded audio data as being lost or corrupted; identifying a first frame of coded audio data, which immediately preceded the sequence of lost frames, as having been encoded using a time domain coding method; identifying a second frame of coded audio data, which immediately followed the sequence of lost frames of coded audio data, as having been encoded using a transform domain coding method; obtaining a pitch delay; generating a second decoded audio portion of the second frame based on the second frame; generating a first decoded audio portion of the second frame based on the pitch delay and decoded audio samples; and generating a decoded audio output of the second frame based on a sequential combination of the first and second decoded audio portions.
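The identification steps in the abstract can be illustrated with a minimal Python sketch. It is not taken from the patent: the `Frame` type, the mode labels, and the function name `find_recoverable_gaps` are hypothetical, and the sketch assumes the decoder already knows each received frame's coding method and which frames were lost.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Hypothetical labels for the two coding methods named in the abstract.
TIME_DOMAIN = "time"
TRANSFORM_DOMAIN = "transform"


@dataclass
class Frame:
    mode: str            # coding method; ignored for lost frames
    lost: bool = False   # True if the frame was lost or corrupted


def find_recoverable_gaps(frames: List[Frame]) -> List[Tuple[int, int]]:
    """Return (first_lost, last_lost) index pairs for every run of lost
    frames that is immediately preceded by a time domain frame and
    immediately followed by a transform domain frame."""
    gaps = []
    i, n = 0, len(frames)
    while i < n:
        if not frames[i].lost:
            i += 1
            continue
        start = i
        while i < n and frames[i].lost:   # consume the whole run of losses
            i += 1
        has_neighbors = start > 0 and i < n
        if (has_neighbors
                and frames[start - 1].mode == TIME_DOMAIN
                and frames[i].mode == TRANSFORM_DOMAIN):
            gaps.append((start, i - 1))
    return gaps
```

A run of lost frames that follows a transform domain frame, or that reaches the end of the stream, is skipped by this sketch because it falls outside the case the abstract describes.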

9 Claims

1. A method for processing a sequence of frames of coded audio data comprising the steps of:
- identifying a sequence of lost frames of coded audio data as being lost or corrupted, wherein the sequence of lost frames comprises one or more lost frames;
  
  identifying a first frame of coded audio data, which immediately preceded the sequence of lost frames of coded audio data, as having been encoded using a time domain coding method;
  
  identifying a second frame of coded audio data, which immediately followed the sequence of lost frames of coded audio data, as having been encoded using a transform domain coding method;
  
  generating replacement audio samples for the sequence of lost frames based on the first frame of coded data;
  
  obtaining a pitch delay from at least one of the first and second frames of coded audio data;
  
  generating a second decoded audio portion of the second frame based on the second frame of coded audio data;
  
  generating a first decoded audio portion of the second frame based on the pitch delay and at least one of the second decoded audio portion and the replacement audio samples; and
  
  generating a decoded audio output of the second frame based on a sequential combination of the first and second decoded audio portions, wherein the first decoded audio portion is determined as

  $\hat{s}_g(i) = \alpha \cdot \hat{s}_s(i - T_1) + \beta \cdot \hat{s}_a(i + T_2); \quad 0 \le i < l,$

  wherein $\hat{s}_g$ is a vector of length $l$ determined as a weighted sum of decoded audio samples, wherein a first set of samples $\hat{s}_s(i - T_1)$ is weighted by the value $0 \le \alpha \le 1$ and a second set of samples $\hat{s}_a(i + T_2)$ is weighted by the value $\beta = 1 - \alpha$, $T_1$ is the pitch delay, $T_2$ is an integer multiple of the pitch delay.

2. The method of claim 1 further comprising:
- generating a sequence of replacement audio output frames for the sequence of lost frames of coded audio data based at least on the first frame of coded data.

3. The method of claim 1 wherein the audio samples used in the determination of the first decoded audio portion comprise audio samples from a last replacement frame of the sequence of lost frames and the second decoded audio portion.
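
The weighted sum recited in claim 1 can be sketched in Python as follows. This is an illustration under stated assumptions, not the patented implementation. The function name `first_decoded_portion` and its parameters are hypothetical. The sketch assumes the first decoded portion starts at sample index 0, that `past` holds the samples immediately before it (for example, the tail of the last replacement frame, as in claim 3), and that `second` holds the second decoded audio portion starting at sample index `length`. It also assumes the pitch delay is at least `length`, so that every referenced sample already exists.

```python
from typing import List, Sequence


def first_decoded_portion(past: Sequence[float], second: Sequence[float],
                          t1: int, multiple: int, alpha: float,
                          length: int) -> List[float]:
    """Compute out[i] = alpha * s_s(i - T1) + beta * s_a(i + T2), 0 <= i < length.

    Assumed indexing (not specified in the claim): past[-1] sits at sample
    index -1 and second[0] sits at sample index `length`.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must satisfy 0 <= alpha <= 1")
    if t1 <= 0 or multiple < 1:
        raise ValueError("pitch delay and its multiple must be positive")
    t2 = multiple * t1           # T2 is an integer multiple of the pitch delay
    beta = 1.0 - alpha           # beta = 1 - alpha
    if t1 < length or t1 > len(past):
        raise ValueError("past samples do not cover i - T1 for all i")
    if t2 < length or t2 > len(second):
        raise ValueError("second portion does not cover i + T2 for all i")

    out = []
    for i in range(length):
        s_s = past[len(past) + i - t1]    # sample at index i - T1 (before the gap)
        s_a = second[i + t2 - length]     # sample at index i + T2 (after the gap)
        out.append(alpha * s_s + beta * s_a)
    return out
```

With `alpha = 1.0` the result is a pure backward pitch repetition of `past`, and with `alpha = 0.0` it is a pure copy from the second decoded portion; intermediate values blend the two.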

4. An apparatus for decoding an audio signal, comprising:
- a receiver for receiving a sequence of frames of coded audio data; and
  
  a processing system for

  identifying a sequence of lost frames of coded audio data as being lost or corrupted, wherein the sequence of lost frames comprises one or more lost frames,

  identifying a first frame of coded audio data, which immediately preceded the sequence of lost frames of coded audio data, as having been encoded using a time domain coding method,

  identifying a second frame of coded audio data, which immediately followed the sequence of lost frames of coded audio data, as having been encoded using a transform domain coding method,

  generating replacement audio samples for the sequence of lost frames based on the first frame of coded data;

  obtaining a pitch delay from at least one of the first and second frames of coded audio data,

  generating a second decoded audio portion of the second frame based on the second frame of coded audio data,

  generating a first decoded audio portion of the second frame based on the pitch delay and at least one of the second decoded audio portion and the replacement audio samples, and

  generating a decoded audio output of the second frame based on a sequential combination of the first and second decoded audio portions,

  wherein the processor determines the first decoded audio portion as

  $\hat{s}_g(i) = \alpha \cdot \hat{s}_s(i - T_1) + \beta \cdot \hat{s}_a(i + T_2); \quad 0 \le i < l,$

  wherein $\hat{s}_g$ is a vector of length $l$ determined as a weighted sum of decoded audio samples, wherein a first set of samples $\hat{s}_s(i - T_1)$ is weighted by the value $0 \le \alpha \le 1$ and a second set of samples $\hat{s}_a(i + T_2)$ is weighted by the value $\beta = 1 - \alpha$, $T_1$ is the pitch delay, $T_2$ is an integer multiple of the pitch delay.

5. The apparatus according to claim 4, wherein the processor is further for:
- generating a sequence of replacement audio output frames for the sequence of lost frames of coded audio data based at least on the first frame of coded data.

6. The apparatus according to claim 4, wherein the audio samples used in the determination of the first decoded audio portion comprise audio samples from a last replacement frame of the sequence of lost frames and the second decoded audio portion.

7. A non-transitory computer readable medium that stores programming instructions that, when executed on a processor having hardware associated therewith for receiving an audio signal, performs processing of a sequence of frames of coded audio data, comprising:
- identifying a sequence of lost frames of coded audio data as being lost or corrupted, wherein the sequence of lost frames comprises one or more lost frames;
  
  identifying a first frame of coded audio data, which immediately preceded the sequence of lost frames of coded audio data, as having been encoded using a time domain coding method;
  
  identifying a second frame of coded audio data, which immediately followed the sequence of lost frames of coded audio data, as having been encoded using a transform domain coding method;
  
  generating replacement audio samples for the sequence of lost frames based on the first frame of coded data;
  
  obtaining a pitch delay from at least one of the first and second frames of coded audio data;
  
  generating a second decoded audio portion of the second frame based on the second frame of coded audio data;
  
  generating a first decoded audio portion of the second frame based on the pitch delay and at least one of the decoded audio portion and the replacement audio samples; and
  
  generating a decoded audio output of the second frame based on a sequential combination of the first and second decoded audio portions, wherein the first decoded audio portion is determined as

  $\hat{s}_g(i) = \alpha \cdot \hat{s}_s(i - T_1) + \beta \cdot \hat{s}_a(i + T_2); \quad 0 \le i < l,$

  wherein $\hat{s}_g$ is a vector of length $l$ determined as a weighted sum of decoded audio samples, wherein a first set of samples $\hat{s}_s(i - T_1)$ is weighted by the value $0 \le \alpha \le 1$ and a second set of samples $\hat{s}_a(i + T_2)$ is weighted by the value $\beta = 1 - \alpha$, $T_1$ is the pitch delay, $T_2$ is an integer multiple of the pitch delay.

8. The non-transitory computer readable medium according to claim 7, wherein the instructions further perform:
- generating a sequence of replacement audio output frames for the sequence of lost frames of coded audio data based at least on the first frame of coded data.

9. The non-transitory computer readable medium according to claim 7, wherein the audio samples used in the determination of the first decoded audio portion comprise audio samples from a last replacement frame of the sequence of lost frames and the second decoded audio portion.

Current Assignee: Google Technology Holdings LLC (Alphabet Inc.)
Original Assignee: Google Technology Holdings LLC (Alphabet Inc.)
Inventors: Mittal, Udar; Ashley, James P.
Primary Examiner(s): Saint Cyr, Leonard
Application Number: US13/545,277
Publication Number: US 20140019142A1
Time in Patent Office: 1,064 Days
Field of Search: 704/500-504
US Class Current: 1/1
CPC Class Codes:
G10L 19/005 Correction of errors induce...
G10L 19/20 using sound class specific ...
