Apparatus and method for audio frame loss recovery
First Claim
1. A method for processing a sequence of frames of coded audio data comprising the steps of:
- identifying a sequence of lost frames of coded audio data as being lost or corrupted, wherein the sequence of lost frames comprises one or more lost frames;
identifying a first frame of coded audio data, which immediately preceded the sequence of lost frames of coded audio data, as having been encoded using a time domain coding method;
identifying a second frame of coded audio data, which immediately followed the sequence of lost frames of coded audio data, as having been encoded using a transform domain coding method;
generating replacement audio samples for the sequence of lost frames based on the first frame of coded data;
obtaining a pitch delay from at least one of the first and second frames of coded audio data;
generating a second decoded audio portion of the second frame based on the second frame of coded audio data;
generating a first decoded audio portion of the second frame based on the pitch delay and at least one of the second decoded audio portion and the replacement audio samples; and
generating a decoded audio output of the second frame based on a sequential combination of the first and second decoded audio portions,wherein the first decoded audio portion is determined as
ŝ
g(i)=α
·
ŝ
s(i−
T1)+β
·
ŝ
g(i+T2);
0<
+i+l, wherein ŝ
g is a vector of length l determined as a weighted sum of decoded audio samples, wherein a first set of samples ŝ
s(i−
T1) is weighted by the value 0<
=α
<
=1 and a second set of samples ŝ
α
(i+T2) is weighted by the value β
=1−
α
, T1 is the pitch delay, T2 is an integer multiple of the pitch delay.
3 Assignments
0 Petitions
Accused Products
Abstract
A method and apparatus provide for audio frame recovery by identifying a sequence of lost frames of coded audio data as being lost or corrupted; identifying a first frame of coded audio data which immediately preceded the sequence of lost frames, as having been encoded using a time domain coding method; identifying a second frame of coded audio data, which immediately followed the sequence of lost frames of coded audio data, as having been encoded using a transform domain coding method; obtaining a pitch delay; generating a second decoded audio portion of the second frame based on the second frame; generating a first decoded audio portion of the second frame based on the pitch delay and decoded audio samples; and generating a decoded audio output of the second frame based on a sequential combination of the first and second decoded audio portions.
8 Citations
9 Claims
-
1. A method for processing a sequence of frames of coded audio data comprising the steps of:
-
identifying a sequence of lost frames of coded audio data as being lost or corrupted, wherein the sequence of lost frames comprises one or more lost frames; identifying a first frame of coded audio data, which immediately preceded the sequence of lost frames of coded audio data, as having been encoded using a time domain coding method; identifying a second frame of coded audio data, which immediately followed the sequence of lost frames of coded audio data, as having been encoded using a transform domain coding method; generating replacement audio samples for the sequence of lost frames based on the first frame of coded data; obtaining a pitch delay from at least one of the first and second frames of coded audio data; generating a second decoded audio portion of the second frame based on the second frame of coded audio data; generating a first decoded audio portion of the second frame based on the pitch delay and at least one of the second decoded audio portion and the replacement audio samples; and generating a decoded audio output of the second frame based on a sequential combination of the first and second decoded audio portions, wherein the first decoded audio portion is determined as
ŝ
g(i)=α
·
ŝ
s(i−
T1)+β
·
ŝ
g(i+T2);
0<
+i+l,wherein ŝ
g is a vector of length l determined as a weighted sum of decoded audio samples, wherein a first set of samples ŝ
s(i−
T1) is weighted by the value 0<
=α
<
=1 and a second set of samples ŝ
α
(i+T2) is weighted by the value β
=1−
α
, T1 is the pitch delay, T2 is an integer multiple of the pitch delay. - View Dependent Claims (2, 3)
-
-
4. An apparatus for decoding an audio signal, comprising:
-
a receiver for receiving a sequence of frames of coded audio data; and a processing system for identifying a sequence of lost frames of coded audio data as being lost or corrupted, wherein the sequence of lost frames comprises one or more lost frames, identifying a first frame of coded audio data, which immediately preceded the sequence of lost frames of coded audio data, as having been encoded using a time domain coding method, identifying a second frame of coded audio data, which immediately followed the sequence of lost frames of coded audio data, as having been encoded using a transform domain coding method, generating replacement audio samples for the sequence of lost frames based on the first frame of coded data; obtaining a pitch delay from at least one of the first and second frames of coded audio data, generating a second decoded audio portion of the second frame based on the second frame of coded audio data, generating a first decoded audio portion of the second frame based on the pitch delay and at least one of the second decoded audio portion and the replacement audio samples, and generating a decoded audio output of the second frame based on a sequential combination of the first and second decoded audio portions, wherein the processor determines the first decoded audio portion as
ŝ
g(i)=α
·
ŝ
s(i−
T1)+β
·
ŝ
α
(i+T2);
0<
+i+l,wherein ŝ
g is a vector of length l determined as a weighted sum of decoded audio samples, wherein a first set of samples ŝ
s(i−
T1) is weighted by the value 0<
=α
<
=1 and a second set of samples ŝ
α
(i+T2) is weighted by the value β
=1−
α
, T1 is the pitch delay, T2 is an integer multiple of the pitch delay. - View Dependent Claims (5, 6)
-
-
7. A non-transitory computer readable medium that stores programming instructions that, when executed on a processor having hardware associated therewith for receiving an audio signal, performs processing of a sequence of frames of coded audio data, comprising:
-
identifying a sequence of lost frames of coded audio data as being lost or corrupted, wherein the sequence of lost frames comprises one or more lost frames; identifying a first frame of coded audio data, which immediately preceded the sequence of lost frames of coded audio data, as having been encoded using a time domain coding method; identifying a second frame of coded audio data, which immediately followed the sequence of lost frames of coded audio data, as having been encoded using a transform domain coding method; generating replacement audio samples for the sequence of lost frames based on the first frame of coded data; obtaining a pitch delay from at least one of the first and second frames of coded audio data; generating a second decoded audio portion of the second frame based on the second frame of coded audio data; generating a first decoded audio portion of the second frame based on the pitch delay and at least one of the decoded audio portion and the replacement audio samples; and generating a decoded audio output of the second frame based on a sequential combination of the first and second decoded audio portions, wherein the first decoded audio portion is determined as
ŝ
g(i)=α
·
ŝ
s(i−
T1)+β
·
ŝ
α
(i+T2);
0<
+i+l,wherein ŝ
g is a vector of length l determined as a weighted sum of decoded audio samples, wherein a first set of samples ŝ
s(i−
T1) is weighted by the value 0<
=α
<
=1 and a second set of samples ŝ
α
(i+T2) is weighted by the value β
=1−
α
, T1 is the pitch delay, T2 is an integer multiple of the pitch delay. - View Dependent Claims (8, 9)
-
Specification