Audio frame labeling to achieve unequal error protection for audio frames of unequal importance
First Claim
1. A method comprising:
- receiving a sequence of audio frames;
labeling each audio frame with a respective one of multiple possible labels for silence, noise, concealable speech, and un-concealable speech based on content in the audio frame;
determining for each audio frame based on the respective label of the audio frame a respective importance level among possible importance levels ranging from a low importance level to a high importance;
producing different subsets of audio frames such that all of the audio frames in each subset have the same label;
applying forward error correction to a respective fraction of audio frames of each subset of audio frames, such that the respective fraction increases in an order of the labels silence, noise, concealable speech and un-concealable speech, wherein the applying forward error correction further includes applying to the audio frames labeled as un-concealable speech first forward error correction that uses a sequential recovery code, and applying to audio frames not labeled as un-concealable speech second forward error correction that uses a code having a longer delay than the sequential recovery code; and
not applying forward error correction to remaining audio frames in each subset.
1 Assignment
0 Petitions
Accused Products
Abstract
An endpoint device receives a sequence of audio frames. The endpoint device determines for each audio frame a respective importance level among possible importance levels ranging from a low importance level to a high importance level based on content in the audio frame indicative of the respective importance level. The endpoint device associates each audio frame with the respective importance level, to produce different subsets of audio frames associated with respective ones of different importance levels. The endpoint device, for each subset of audio frames, applies forward error correction to a fraction of audio frames in the subset of audio frames, wherein the fraction increases as the importance level of the audio frames in the subset increases, and does not apply forward error correction to remaining audio frames in the subset.
-
Citations
20 Claims
-
1. A method comprising:
-
receiving a sequence of audio frames; labeling each audio frame with a respective one of multiple possible labels for silence, noise, concealable speech, and un-concealable speech based on content in the audio frame; determining for each audio frame based on the respective label of the audio frame a respective importance level among possible importance levels ranging from a low importance level to a high importance; producing different subsets of audio frames such that all of the audio frames in each subset have the same label; applying forward error correction to a respective fraction of audio frames of each subset of audio frames, such that the respective fraction increases in an order of the labels silence, noise, concealable speech and un-concealable speech, wherein the applying forward error correction further includes applying to the audio frames labeled as un-concealable speech first forward error correction that uses a sequential recovery code, and applying to audio frames not labeled as un-concealable speech second forward error correction that uses a code having a longer delay than the sequential recovery code; and not applying forward error correction to remaining audio frames in each subset. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. An apparatus comprising:
-
a network interface unit configured to enable communications over a communication network; and a processor coupled to the network interface unit and configured to; receive a sequence of audio frames; label each audio frame with a respective one of multiple possible labels for silence, noise, concealable speech, and un-concealable speech based on content in the audio frame; determine for each audio frame based on the respective label of the audio frame a respective importance level among possible importance levels ranging from a low importance level to a high importance level; produce different subsets of audio frames such that all of the audio frames in each subset have the same label; apply forward error correction to a respective fraction of audio frames of each subset of audio frames, such that the respective fraction increases in an order of the labels silence, noise, concealable speech and un-concealable speech, wherein the processor is configured to apply the forward error correction by applying to the audio frames labeled as un-concealable speech first forward error correction that uses a sequential recovery code, and applying to audio frames not labeled as un-concealable speech second forward error correction that uses a code having a longer delay than the sequential recovery code; and not apply forward error correction to remaining audio frames in the subset. - View Dependent Claims (13, 14, 15, 16)
-
-
17. A non-transitory processor readable medium encoded with instructions that, when executed by a processor, cause the processor to perform operations including:
-
receiving a sequence of audio frames; labeling each audio frame with a respective one of multiple possible labels for silence, noise, concealable speech, and un-concealable speech based on content in the audio frame; determining for each audio frame based on the respective label of the audio frame a respective importance level among possible importance levels ranging from a low importance level to a high importance level; producing different subsets of audio frames such that all of the audio frames in each subset have the same label; applying forward error correction to a respective fraction of audio frames of each subset of audio frames, such that the respective fraction increases in an order of the labels silence, noise, concealable speech and un-concealable speech, wherein the applying forward error correction further includes applying to the audio frames labeled as un-concealable speech first forward error correction that uses a sequential recovery code, and applying to audio frames not labeled as un-concealable speech second forward error correction that uses a code having a longer delay than the sequential recovery code; and not applying forward error correction to remaining audio frames in the subset. - View Dependent Claims (18, 19, 20)
-
Specification