Dynamic control of voice codec data rate
Abstract
A method, system, and computer-usable non-transitory storage device for dynamic voice codec adaptation are disclosed. The voice codec adapts in real time to devote more bits to audio quality when it is most needed and fewer bits to less important parts of utterances. Dialog knowledge is used to identify compression opportunities and adjust the bitrate moment by moment, based on the inferred value of each frame. Frame importance and appropriate transmission fidelity are predicted based on prosodic features and models of dialog dynamics. This technique provides the same communications quality with lower spectrum needs, fewer antennas, and less battery drain.
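As a rough illustration of the moment-by-moment adjustment described above, the sketch below maps a per-frame importance score to a codec bitrate. The function names `select_bitrate` and `allocate`, the 0-to-1 importance scale, the linear mapping, and the 6 to 24 kbit/s range are all assumptions made for illustration; the text here does not specify any of them.

```python
def select_bitrate(importance: float,
                   min_kbps: float = 6.0,
                   max_kbps: float = 24.0) -> float:
    """Map an importance score to a bitrate in kbit/s.

    Hypothetical rule: clamp the score to [0, 1], then interpolate
    linearly between the minimum and maximum bitrate.
    """
    clamped = min(1.0, max(0.0, importance))
    return min_kbps + clamped * (max_kbps - min_kbps)


def allocate(importances):
    """Choose a bitrate for each frame, one frame at a time."""
    return [select_bitrate(v) for v in importances]
```

Under these assumptions, a frame scored 0.5 would be encoded at 15 kbit/s, while scores outside the 0-to-1 range are clamped to the nearest bound.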
17 Claims
1. A method for dynamic codec adaptation, comprising:
estimating an importance value of a data frame utilizing information, wherein estimating said importance value of said data frame utilizing said information further comprises measuring sub-metrics, said sub-metrics comprising at least one of objective informational importance, dialog-control importance, interpersonal-relationship-management importance, and meta-dialog importance;
dynamically varying a data rate based on said estimated said importance value of said data frame;
devoting a greater bitrate to data frame quality in real time for a data frame with an assigned high importance value;
devoting less bitrate to data frame quality in real time for a data frame with an assigned low importance value; and
transmitting important voice frames, communications, and utterances with greater accuracy and a certain quality level for a data rate, wherein important voice frames comprise audio with a high pitch range, pitch height, volume, and overlapped speech by multiple speakers.
Dependent claims: 2, 3, 4, 5, 6
7. A system for dynamic codec adaptation, comprising:
a processor;
a data bus coupled to said processor; and
a computer-usable non-transitory storage medium embodying computer code, said computer-usable non-transitory storage medium being coupled to said data bus, said computer program code comprising program instructions executable by said processor, said program instructions comprising;
program instructions to estimate an importance value of a data frame utilizing information, wherein said program instructions to estimate said importance value of said data frame further comprises program instructions to measure sub-metrics, said sub-metrics comprising at least one of objective informational importance, dialog-control importance, interpersonal-relationship-management importance, and meta-dialog importance;
program instructions to dynamically vary a data rate based on said estimated said importance value of said data frame;
program instructions to devote a greater bitrate to data frame quality in real time for a data frame with an assigned high importance value;
program instructions to devote less bitrate to data frame quality in real time for a data frame with an assigned low importance value; and
program instruction to transmit important voice frames, communications, and utterances with greater accuracy and a certain quality level for a data rate, wherein important voice frames comprise audio with a high pitch range, pitch height, volume, and overlapped speech by multiple speakers.
Dependent claims: 8, 9, 10, 11, 12
13. A computer-usable non-transitory storage device storing computer program code for dynamic codec adaptation, said computer program code comprising program instructions executable by a processor, said program instructions comprising:
program instructions to estimate an importance value of a data frame utilizing information, wherein said program instructions to estimate said importance value of said data frame utilizing said information comprises program instructions to measure sub-metrics, said sub-metrics comprising at least one of objective informational importance, dialog-control importance, interpersonal-relationship-management importance, and meta-dialog importance;
program instructions to dynamically vary a data rate based on said estimated said importance value of said data frame;
program instructions to devote a greater bitrate to data frame quality in real time for a data frame with an assigned high importance value; and
program instructions to devote less bitrate to data frame quality in real time for a data frame with an assigned low importance value; and
program instructions to transmit important voice frames, communications, and utterances with greater accuracy and a certain quality level for a data rate, wherein important voice frames comprise audio with a high pitch range, pitch height, volume, and overlapped speech by multiple speakers.
Dependent claims: 14, 15, 16, 17
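The importance-estimation step recited in the independent claims can be pictured with a small sketch. It assumes each of the four named sub-metrics is already available as a score between 0 and 1 and combines whichever ones are present by simple averaging. The `SubMetrics` type, the `estimate_importance` function, the score scale, and the averaging rule are hypothetical; the claims require only that at least one sub-metric be measured and do not say how the measurements are combined.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass
class SubMetrics:
    """Per-frame sub-metric scores; None means 'not measured' (hypothetical type)."""
    objective_informational: Optional[float] = None
    dialog_control: Optional[float] = None
    interpersonal_relationship_management: Optional[float] = None
    meta_dialog: Optional[float] = None


def estimate_importance(metrics: SubMetrics) -> float:
    """Average the sub-metrics that were measured (assumed combination rule)."""
    values = [getattr(metrics, f.name) for f in fields(metrics)]
    present = [v for v in values if v is not None]
    if not present:
        # The claims call for at least one sub-metric.
        raise ValueError("at least one sub-metric is required")
    return sum(present) / len(present)
```

A weighted sum or a learned model over prosodic features would fit the same interface; plain averaging is used here only to keep the example short.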
Specification