Method and apparatus for estimating subjective audio signal quality from objective distortion measures
Abstract
A mapping function is generated between subjective measures of audio signal quality, e.g., mean opinion score (MOS) or degradation MOS (DMOS) measures, and corresponding objective distortion measures, e.g., auditory speech quality measures (ASQMs) or perceptual speech quality measures (PSQMs), for known audio signals. The subjective measures and corresponding objective distortion measures are determined in accordance with modulated noise reference unit (MNRU) conditions or other suitable distortion conditions placed on the source speech, and a regression analysis is applied to the results to generate the mapping function. The mapping function may then be utilized, e.g., to evaluate speech quality of additional source speech from a particular speech coding system. In this case, the objective distortion measure is generated using the additional source speech, and the resulting objective measure is applied as an input to the mapping function to generate an estimate of the value of the subjective measure. Advantageously, the mapping function is database-independent, and can thus be used, e.g., to generate accurate estimates of subjective measures of speech quality for speech databases unrelated to those used in generating the mapping function.
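The regression step described above can be sketched as follows. This is a minimal illustration, not the patent's procedure: it assumes a first-order (straight-line) least-squares fit, whereas the abstract only says "a regression analysis" without fixing its form. The function name `fit_mapping` and all numeric values are hypothetical placeholders, not measurements from any speech database.

```python
from typing import Callable, Sequence


def fit_mapping(objective: Sequence[float],
                subjective: Sequence[float]) -> Callable[[float], float]:
    """Fit subjective ~ slope * objective + intercept by ordinary least squares.

    Assumption: a straight-line model. The source does not specify the
    regression form, so this is only one possible choice.
    """
    n = len(objective)
    if n != len(subjective) or n < 2:
        raise ValueError("need at least two paired measurements")

    mean_x = sum(objective) / n
    mean_y = sum(subjective) / n
    sxx = sum((x - mean_x) ** 2 for x in objective)
    if sxx == 0:
        raise ValueError("objective measures must not all be identical")
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(objective, subjective))

    slope = sxy / sxx
    intercept = mean_y - slope * mean_x

    def mapping(distortion: float) -> float:
        # Estimated subjective score for a new objective distortion value.
        return slope * distortion + intercept

    return mapping


# Hypothetical training pairs (made-up values for illustration only):
# objective distortion measured under several reference conditions,
# paired with the subjective score obtained for the same conditions.
train_objective = [0.5, 1.0, 1.5, 2.0]
train_subjective = [4.5, 4.0, 3.5, 3.0]

mapping = fit_mapping(train_objective, train_subjective)

# Objective distortion measured for a new signal from some coder under test.
estimated_score = mapping(1.2)
print(f"estimated subjective score: {estimated_score:.2f}")
```

Once fitted on the reference conditions, the same `mapping` can be reused for signals from other databases, which is the database-independence property the abstract claims; whether a straight line is adequate for real data is not something this sketch establishes.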
23 Claims
1. A method of estimating audio signal quality, the method comprising the steps of:
generating a mapping function between a plurality of actual subjective measures determined for a given set of audio signals and corresponding objective distortion measures determined for the given set of audio signals; and
utilizing the mapping function to generate an estimated subjective measure from an objective distortion measure determined for another audio signal;
wherein a portion of at least one of the objective distortion measures associated with an mth frame of a given source speech sequence is given by
where X(m, i) and Y(m, i) are auditory representations of source and processed speech, respectively, for the sequence, 1 ≤ i ≤ Nb denotes a frequency bin index, Nb is the dimension of a frame vector, and C(m, i) is an asymmetric weighting factor;
wherein an overall auditory-based objective distortion measure between the source and processed speech sequences X and Y is determined by
12. An apparatus comprising a processing system operative to generate a mapping function between a plurality of actual subjective measures determined for a given set of audio signals and corresponding objective distortion measures determined for the given set of audio signals, and to utilize the mapping function to generate an estimated subjective measure from an objective distortion measure determined for another audio signal;
wherein a portion of at least one of the objective distortion measures associated with an mth frame of a given source speech sequence is given by
where X(m, i) and Y(m, i) are auditory representations of source and processed speech, respectively, for the sequence, 1 ≤ i ≤ Nb denotes a frequency bin index, Nb is the dimension of a frame vector, and C(m, i) is an asymmetric weighting factor;
wherein an overall auditory-based objective distortion measure between the source and processed speech sequences X and Y is determined by
23. An article of manufacture comprising a machine-readable medium for storing one or more software programs which when executed in a data processor implement the steps of:
generating a mapping function between a plurality of actual subjective measures determined for a given set of audio signals and corresponding objective distortion measures determined for the given set of audio signals; and
utilizing the mapping function to generate an estimated subjective measure from an objective distortion measure determined for another audio signal;
wherein a portion of at least one of the objective distortion measures associated with an mth frame of a given source speech sequence is given by
where X(m, i) and Y(m, i) are auditory representations of source and processed speech, respectively, for the sequence, 1 ≤ i ≤ Nb denotes a frequency bin index, Nb is the dimension of a frame vector, and C(m, i) is an asymmetric weighting factor;
wherein an overall auditory-based objective distortion measure between the source and processed speech sequences X and Y is determined by
Specification