Trained artificial neural networks using an imperfect vocal tract model for assessment of speech signal quality
First Claim
Patent Images
1. A non-intrusive method of assessing the quality of a first signal carrying speech, said method comprising the steps of:
- analyzing said signal carrying speech to generate output parameters according to a spectral representation imperfect vocal tract model capable of generating coefficients that can parametrically represent both speech and distortion signal elements, andweighting the output parameters according to a network definition function to generate an output derived from the weighted output parameters, the network definition function being generated using a trainable process, using well conditioned and/or ill-conditioned samples of a test signal, modeled by imperfect the vocal tract model.
2 Assignments
0 Petitions
Accused Products
Abstract
A speech signal is subjected imperfect to vocal tract analysis model and the output therefrom is analyzed by a neural network. The output from the neural network is compared with the parameters stored in the network definition function, to derive measurement of the quality of the speech signal supplied to the source. The network definition function is determined by applying to the trainable processing apparatus a distortion perception measure indicative of the extent to which a distortion would be perceptible to a human listener.
32 Citations
25 Claims
-
1. A non-intrusive method of assessing the quality of a first signal carrying speech, said method comprising the steps of:
-
analyzing said signal carrying speech to generate output parameters according to a spectral representation imperfect vocal tract model capable of generating coefficients that can parametrically represent both speech and distortion signal elements, and weighting the output parameters according to a network definition function to generate an output derived from the weighted output parameters, the network definition function being generated using a trainable process, using well conditioned and/or ill-conditioned samples of a test signal, modeled by imperfect the vocal tract model. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. Apparatus for non-intrusively assessing the quality of a first signal carrying speech, said apparatus comprising:
-
means for analysing said first signal carrying speech using a spectral representation imperfect vocal tract model capable of generating coefficients that can parametrically represent both speech and distortion signal elements to generate output parameters, storage means for storing a set of weightings defining a network definition function, means for generating an output value derived from the output parameters and the network definition function; and training means for generating the stored set of weightings, the training means comprising means for supplying a sample of speech to the analysis means; and means for generating weightings relating to the speech sample, and inserting them in the storage means. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25)
-
Specification