Method and apparatus for rapid adapt via cumulative distribution function matching for continuous speech
First Claim
1. A method of adapting a speech recognition system to one or more acoustic conditions, the method comprising the steps of:
- computing cumulative distribution functions based on dimensions of speech vectors associated with training speech data provided to the speech recognition system;
computing cumulative distribution functions based on dimensions of speech vectors associated with test speech data provided to the speech recognition system;
computing a nonlinear transformation mapping based on the cumulative distribution functions associated with the training speech data and the cumulative distribution functions associated with the test speech data; and
applying the nonlinear transformation mapping to speech vectors associated with the test speech data prior to recognition, wherein the speech vectors transformed in accordance with the nonlinear transformation mapping are substantially similar to speech vectors associated with the training speech data.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of adapting a speech recognition system to one or more acoustic conditions comprises the steps of: (i) computing cumulative distribution functions based on dimensions of speech vectors associated with training speech data provided to the speech recognition system; (ii) computing cumulative distribution functions based on dimensions of speech vectors associated with test speech data provided to the speech recognition system; (iii) computing a nonlinear transformation mapping based on the cumulative distribution functions associated with the training speech data and the cumulative distribution functions associated with the test speech data; and (iv) applying the nonlinear transformation mapping to speech vectors associated with the test speech data prior to recognition, wherein the speech vectors transformed in accordance with the nonlinear transformation mapping are substantially similar to speech vectors associated with the training speech data.
26 Citations
37 Claims
-
1. A method of adapting a speech recognition system to one or more acoustic conditions, the method comprising the steps of:
-
computing cumulative distribution functions based on dimensions of speech vectors associated with training speech data provided to the speech recognition system;
computing cumulative distribution functions based on dimensions of speech vectors associated with test speech data provided to the speech recognition system;
computing a nonlinear transformation mapping based on the cumulative distribution functions associated with the training speech data and the cumulative distribution functions associated with the test speech data; and
applying the nonlinear transformation mapping to speech vectors associated with the test speech data prior to recognition, wherein the speech vectors transformed in accordance with the nonlinear transformation mapping are substantially similar to speech vectors associated with the training speech data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. Apparatus for adapting a speech recognition system to one or more acoustic conditions, the apparatus comprising:
-
at least one processing device operative to;
(i) compute cumulative distribution functions based on dimensions of speech vectors associated with training speech data provided to the speech recognition system;
(ii) compute cumulative distribution functions based on dimensions of speech vectors associated with test speech data provided to the speech recognition system;
(iii) compute a nonlinear transformation mapping based on the cumulative distribution functions associated with the training speech data and the cumulative distribution functions associated with the test speech data; and
(iv) apply the nonlinear transformation mapping to speech vectors associated with the test speech data prior to recognition, wherein the speech vectors transformed in accordance with the nonlinear transformation mapping are substantially similar to speech vectors associated with the training speech data. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. An article of manufacture for use in adapting a speech recognition system to one or more acoustic conditions, comprising a machine readable medium containing one or more programs which when executed implement the steps of:
-
computing cumulative distribution functions based on dimensions of speech vectors associated with training speech data provided to the speech recognition system;
computing cumulative distribution functions based on dimensions of speech vectors associated with test speech data provided to the speech recognition system;
computing a nonlinear transformation mapping based on the cumulative distribution functions associated with the training speech data and the cumulative distribution functions associated with the test speech data; and
applying the nonlinear transformation mapping to speech vectors associated with the test speech data prior to recognition, wherein the speech vectors transformed in accordance with the nonlinear transformation mapping are substantially similar to speech vectors associated with the training speech data.
-
-
22. A method of adapting a speech recognition system to one or more acoustic conditions, the method comprising the steps of:
-
computing cumulative distribution functions based on dimensions of speech vectors associated with training speech data provided to the speech recognition system;
computing cumulative distribution functions based on dimensions of speech vectors associated with test speech data provided to the speech recognition system;
computing a nonlinear transformation mapping based on the cumulative distribution functions associated with the training speech data and the cumulative distribution functions associated with the test speech data; and
applying the nonlinear transformation mapping to speech vectors associated with the test speech data prior to recognition, wherein the speech vectors transformed in accordance with the nonlinear transformation mapping are substantially similar to speech vectors associated with the training speech data;
wherein the step of computing the cumulative distribution functions associated with the training speech data further comprises the steps of determining, for each dimension, a maximum value and a minimum value across the training speech data, and uniformly dividing a range associated with the minimum value and the maximum value into non-overlapping intervals. - View Dependent Claims (23)
-
-
24. A method of adapting a speech recognition system to one or more acoustic conditions, the method comprising the steps of:
-
computing cumulative distribution functions based on dimensions of speech vectors associated with training speech data provided to the speech recognition system;
computing cumulative distribution functions based on dimensions of speech vectors associated with test speech data provided to the speech recognition system;
computing a nonlinear transformation mapping based on the cumulative distribution functions associated with the training speech data and the cumulative distribution functions associated with the test speech data; and
applying the nonlinear transformation mapping to speech vectors associated with the test speech data prior to recognition, wherein the speech vectors transformed in accordance with the nonlinear transformation mapping are substantially similar to speech vectors associated with the training speech data;
wherein the step of computing the cumulative distribution functions associated with the test speech data further comprises the step of determining, for each dimension, a maximum value and a minimum value across the test speech data. - View Dependent Claims (25, 26)
-
-
27. A method of adapting a speech recognition system to one or more acoustic conditions, the method comprising the steps of:
-
computing cumulative distribution functions based on dimensions of speech vectors associated with training speech data provided to the speech recognition system;
computing cumulative distribution functions based on dimensions of speech vectors associated with test speech data provided to the speech recognition system;
computing a nonlinear transformation mapping based on the cumulative distribution functions associated with the training speech data and the cumulative distribution functions associated with the test speech data; and
applying the nonlinear transformation mapping to speech vectors associated with the test speech data prior to recognition, wherein the speech vectors transformed in accordance with the nonlinear transformation mapping are substantially similar to speech vectors associated with the training speech data;
wherein the nonlinear transformation mapping is represented as (FT)−
1 FA, where FT represents the cumulative distribution functions associated with the training speech data and FA represents the cumulative distribution functions associated with the test speech data.- View Dependent Claims (28, 29)
-
-
30. Apparatus for adapting a speech recognition system to one or more acoustic conditions, the apparatus comprising:
-
at least one processing device operative to;
(i) compute cumulative distribution functions based on dimensions of speech vectors associated with training speech data provided to the speech recognition system;
(ii) compute cumulative distribution functions based on dimensions of speech vectors associated with test speech data provided to the speech recognition system;
(iii) compute a nonlinear transformation mapping based on the cumulative distribution functions associated with the training speech data and the cumulative distribution functions associated with the test speech data; and
(iv) apply the nonlinear transformation mapping to speech vectors associated with the test speech data prior to recognition, wherein the speech vectors transformed in accordance with the nonlinear transformation mapping are substantially similar to speech vectors associated with the training speech data;
wherein the operation of computing the cumulative distribution functions associated with the training speech data further comprises the operations of determining, for each dimension, a maximum value and a minimum value across the training speech data, and uniformly dividing a range associated with the minimum value and the maximum value into non-overlapping intervals. - View Dependent Claims (31)
-
-
32. Apparatus for adapting a speech recognition system to one or more acoustic conditions, the apparatus comprising:
-
at least one processing device operative to;
(i) compute cumulative distribution functions based on dimensions of speech vectors associated with training speech data provided to the speech recognition system;
(ii) compute cumulative distribution functions based on dimensions of speech vectors associated with test speech data provided to the speech recognition system;
(iii) compute a nonlinear transformation mapping based on the cumulative distribution functions associated with the training speech data and the cumulative distribution functions associated with the test speech data; and
(iv) apply the nonlinear transformation mapping to speech vectors associated with the test speech data prior to recognition, wherein the speech vectors transformed in accordance with the nonlinear transformation mapping are substantially similar to speech vectors associated with the training speech data;
wherein the operation of computing the cumulative distribution functions associated with the test speech data further comprises the operation of determining, for each dimension, a maximum value and a minimum value across the test speech data. - View Dependent Claims (33, 34)
-
-
35. Apparatus for adapting a speech recognition system to one or more acoustic conditions, the apparatus comprising:
-
at least one processing device operative to;
(i) compute cumulative distribution functions based on dimensions of speech vectors associated with training speech data provided to the speech recognition system;
(ii) compute cumulative distribution functions based on dimensions of speech vectors associated with test speech data provided to the speech recognition system;
(iii) compute a nonlinear transformation mapping based on the cumulative distribution functions associated with the training speech data and the cumulative distribution functions associated with the test speech data; and
(iv) apply the nonlinear transformation mapping to speech vectors associated with the test speech data prior to recognition, wherein the speech vectors transformed in accordance with the nonlinear transformation mapping are substantially similar to speech vectors associated with the training speech data;
wherein the nonlinear transformation mapping is represented as (FT)−
1 FA, where FT represents the cumulative distribution functions associated with the training speech data and FA represents the cumulative distribution functions associated with the test speech data.- View Dependent Claims (36, 37)
-
Specification