Calibration of speech data acquisition path
First Claim
1. A method of calibrating the data acquisition path comprising the steps of:
- applying a voice utterance signal simultaneously to two speech transmission paths where the first path is a high quality reference path and a second test path being the acquisition path;
detecting output power density of the reference path to produce signal YR and detecting output power density of the second test path to produce the power density of the mismatch signal YN;
processing said reference signal YR and said mismatch signal YN to determine channel estimate and noise estimate of the mismatch using equations derived by modeling convolutive HΔ
and additive noise NN as polynomial functions of frequency with one order P for HΔ
and a different order Q for NN, estimating model parameters using maximum likelihood criterion to determine the parameter set and simultaneously solving linear equation for both order P and Q.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system for calibration of a data acquisition path is achieved by applying a voice utterance to a first high quality microphone and reference path and to a test acquisition path including a test microphone such as a lower quality one used in a car. The calibration device includes detecting the power density of the reference signal YR through the reference path and detecting the power density of the signal YN through the acquisition path. A processor processes these signals to provide an output signal representing a noise estimate and channel estimate. The processing uses equation derived by modeling convolutive and additive noise as polynomials with different orders and estimating model parameters using maximum likelihood criterion and simultaneously solving linear equations for the different orders.
12 Citations
7 Claims
-
1. A method of calibrating the data acquisition path comprising the steps of:
-
applying a voice utterance signal simultaneously to two speech transmission paths where the first path is a high quality reference path and a second test path being the acquisition path;
detecting output power density of the reference path to produce signal YR and detecting output power density of the second test path to produce the power density of the mismatch signal YN;
processing said reference signal YR and said mismatch signal YN to determine channel estimate and noise estimate of the mismatch using equations derived by modeling convolutive HΔ
and additive noise NN as polynomial functions of frequency with one order P for HΔ
and a different order Q for NN, estimating model parameters using maximum likelihood criterion to determine the parameter set and simultaneously solving linear equation for both order P and Q. - View Dependent Claims (2, 3)
-
-
4. A method of calibrating the data acquisition path for each utterance comprising the steps of:
-
applying a voice utterance signal simultaneously to two speech transmission paths where the first path is a high quality reference path and a second test path being the acquisition path;
detecting output power density of the reference path to produce signal YR and detecting output power density of the second test path to produce the power density of the mismatch signal YN;
determining for each frame of the utterance the power spectrum for YR and YN;
calculating for all of the frames of the utterance the following A(P×
P)Δ
[A1,A2, . . . AP]t B(P×
Q)Δ
[B1,B2, . . . BP]t C(Q×
P)Δ
[C1,C2, . . . CQ]t=BTD(Q×
Q)Δ
[D1,D2, . . . DQ]t uΔ
[u1,u2, . . . up]t with up=α
(p−
1,YRYN) vΔ
[υ
1,υ
2, . . . υ
Q]t with υ
q=β
(q−
1,YN)for A, B, C, D, u and v;
calculating for the utterance the noise estimate θ
N using the following;
(D−
BtA−
1B)θ
N=v−
BtA−
1u;
and calculating for the utterance the channel estimate θ
H using the following;
θ
H=A−
1(u−
Bθ
N).
-
-
5. A method of calibrating the data acquisition path for each utterance comprising the steps of:
-
applying a voice utterance signal simultaneously to two speech transmission paths where the first path is a high quality reference path a high and a second test path being the acquisition path to produce the mismatch signal YN;
determining for each frame of the utterance the power spectrum for YR and YN;
calculating for all of the frames of each utterance the following A(P×
P)Δ
[A1,A2, . . . AP]t B(P×
Q)Δ
[B1,B2, . . . BP]t C(Q×
P)Δ
[C1,C2, . . . CQ]t=BT D(Q×
Q)Δ
[D1,D2, . . . DQ]t uΔ
[u1,u2, . . . up]t with up=α
(p−
1,YRYN) vΔ
[υ
1,υ
2, . . . υ
Q]t with υ
q=β
(q−
1,YN)for A, B, C, D, u and v;
calculating the noise estimate θ
N and the channel estimate θ
H using the following;
-
-
6. A data acquisition path calibration device comprising:
-
a first speech transmission path including a high quality microphone and pre-A/D processing and a second test acquisition path including a lower quality microphone;
means for detecting for each frame the power spectrum density of an input signal received over said first path to produce a reference signal YR and for detecting for each frame the power spectrum density of said input signal received over said acquisition path to produce mismatch signal YN means for determining the noise estimate and the channel estimate by calculating for each utterance;
A(P×
P)Δ
[A1,A2, . . . AP]t B(P×
Q)Δ
[B1,B2, . . . BP]t C(Q×
P)Δ
[C1,C2, . . . CQ]t=BT D(Q×
Q)Δ
[D1,D2, . . . DQ]t uΔ
[u1,u2, . . . up]t with up=α
(p−
1,YRYN) vΔ
[υ
1,υ
2, . . . υ
Q]t with υ
q=β
(q−
1,YN)for A, B, C, D, u and v;
calculating for the utterance the noise estimate θ
N using the following;
(D−
BtA−
1B)θ
N=v−
BtA−
1u;
and calculating for the utterance the channel estimate θ
H using the following;
θ
H=A−
1(u−
Bθ
N).
-
-
7. A data acquisition path calibration device comprising:
-
a first speech transmission path including a high quality microphone and pre-A/D processing and a second test acquisition path including a lower quality microphone;
means for detecting for each frame the power spectrum density of an input signal received over said first path to produce a reference signal YR and for detecting for each frame the power spectrum density of said input signal received over said acquisition path to produce mismatch signal YN means for determining the noise estimate and the channel estimate by calculating for each utterance;
A(P×
P)Δ
[A1,A2, . . . AP]t B(P×
Q)Δ
[B1,B2, . . . BP]tC(Q×
P)Δ
[C1,C2, . . . CQ]t=BT D(Q×
Q)Δ
[D1,D2, . . . DQ]t uΔ
[u1,u2, . . . u9]t with up=α
(p−
1,YRYN) vΔ
[υ
1,υ
2, . . . υ
Q]t with υ
q=β
(q−
1,YN)for A, B, C, D, u and v;
calculating the noise estimate θ
N and the channel estimate θ
H using the following;
-
Specification