Decreasing noise sensitivity in speech processing under adverse conditions
Abstract
To perform reliable speech or speaker recognition (e.g., verification or identification) in adverse conditions, such as noisy environments, a noise compensation mechanism increases noise robustness during speech processing by decreasing noise sensitivity. Signal attributes and noise attributes of at least two signal portions including speech may be determined. A distance measure for one signal portion may then be derived by using the signal attributes of both signal portions. In one embodiment, using a Parallel Model Combination (PMC) algorithm, a normalized absolute distance score may be obtained for a noisy speech signal including an utterance. For accurate rejection or acceptance of speech or a speaker (registered speakers or imposters), the normalized absolute distance score may be compared to a dynamic threshold or to one or more speech or speaker profiles.
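One way to picture the flow described in the abstract is the minimal Python sketch below. It is not the patented method. The attribute estimator (treating the quietest samples as the noise floor), the normalization by the average signal level of both portions, and the noise-dependent threshold rule are all assumptions made for illustration, as is every function and parameter name.

```python
from statistics import fmean


def portion_attributes(samples, noise_fraction=0.2):
    """Return (signal_level, noise_level) for one signal portion.

    Assumption: the quietest `noise_fraction` of samples approximates
    the noise floor; the remaining samples approximate the signal level.
    """
    magnitudes = sorted(abs(s) for s in samples)
    k = max(1, int(len(magnitudes) * noise_fraction))
    noise_level = fmean(magnitudes[:k])
    signal_level = fmean(magnitudes[k:]) if len(magnitudes) > k else noise_level
    return signal_level, noise_level


def normalized_abs_distance(features, profile, scale):
    """Mean absolute difference between features and a stored profile,
    divided by `scale` so that scores are comparable across signal levels."""
    raw = fmean(abs(f - p) for f, p in zip(features, profile))
    return raw / max(scale, 1e-12)  # guard against a zero scale


def dynamic_threshold(base, noise_level, signal_level, slack=1.0):
    """Hypothetical rule: loosen the threshold as noise rises relative to signal."""
    return base * (1.0 + slack * noise_level / max(signal_level, 1e-12))


def decide(portion_a, portion_b, features_a, profile, base_threshold=0.5):
    """Score portion A against a profile using attributes of both portions."""
    sig_a, noise_a = portion_attributes(portion_a)
    sig_b, noise_b = portion_attributes(portion_b)
    scale = (sig_a + sig_b) / 2.0        # signal attributes of both portions
    noise = (noise_a + noise_b) / 2.0    # noise attributes of both portions
    score = normalized_abs_distance(features_a, profile, scale)
    threshold = dynamic_threshold(base_threshold, noise, scale)
    return score, threshold, score <= threshold
```

In this sketch, `decide` returns the score, the threshold it was compared against, and an accept/reject flag. A real system would compute the features and the profile from acoustic models rather than take them as plain lists.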
30 Claims
1. A method comprising:
determining signal attributes and noise attributes of at least two signal portions including speech; and
deriving a distance measure for one signal portion by using the signal attributes of both signal portions.
Dependent claims: 2, 3, 4, 5.
6. A method comprising:
extracting from a noisy speech signal an utterance, said noisy speech signal including a first portion with first signal-and-noise attributes and a second portion with second signal-and-noise attributes, wherein said utterance is extracted from the noisy speech signal based on a first model trained on training speech data;
selectively combining across the noisy speech signal the first and second signal-and-noise attributes of both the first and second portions to derive a compensation term for the first model;
deriving a second model by compensating the first model based on the compensation term; and
correcting a mismatch indicative of a noise differential between the first portion and the second portion based on the second model.
Dependent claims: 7, 8, 9, 10.
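The compensation steps recited in claim 6 can be illustrated with a short Python sketch. It uses the log-add approximation commonly associated with Parallel Model Combination, applied to per-band log-spectral means only. This is a named stand-in chosen for illustration, not the procedure the claim requires, and all function names, the equal weighting of the two portions, and the list-of-floats model representation are assumptions.

```python
import math


def combine_noise(noise_first, noise_second):
    """Combine per-band log-noise estimates from two portions.

    Assumption: the two portions are weighted equally, and the average
    is taken in the linear power domain before returning to the log domain.
    """
    return [
        math.log((math.exp(a) + math.exp(b)) / 2.0)
        for a, b in zip(noise_first, noise_second)
    ]


def compensation_term(clean_means, noise_log):
    """Per-band shift implied by the log-add approximation:
    log(exp(mu) + exp(n)) - mu == log1p(exp(n - mu))."""
    return [math.log1p(math.exp(n - mu)) for mu, n in zip(clean_means, noise_log)]


def compensate(clean_means, term):
    """Derive the second (noise-compensated) model means from the first."""
    return [mu + t for mu, t in zip(clean_means, term)]


# Usage: a hypothetical two-band model and two noise estimates.
first_model = [2.0, 1.0]
noise = combine_noise([0.0, 0.5], [0.2, 0.3])
term = compensation_term(first_model, noise)
second_model = compensate(first_model, term)
```

The term is always positive and shrinks toward zero when the noise sits far below the speech level, so bands with little noise leave the first model nearly unchanged. A fuller treatment would also compensate variances and map between cepstral and spectral domains.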
11. An article comprising a medium storing instructions that enable a processor-based system to:
determine signal attributes and noise attributes of at least two signal portions including speech; and
derive a distance measure for one signal portion by using the signal attributes of both signal portions.
Dependent claims: 12, 13, 14, 15.
16. An article comprising a medium storing instructions that enable a processor-based system to:
extract from a noisy speech signal an utterance, said noisy speech signal including a first portion with first signal-and-noise attributes and a second portion with second signal-and-noise attributes, wherein said utterance is extracted from the noisy speech signal based on a first model trained on training speech data;
selectively combine across the noisy speech signal the first and second signal-and-noise attributes of both the first and second portions to derive a compensation term for the first model;
derive a second model by compensating the first model based on the compensation term; and
correct a mismatch indicative of a noise differential between the first portion and the second portion based on the second model.
Dependent claims: 17, 18, 19, 20, 21, 22, 23, 24, 25, 26.
27. An apparatus comprising:
an audio interface to receive at least two signal portions including speech; and
a control unit operably coupled to the audio interface, the control unit to determine signal attributes and noise attributes of the at least two signal portions including speech and to derive a distance measure for one signal portion by using the signal attributes of both signal portions.
Dependent claims: 28.
29. A wireless device comprising:
an audio interface to receive a noisy speech signal including an utterance;
a control unit operably coupled to the audio interface; and
a storage unit operably coupled to the control unit, said control unit enables:
determining signal attributes and noise attributes of at least two signal portions including speech, and deriving a distance measure for one signal portion by using the signal attributes of both signal portions.
Dependent claims: 30.
Specification