Robust parameters for noisy speech recognition
First Claim
1. A method of automatic processing of noise-affected speech comprising at least the following steps:
- capture and digitising of the speech in the form of at least one digitised signal (1), extraction of several time-based sequences or frames (15), corresponding to said signal, by means of an extraction system (10), decomposition of each frame (15) by means of an analysis system (20, 40) into at least two different frequency bands so as to obtain at least two first vectors of representative parameters (45) for each frame (15), one for each frequency band, and conversion, by means of converter systems (50), of the first vectors of representative parameters (45) into second vectors of parameters relatively insensitive to noise (55), each converter system (50) being associated with one frequency band and converting the first vector of representative parameters (45) associated with said same frequency band, and the learning of said converter systems (50) being achieved on the basis of a learning corpus which corresponds to a corpus of speech contaminated by noise (102).
2 Assignments
0 Petitions
Accused Products
Abstract
The present invention relates to a method of automatic processing of noise-affected speech comprising at least the following steps:
capture and digitising of the speech in the form of at least one digitised signal (1),
extraction of several time-based sequences or frames (15), corresponding to said signal, by means of an extraction system (10),
decomposition of each frame (15) by means of an analysis system (20, 40) into at least two different frequency bands so as to obtain at least two first vectors of representative parameters (45) for each frame (15), one for each frequency band, and
conversion, by means of converter systems (50), of the first vectors of representative parameters (45) into second vectors of parameters relatively insensitive to noise (55), each converter system (50) being associated with one frequency band and converting the first vector of representative parameters (45) associated with said same frequency band, and
the learning of said converter systems (50) being achieved on the basis of a learning corpus which corresponds to a corpus of speech contaminated by noise (102).
19 Citations
12 Claims
-
1. A method of automatic processing of noise-affected speech comprising at least the following steps:
-
capture and digitising of the speech in the form of at least one digitised signal (1), extraction of several time-based sequences or frames (15), corresponding to said signal, by means of an extraction system (10), decomposition of each frame (15) by means of an analysis system (20, 40) into at least two different frequency bands so as to obtain at least two first vectors of representative parameters (45) for each frame (15), one for each frequency band, and conversion, by means of converter systems (50), of the first vectors of representative parameters (45) into second vectors of parameters relatively insensitive to noise (55), each converter system (50) being associated with one frequency band and converting the first vector of representative parameters (45) associated with said same frequency band, and the learning of said converter systems (50) being achieved on the basis of a learning corpus which corresponds to a corpus of speech contaminated by noise (102). - View Dependent Claims (2, 3, 4, 5, 6, 10, 11, 12)
-
-
7. An automatic speech-processing system comprising at least:
-
an acquisition system for obtaining at least one digitised speech signal (1), an extraction system (10), for extracting several time-based sequences or frames (15) corresponding to said signal (1), means (20, 40) for decomposing each frame (15) into at least two different frequency bands so as to obtain at least two first vectors of representative parameters (45), one vector for each frequency band, and several converter systems (50), each converter system (50) being associated with one frequency band and making it possible to convert the first vector of representative parameters (45) associated with this same frequency band into a second vector of parameters which are relatively insensitive to noise (55), and the learning by the said converter systems (50) being achieved on the basis of a corpus of speech corrupted by noise (102). - View Dependent Claims (8, 9)
-
Specification