Microphone array based speech recognition system and target speech extracting method of the system
First Claim
1. A microphone-array-based speech recognition system comprising:
a signal separator configured to separate mixed signals input through a plurality of microphones into sound-source signals by an ICA algorithm;
a target speech extractor configured to extract one target speech spoken for speech recognition from the sound-source signals separated by the signal separator; and
a speech recognition unit configured to recognize a desired speech from the extracted target speech, wherein the target speech extractor is configured to extract feature vector sequences from the separated sound-source signals, calculate logarithm likelihood ratios (LLRs) of the extracted feature vector sequences, calculate a maximum value by using the calculated LLRs, compare the maximum value with a predetermined threshold value, and determine the maximum value to be the target speech if the maximum value is larger than the threshold value.
Abstract
A microphone-array-based speech recognition system using blind source separation (BSS) and a target speech extraction method for the system are provided. The speech recognition system performs an independent component analysis (ICA) to separate mixed signals input through a plurality of microphones into sound-source signals, extracts one target speech spoken for speech recognition from the separated sound-source signals by using a Gaussian mixture model (GMM) or a hidden Markov model (HMM), and automatically recognizes a desired speech from the extracted target speech. Accordingly, it is possible to obtain a high speech recognition rate even in a noisy environment.
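As an illustration of the separation step described in the abstract, the following is a minimal sketch and not the patented implementation. It assumes only two microphones, an instantaneous (non-convolutive) mixture, and a simple kurtosis-based contrast searched over rotation angles after whitening. Real microphone-array recordings are usually convolutive and would need a more elaborate ICA. All function names and the demo mixing coefficients are hypothetical.

```python
import math
import random


def center(x):
    """Remove the mean from one channel."""
    mean = sum(x) / len(x)
    return [v - mean for v in x]


def whiten(x1, x2):
    """Decorrelate two zero-mean channels and scale each to unit variance."""
    n = len(x1)
    a = sum(u * u for u in x1) / n               # var(x1)
    b = sum(u * v for u, v in zip(x1, x2)) / n   # cov(x1, x2)
    d = sum(v * v for v in x2) / n               # var(x2)
    # Rotation angle that diagonalizes the symmetric 2x2 covariance matrix.
    theta = 0.5 * math.atan2(2.0 * b, a - d)
    c, s = math.cos(theta), math.sin(theta)
    p1 = [c * u + s * v for u, v in zip(x1, x2)]
    p2 = [-s * u + c * v for u, v in zip(x1, x2)]
    s1 = math.sqrt(sum(v * v for v in p1) / n)
    s2 = math.sqrt(sum(v * v for v in p2) / n)
    return [v / s1 for v in p1], [v / s2 for v in p2]


def excess_kurtosis(y):
    """Excess kurtosis of a zero-mean signal (0 for a Gaussian)."""
    n = len(y)
    m2 = sum(v * v for v in y) / n
    m4 = sum(v ** 4 for v in y) / n
    return m4 / (m2 * m2) - 3.0


def separate_two_channels(x1, x2, steps=90):
    """Toy two-channel ICA: whiten, then pick the rotation that makes
    both outputs as non-Gaussian as possible (largest |kurtosis| sum)."""
    z1, z2 = whiten(center(x1), center(x2))
    best_score, best_pair = -1.0, (z1, z2)
    for k in range(steps):
        # Angles in [0, pi/2) suffice: larger ones only swap or flip outputs.
        phi = (math.pi / 2.0) * k / steps
        c, s = math.cos(phi), math.sin(phi)
        y1 = [c * u + s * v for u, v in zip(z1, z2)]
        y2 = [-s * u + c * v for u, v in zip(z1, z2)]
        score = abs(excess_kurtosis(y1)) + abs(excess_kurtosis(y2))
        if score > best_score:
            best_score, best_pair = score, (y1, y2)
    return best_pair


def correlation(a, b):
    """Pearson correlation, used only to check the demo result."""
    a, b = center(a), center(b)
    num = sum(u * v for u, v in zip(a, b))
    den = math.sqrt(sum(u * u for u in a) * sum(v * v for v in b))
    return num / den


def make_demo_mixture(n=2000, seed=0):
    """Two synthetic sources mixed into two hypothetical microphone signals."""
    rng = random.Random(seed)
    s1 = [rng.uniform(-1.0, 1.0) for _ in range(n)]   # sub-Gaussian source
    s2 = [rng.expovariate(1.0) * rng.choice((-1.0, 1.0))
          for _ in range(n)]                          # Laplacian source
    x1 = [1.0 * u + 0.6 * v for u, v in zip(s1, s2)]  # microphone 1
    x2 = [0.4 * u + 1.0 * v for u, v in zip(s1, s2)]  # microphone 2
    return (s1, s2), (x1, x2)
```

Calling `separate_two_channels(x1, x2)` on the demo mixture returns two signals that should each track one original source, up to the ordering, sign, and scale ambiguities inherent to ICA. Those ambiguities are why a separate target speech extraction stage is needed to decide which separated signal is the speech to recognize.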
26 Citations
16 Claims
1. A microphone-array-based speech recognition system comprising:
a signal separator configured to separate mixed signals input through a plurality of microphones into sound-source signals by an ICA algorithm;
a target speech extractor configured to extract one target speech spoken for speech recognition from the sound-source signals separated by the signal separator; and
a speech recognition unit configured to recognize a desired speech from the extracted target speech, wherein the target speech extractor is configured to extract feature vector sequences from the separated sound-source signals, calculate logarithm likelihood ratios (LLRs) of the extracted feature vector sequences, calculate a maximum value by using the calculated LLRs, compare the maximum value with a predetermined threshold value, and determine the maximum value to be the target speech if the maximum value is larger than the threshold value.
Dependent claims: 2, 3, 4, 5, 6
7. A target speech extraction method for a microphone-array-based speech recognition system, comprising:
separating mixed signals input through a plurality of microphones into sound-source signals by an ICA;
extracting one target speech spoken for speech recognition from the separated sound-source signals; and
recognizing a desired speech from the extracted target speech,
wherein the extracting of the target speech comprises:
extracting a feature vector sequence Xi from the separated sound-source signals;
calculating an i-th LLR (logarithm likelihood ratio) LLRi of the extracted feature vector sequence;
calculating a maximum value using the LLRi;
comparing the maximum value with a predetermined threshold value; and
determining the maximum value to be the target speech when the maximum value is larger than the threshold value.
Dependent claims: 8, 9, 10, 11, 12, 13, 14, 15, 16
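The extraction steps recited in claim 7 can be sketched as follows. This is an illustrative sketch under assumptions the claim text does not state: each LLRi is taken to be the average per-frame log-likelihood of Xi under a target-speech GMM minus that under a background GMM, both models are diagonal-covariance GMMs given as lists of hypothetical (weight, mean, variance) tuples, and the feature vector sequences are assumed to be already extracted. The phrase "determining the maximum value to be the target speech" is read here as selecting the source whose LLR is the maximum. All function names are hypothetical.

```python
import math


def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(-0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
               for xi, m, v in zip(x, mean, var))


def gmm_avg_loglik(frames, gmm):
    """Average per-frame log-likelihood under a GMM.

    gmm is a list of (weight, mean, var) tuples (assumed model format).
    """
    total = 0.0
    for x in frames:
        logs = [math.log(w) + log_gauss_diag(x, m, v) for w, m, v in gmm]
        peak = max(logs)
        # Log-sum-exp over mixture components, for numerical stability.
        total += peak + math.log(sum(math.exp(l - peak) for l in logs))
    return total / len(frames)


def select_target(feature_seqs, target_gmm, background_gmm, threshold):
    """Return (index of the target source or None, list of LLRs)."""
    # One LLR per separated source (assumed definition: target minus background).
    llrs = [gmm_avg_loglik(X, target_gmm) - gmm_avg_loglik(X, background_gmm)
            for X in feature_seqs]
    # Find the source with the maximum LLR.
    best = max(range(len(llrs)), key=llrs.__getitem__)
    # Accept it only if the maximum exceeds the predetermined threshold.
    if llrs[best] > threshold:
        return best, llrs
    return None, llrs
```

For example, with a target model centered at the origin and a background model centered elsewhere, a separated source whose frames lie near the origin receives the largest LLR and is selected, while a threshold set above every LLR causes `select_target` to return `None` as the index, meaning no separated source is accepted as target speech.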
Specification