MULTI-PASS SPEECH ACTIVITY DETECTION STRATEGY TO IMPROVE AUTOMATIC SPEECH RECOGNITION
First Claim
1. A method performed by an automatic speech recognition system, comprising:
- performing at least two passes of speech activity detection on an acoustic utterance uttered by a speaker, the at least two passes including an initial pass and a subsequent pass;
estimating at least one of feature statistics and transforms for acoustic feature extraction and acoustic modeling based on an output of an initial pass; and
performing automatic speech recognition using an output of the subsequent pass while bypassing an output of the initial pass to recognize the acoustic utterance.
1 Assignment
0 Petitions
Accused Products
Abstract
An automatic speech recognition system and a method performed by an automatic speech recognition system are provided. The method includes performing at least two passes of speech activity detection on an acoustic utterance uttered by a speaker. The at least two passes include an initial pass and a subsequent pass. The method further includes estimating at least one of feature statistics and transforms for acoustic feature extraction and acoustic modeling based on an output of an initial pass. The method further includes performing automatic speech recognition using an output of the subsequent pass while bypassing an output of the initial pass to recognize the acoustic utterance.
-
Citations
20 Claims
-
1. A method performed by an automatic speech recognition system, comprising:
-
performing at least two passes of speech activity detection on an acoustic utterance uttered by a speaker, the at least two passes including an initial pass and a subsequent pass; estimating at least one of feature statistics and transforms for acoustic feature extraction and acoustic modeling based on an output of an initial pass; and performing automatic speech recognition using an output of the subsequent pass while bypassing an output of the initial pass to recognize the acoustic utterance. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer program product for automatic speech recognition, the computer program product comprising a non-transitory computer readable storage medium having program instructions embodied therewith, the program instructions executable by a computer to cause the computer to perform a method comprising:
-
performing, by an automatic speech recognition system, at least two passes of speech activity detection on an acoustic utterance uttered by a speaker, the at least two passes including an initial pass and a subsequent pass; estimating, by the automatic speech recognition system, at least one of feature statistics and transforms for acoustic feature extraction and acoustic modeling based on an output of an initial pass; and performing, by the automatic speech recognition system, automatic speech recognition using an output of the subsequent pass while bypassing an output of the initial pass to recognize the acoustic utterance. - View Dependent Claims (18, 19)
-
-
20. An automatic speech recognition system, comprising:
-
a speech activity detector for performing at least two passes of speech activity detection on an acoustic utterance uttered by a speaker, the at least two passes including an initial pass and a subsequent pass, and for estimating at least one of feature statistics and transforms for acoustic feature extraction and acoustic modeling based on an output of an initial pass; and a speech decoder for performing automatic speech recognition using an output of the subsequent pass while bypassing an output of the initial pass to recognize the acoustic utterance.
-
Specification