Method of speech detection
First Claim
Patent Images
1. A method of detecting speech in noisy signals, comprising the steps of:
- sampling plural speech frames including plural noise frames, at least one voiced frame and additional plural noise frames after said at least one voiced frame;
identifying said at least one voiced frame;
identifying said plural noise frames preceding said at least one voiced frame;
constructing an autoregressive model of noise and a mean noise spectrum based on said plural noise frames preceding said at least one voiced frame;
bleaching said plural noise frames preceding said at least one voiced frame by using a rejector filter;
removing noise by spectral noise removal from said plural noise frames preceding said at least one voiced frame;
finding an actual start of speech in the bleached plural noise frames;
extracting acoustic vectors used by a voice recognition system from the plural noise-removed frames lying between the actual start of speech and a first of said at least one voiced frame;
removing noise from and parameterizing said at least one voiced frame;
finding an actual end of speech; and
removing noise and parameterizing frames lying between a last of said at least one voiced frame and the actual end of speech.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for detecting the start and end of speech from a noisy signal including the steps of:
detecting a voiced frame;
searching for noise frames preceding this voiced frame;
constructing an autoregressive model of the noise and a mean noise spectrum;
bleaching the flames preceding the voicing,
searching for the actual start of speech in the bleached frames;
removing the noise from the voiced frames and parameterizing them; and
searching for the actual end of speech.
-
Citations
11 Claims
-
1. A method of detecting speech in noisy signals, comprising the steps of:
-
sampling plural speech frames including plural noise frames, at least one voiced frame and additional plural noise frames after said at least one voiced frame; identifying said at least one voiced frame; identifying said plural noise frames preceding said at least one voiced frame; constructing an autoregressive model of noise and a mean noise spectrum based on said plural noise frames preceding said at least one voiced frame; bleaching said plural noise frames preceding said at least one voiced frame by using a rejector filter; removing noise by spectral noise removal from said plural noise frames preceding said at least one voiced frame; finding an actual start of speech in the bleached plural noise frames; extracting acoustic vectors used by a voice recognition system from the plural noise-removed frames lying between the actual start of speech and a first of said at least one voiced frame; removing noise from and parameterizing said at least one voiced frame; finding an actual end of speech; and removing noise and parameterizing frames lying between a last of said at least one voiced frame and the actual end of speech. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
Specification