Method and apparatus for extracting information-bearing portions of a signal for recognizing varying instances of similar patterns
Abstract
Speech signals are analyzed by correlating a sequence of samples to derive a sliding average magnitude difference function (SAMDF) whereby histograms are formed which are compressed and normalized to form histogram sequences representing the speech signal for comparison and recognition.
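For illustration only, the correlation step can be pictured with the conventional average magnitude difference function (AMDF), which compares a frame of samples with a delayed copy of itself at each lag. The sliding variant (SAMDF) named in the abstract is defined in the specification and may differ in detail. The function name `amdf`, the parameter `max_lag`, and the treatment of the resulting vector of lag values as one "histogram" are assumptions of this sketch.

```python
import numpy as np

def amdf(frame, max_lag):
    """Textbook AMDF for lags 1..max_lag (an assumed stand-in for the SAMDF)."""
    frame = np.asarray(frame, dtype=float)
    n = len(frame) - max_lag  # number of sample pairs compared at every lag
    if n <= 0:
        raise ValueError("frame must be longer than max_lag")
    # One measurement per lag: mean absolute difference between the
    # frame and its copy delayed by `lag` samples.
    return np.array([
        np.mean(np.abs(frame[:n] - frame[lag:lag + n]))
        for lag in range(1, max_lag + 1)
    ])
```

A periodic input produces dips at lags equal to multiples of its period, which is why a vector of this kind carries pitch and formant related structure.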
38 Claims
1. A method for processing an acoustic input speech signal for extraction of individual utterances comprising the steps of:
(a) converting said speech signal into a first and second sequence of speech related samples;
(b) correlating the first sequence of speech related samples to derive a first histogram representing the input speech signal;
(c) correlating the second sequence of speech related samples to derive a second histogram representing the input speech signal;
(d) compressing the first and second histograms to derive a plurality of spaced channels;
(e) generating a compression histogram representing at least a part of the input speech signal from the spaced channels;
(f) repeating steps (a)-(e) to generate a sequence of compression histograms, said sequence of compression histograms representing a transformation of the input speech signal;
(g) identifying end points for each utterance in the sequence of compression histograms; and
(h) extracting individual utterances from the sequence of compression histograms between the identified utterance end points.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
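Steps (g) and (h) of claim 1 can be sketched as follows, purely by way of example. The claim does not say how end points are identified; this sketch assumes a simple rule in which a frame is "active" when the sum of its channel values exceeds a threshold. The names `find_utterances`, `extract_utterances`, `threshold`, and `min_len` are hypothetical.

```python
import numpy as np

def find_utterances(frames, threshold, min_len=1):
    """Return (start, end) frame index pairs, end exclusive.

    Assumed rule: a frame is active when the sum of its channel
    values exceeds `threshold`; runs shorter than `min_len` are dropped.
    """
    frames = np.asarray(frames, dtype=float)
    activity = frames.sum(axis=1) > threshold
    segments, start = [], None
    for i, active in enumerate(activity):
        if active and start is None:
            start = i                      # candidate start end point
        elif not active and start is not None:
            if i - start >= min_len:
                segments.append((start, i))
            start = None
    # Close a run that is still open at the end of the sequence.
    if start is not None and len(activity) - start >= min_len:
        segments.append((start, len(activity)))
    return segments

def extract_utterances(frames, segments):
    """Slice the histogram sequence between the identified end points."""
    frames = np.asarray(frames, dtype=float)
    return [frames[s:e] for s, e in segments]
```

Each returned slice is itself a sequence of compression histograms and could be passed on to a recognition stage.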
10. A method for processing a sequence of histograms representing a transformation of an extracted portion of a time varying input signal for recognition of certain signal patterns within said extracted portion, each histogram in the sequence of histograms having a plurality of channels, each channel comprising a plurality of measurements, said method for processing comprising the steps of:
generating and storing a plurality of identification templates, each identification template representing a signal pattern to be identified;
time normalizing the sequence of histograms;
amplitude normalizing the sequence of histograms;
generating and storing a test template from the time and amplitude normalized histogram sequence; and
comparing the identification templates with the test template for a match to identify the signal pattern.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22
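One possible reading of the normalization and comparison steps of claim 10 is sketched below. The claim does not fix the normalization or the distance measure, so the choices here are assumptions: linear interpolation to a fixed number of frames for time normalization, scaling by the peak value for amplitude normalization, and a sum of absolute differences for comparison. All function names and the default `n_frames=16` are hypothetical.

```python
import numpy as np

def time_normalize(seq, n_frames):
    """Resample a (frames x channels) sequence to n_frames by linear interpolation."""
    seq = np.asarray(seq, dtype=float)
    src = np.linspace(0.0, 1.0, len(seq))
    dst = np.linspace(0.0, 1.0, n_frames)
    return np.column_stack(
        [np.interp(dst, src, seq[:, c]) for c in range(seq.shape[1])]
    )

def amplitude_normalize(seq):
    """Scale the sequence so that its largest magnitude is 1 (assumed rule)."""
    seq = np.asarray(seq, dtype=float)
    peak = np.abs(seq).max()
    return seq / peak if peak > 0 else seq

def make_template(seq, n_frames=16):
    """Build a template from a time and amplitude normalized sequence."""
    return amplitude_normalize(time_normalize(seq, n_frames))

def best_match(test_template, identification_templates):
    """Return the name of the closest stored template and all distances."""
    distances = {
        name: float(np.abs(test_template - tpl).sum())
        for name, tpl in identification_templates.items()
    }
    return min(distances, key=distances.get), distances
```

Under these assumptions, an utterance spoken more slowly or more loudly than the stored pattern maps to nearly the same template, which is the purpose of normalizing before comparison.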
23. A method for extracting the information bearing portions of an acoustic speech signal comprising the steps of:
(a) digitizing the acoustic speech signal to produce a plurality of sequences of speech samples;
(b) correlating each sequence of speech samples to derive a histogram comprising a plurality of measurements;
(c) compressing the plurality of measurements for the histogram to generate a compression histogram representing at least a part of the acoustic speech signal, said step of compressing comprising the step of averaging selected measurements for the histogram to generate the measurements that comprise the compression histogram; and
(d) repeating steps (a)-(c) to output a sequence of compression histograms representing a transformation of the acoustic speech signal.
Dependent claims: 24, 25, 26, 27
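The averaging recited in step (c) of claim 23 might look like the following minimal sketch. Which measurements are selected for each output value is not stated in the claim, so the `groups` argument (tuples of measurement indices) and the function name `compress_histogram` are hypothetical.

```python
import numpy as np

def compress_histogram(histogram, groups):
    """Average selected measurements into one value per group.

    `groups` is an assumed list of index tuples; each tuple names the
    measurements that are averaged into one compressed measurement.
    """
    histogram = np.asarray(histogram, dtype=float)
    return np.array([histogram[list(g)].mean() for g in groups])
```

For example, `compress_histogram([1, 3, 5, 7, 9, 11], [(0, 1), (2, 3), (4, 5)])` reduces six measurements to three, and groups that skip indices give spaced channels.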
28. A method for processing a sequence of histograms representing information bearing portions of an acoustic speech signal for recognition of individual utterances comprising the steps of:
storing a plurality of identification templates representing the individual utterances to be recognized;
time and amplitude normalizing the sequence of histograms;
generating a test template from the time and amplitude normalized sequence of histograms; and
comparing the test template to the identification templates for matching and recognition.
Dependent claims: 29, 30, 31, 32, 33, 34, 35
36. Apparatus for processing an acoustic speech signal for recognition of individual utterances comprising:
(a) means for converting said speech signal into a first and second sequence of data samples;
(b) means for correlating the first sequence of data samples into a first histogram representing the input speech signal, said first histogram comprising a plurality of data measurements;
(c) means for correlating the second sequence of data samples into a second histogram representing the input speech signal, said second histogram comprising a plurality of data measurements;
(d) means for selectively compressing the plurality of data measurements in the first and second histograms into a plurality of data channels, each data channel comprised of a plurality of data measurements, the total number of measurements in all channels being less than the total measurements in said first and second histograms; and
(e) means for repeating steps (a)-(d) to produce a sequence of histograms within each data channel, said sequence of histograms representing a transformation of the speech signal.
Dependent claims: 37
38. Apparatus for processing a sequence of histograms representing a transformation of an utterance extracted from an acoustic speech signal, comprising:
means for storing a vocabulary template representing the utterance to be processed;
means for time and amplitude normalizing the sequence of histograms to generate a test template representing the extracted utterance; and
means for comparing the vocabulary template to the test template for matching and recognition.
Specification