Continuous speech recognition apparatus
First Claim
1. Speech recognition apparatus capable of operating in either a learn mode or a recognize mode comprising:
- means for converting audible speech into an electrical signal;
2 Assignments
0 Petitions
Accused Products
Abstract
An apparatus and method wherein speech or other signals are sampled during a time slice of approximately 1/30 second and spectrum analysis is performed on the samples, producing measures of amplitude in several frequency bands with each frequency band being characterized by a binary digit indicating the presence or absence of significant amplitude. The binary digits are collectively referred to as a sonogram. Sonograms for several time slices are then concatenated, randomized and decoded using an n-tuple technique to produce a pattern corresponding to the current speech signal. This pattern is learned by superimposing it on an existing vocabulary entry and is subsequently recognized if it is sufficiently similar to one vocabulary entry and different from all others.
63 Citations
19 Claims
-
1. Speech recognition apparatus capable of operating in either a learn mode or a recognize mode comprising:
means for converting audible speech into an electrical signal; - View Dependent Claims (3, 5, 6, 7, 8, 9, 11, 13, 15, 17, 19)
-
2. signal processing means for sampling a time slice of said electrical signal and for developing a plurality of digital signals representative thereof;
-
spectrum analyzing means for receiving said plurality of digital signals and for developing a series of frequency component signals each of which is indicative of the amplitude of a particular frequency component in said time slice; means for converting said series of frequency component signals into a series of binary digits respectively indicating the presence or absence of significant amplitude in each said frequency component signal; means for pseudo-randomly selecting various ones of said binary digits from a plurality of concatenated series of said digits and for combining the selected binary digits into groups of n data bits, where n is an integer; means for decoding each of said groups of n bits to develop a corresponding binary word of length 2n ; means for storing said binary word when said apparatus is operated in the learn mode; and means for comparing words developed from subsequently input speech to each of said stored words, and for developing an output signal when a predetermined correlation is found to exist between the compared input word and a particular stored word, such output signal indicating that said subsequently input speech has been recognized. - View Dependent Claims (4)
-
-
10. Signal recognition apparatus for recognizing data contained in an analog electrical signal comprising:
-
preprocessing means for converting a segment of the analog signal into a predetermined number of digital signals, each representing the amplitude of a portion of said segment; spectrum analyzing means for receiving said digital signals and for determining the frequency content of the signal segment represented by said digital signals and for developing a series of frequency component signals each corresponding to the relative magnitude of a particular frequency component of said segment; means for converting said series of frequency component signals into an input sonogram comprised of a number of data bits corresponding to said frequency component signals and indicating the presence or absence of component frequencies of significant amplitude; means for concatenating a plurality of said sonograms and for using an n-tuple pattern generating technique to convert the concatenated sonograms into an n-tuple input binary word; means for comparing said input binary word with each of a predetermined number of previously stored binary words and for developing an output signal when a predetermined number of bits of said input binary word correspond with a like number of bits of one of said stored binary words; and means responsive to said output signal for indicating recognition of the data. - View Dependent Claims (12, 14)
-
-
16. A speech recognition method comprising;
-
converting a voice signal into a corresponding analog electrical signal; separating said analog signal into time slices of signal; analyzing each said time slice for frequency content to develop a sonogram comprised of a series of digital characters each of which corresponds to the relative magnitude of a particular frequency component of the time slice; accumulating a plurality of said sonograms; using an n-tuple technique to develop a word pattern from said accumulated sonograms; comparing said word pattern to each of a plurality of previously stored word patterns, each time counting the number of bits in the compared patterns; and generating a recognition indication when the number of matched bits satisfies a predetermined criteria. - View Dependent Claims (18)
-
Specification