System and device for advanced voice recognition word spotting
First Claim
Patent Images
1. A speech recognition system that pre-processes speech data and inputs the pre-processed speech data in several forms to a voice recognition engine for determination of the most likely spoken word, comprising:
- audio input means to input audio data comprised of phonemes;
phoneme identification means to detect individual phonemes;
wave segment grouping means to group phonemes into wave segments;
a wave segment pre-processor to select groups of wave segments having at least one wave segment and output the selected wave segment groups to a speech recognition engine; and
a speech recognition engine having means to compare wave segments groups output by the wave segment pre-processor with a predetermined list of words;
means to determine which wave segment groups match entries in the predetermined list of words;
whereby wave segments are pre-processed into wave segment groups and analyzed to determine which of the wave segment groups most likely represent words that are input to the speech recognition engine.
0 Assignments
0 Petitions
Accused Products
Abstract
A speaker independent, continuous speech, word spotting voice recognition system and method. The edges of phonemes in an utterance are quickly and accurately isolated. The utterance is broken into wave segments based upon the edges of the phonemes. A voice recognition engine is consulted multiple times for several wave segments and the results are analyzed to correctly identify the words in the utterance.
-
Citations
20 Claims
-
1. A speech recognition system that pre-processes speech data and inputs the pre-processed speech data in several forms to a voice recognition engine for determination of the most likely spoken word, comprising:
-
audio input means to input audio data comprised of phonemes; phoneme identification means to detect individual phonemes; wave segment grouping means to group phonemes into wave segments; a wave segment pre-processor to select groups of wave segments having at least one wave segment and output the selected wave segment groups to a speech recognition engine; and a speech recognition engine having means to compare wave segments groups output by the wave segment pre-processor with a predetermined list of words; means to determine which wave segment groups match entries in the predetermined list of words; whereby wave segments are pre-processed into wave segment groups and analyzed to determine which of the wave segment groups most likely represent words that are input to the speech recognition engine. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method of recognizing continuous speech voice input with a computer, including the steps of:
-
inputting audio data comprised of phonemes; identifying individual phonemes; grouping phonemes into wave segments; selecting groups of wave segments that each contain at least one wave segment with a wave segment pre-processor by first comparing the wave segments with a table of preselected valid words and then only outputting those wave segment groups which correspond to wave segments that are stored in a table of valid words to a speech recognition engine; whereby wave segments are pre-processed into wave segment groups and analyzed to determine which of the wave segment groups most likely represent words that are input to the speech recognition engine. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A method of word spotting in a speech recognition system, including the steps of:
-
identifying groups of phonemes which represent possible valid words; comparing the possible valid words with a preselected list of valid words; selecting words which correspond to an entry in the list of valid words; and outputting the selected words to a speech recognition engine; whereby groups of phonemes are pre-processed such that only selected groups of phonemes are input to the voice recognition engine. - View Dependent Claims (17, 18, 19)
-
-
20. A method of using rules based responses in a speech recognition system, including the steps of:
-
inputting audio data from a user into a computer; identifying a keyword in the data input by the user; comparing the keyword with a list of valid words in a database; selecting the keyword when the keyword corresponds to only a valid word in the database; entering the selected keyword to a speech recognition engine only when the keyword matches an entry in the list of valid words; querying the user with a list of choices when the keyword corresponds to more than one valid word in the database; querying the user to reenter the audio data when the keyword does not correspond to a valid word in the database.
-
Specification