System and methods for improving accuracy of speech recognition
First Claim
1. A speech recognition system for providing a textual output from an audible signal representative of spoken words, said system comprising:
- a natural language processor for parsing a partially recognized sentence into a sentence type and an associated ordered list of recognized words and unrecognized sound groupings, said sentence type having an ordered list of concepts, said partially recognized sentence corresponding to the audible signal;
a grammar rule generator for expanding each of said concepts at a location corresponding to one of said unrecognized sound groupings into a plurality of related words;
a speech recognition engine for converting the audible signal to the textual output, said speech recognition engine being operatively connected to said plurality of related words for resolving the one of said unrecognized sound grouping.
1 Assignment
0 Petitions
Accused Products
Abstract
The invention provides a system and method for improving speech recognition. A computer software system is provided for implementing the system and method. A user of the computer software system may speak to the system directly and the system may respond, in spoken language, with an appropriate response. Grammar rules may be generated automatically from sample utterances when implementing the system for a particular application. Dynamic grammar rules may also be generated during interaction between the user and the system. In addition to arranging searching order of grammar files based on a predetermined hierarchy, a dynamically generated searching order based on history of contexts of a single conversation may be provided for further improved speech recognition. Dialogue between the system and the user of the system may be recorded and extracted for use by a speech recognition engine to refine or create language models so that accuracy of speech recognition relevant to a particular knowledge area may be improved.
-
Citations
35 Claims
-
1. A speech recognition system for providing a textual output from an audible signal representative of spoken words, said system comprising:
-
a natural language processor for parsing a partially recognized sentence into a sentence type and an associated ordered list of recognized words and unrecognized sound groupings, said sentence type having an ordered list of concepts, said partially recognized sentence corresponding to the audible signal;
a grammar rule generator for expanding each of said concepts at a location corresponding to one of said unrecognized sound groupings into a plurality of related words;
a speech recognition engine for converting the audible signal to the textual output, said speech recognition engine being operatively connected to said plurality of related words for resolving the one of said unrecognized sound grouping.
-
-
2. A system for improving recognition accuracy of an audible signal representative of spoken words, the audible signal being converted to a textual output by a speech recognition engine, said system comprising:
-
a natural language processor for parsing a sentence in a textual format into an ordered list of keywords;
a grammar rule generator for expanding each keyword of said ordered list into a plurality of related words to obtain a grammar rule from said ordered list of keywords;
wherein said speech recognition engine is operatively connected to said grammar rule for resolving unrecognized sound groupings in the audible signal into the corresponding spoken words in the textual output. - View Dependent Claims (3, 4, 5)
-
-
6. A method of generating a grammar rule for use by a speech recognition engine, said method comprising the steps of:
-
parsing a sample sentence using a natural language processor into an ordered list of keywords;
matching each keyword of said ordered list to a concept using a concept to keywords mapping; and
producing the grammar rule from said ordered list by replacing each said concept with a list of keywords using the concept to keywords mapping. - View Dependent Claims (7, 8, 9)
-
-
10. A speech recognition method for resolving unrecognized sound groups in a partially recognized speech using concept to keywords mappings and sentence types, each sentence type having a plurality of associated grammar rules, the grammar rules being expressed in concepts, said method comprising the steps of:
-
parsing the partially recognized speech using a natural language processor into a pre-determined sentence type and an associated ordered list of recognized words and the unrecognized sound groups;
selecting a list of grammar rules associated with the sentence type from a plurality of grammar rules, each grammar rule of said list having a plurality of constituent concepts, each of said constituent concepts corresponding to one of the recognized words and the unrecognized sound groups;
for each said unrecognized sound group, merging said corresponding constituent concepts in all said selected grammar rules into a list of concepts;
expanding said list of merged concepts using the concept to keywords mappings to produce a list of candidate words; and
resolving each said unrecognized sound group using the list of candidate words. - View Dependent Claims (11, 12, 13, 14, 15)
-
-
16. A method for generating a dynamic grammar file for use by a speech recognition engine to resolve unrecognized sound groups in a speech using concept to keywords mappings and sentence types, each sentence type having a plurality of associated grammar rules, the grammar rules being expressed in concepts, said method comprising the steps of:
-
parsing the partially recognized speech using a natural language processor into a pre-determined sentence type and an associated ordered list of recognized words and the unrecognized sound groups;
selecting a list of grammar rules associated with the sentence type from a plurality of grammar rules, each grammar rule of said list having a plurality of constituent concepts, each of said constituent concepts corresponding to one of the recognized words and the unrecognized sound groups;
for each said unrecognized sound group, merging said corresponding constituent concepts in all said selected grammar rules into a list of concepts; and
generating the dynamic grammar rule from said ordered list by replacing each concept of said list of merged concepts with a list of keywords using the concept to keywords mappings. - View Dependent Claims (17, 18, 19, 20, 21, 22, 23)
-
-
24. A method of speech recognition, said method comprising the steps of:
-
preparing a plurality of grammar rules, each of said plurality of grammar rules having a context designation assigned thereto;
determining a conversation context of a speech being recognized by a speech recognition engine and recording said conversation context in a context history;
if said conversation context corresponds to one of said context designations, assigning a ranking order to said context designation in a search sequence as a function of said context history; and
directing said speech recognition engine to search said plurality of grammar rules following said search sequence. - View Dependent Claims (25, 26, 27)
-
-
28. A method of compiling a corpus for use by a language model generator, said method comprising the steps of:
-
storing text of user input from a user and response to said user input generated by a knowledge base system in a log file;
extracting a thread of conversation between said user and said knowledge base system, said thread of conversation containing literal texts of said user input and said system response; and
adding said thread of conversation to said corpus. - View Dependent Claims (29, 30, 31)
-
-
32. A method for improving recognition accuracy of a speech recognition system, the speech recognition system having a speech recognition engine for converting audible signal representative of spoken words into a textual output, the method comprising the steps of:
-
selecting a first plurality of concepts;
preparing a second plurality of concept to keywords mappings, each concept of said first plurality of concepts having at least one concept to keywords mapping;
defining a third plurality of sentence types, each sentence type being associated with an ordered list of concepts, said ordered list of concepts being formed from said first plurality of concepts;
providing said first plurality of concepts, said second plurality of concept to keywords mappings and said third plurality of sentence types, together with said associated ordered lists of concepts, to the speech recognition system for resolving unrecognized sound groupings in the audible signal. - View Dependent Claims (33, 34, 35)
-
Specification