Speech based query system using semantic decoding
Abstract
An intelligent query system for processing voice-based queries is disclosed, which uses a combination of both statistical and semantic based processing to identify the question posed by the user by understanding the meaning of the user's utterance. Based on identifying the meaning of the utterance, the system selects a single answer that best matches the user's query. The answer that is paired to this single question is then retrieved and presented to the user. The system, as implemented, accepts environmental variables selected by the user and is scalable to provide answers to a variety and quantity of user-initiated queries.
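The pairing described above, in which each stored question points to the answer that is retrieved once that question has been identified, can be pictured with a small data-structure sketch. The class name, field names, and sample entries are illustrative assumptions and do not come from the patent; how the matching question is identified is deliberately left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TopicQueryEntry:
    """One stored question (hypothetical layout, not taken from the patent)."""
    query_id: int
    text: str
    answer_id: int  # every stored question is paired with at least one answer


# Hypothetical stand-ins for the stored questions and their paired answers.
QUERY_FILE = [
    TopicQueryEntry(1, "what is the course fee", 10),
    TopicQueryEntry(2, "when does the course start", 11),
]
ANSWER_FILE = {
    10: "The course fee is listed on the enrollment page.",
    11: "The course starts at the beginning of each term.",
}


def answer_for(matched: TopicQueryEntry) -> str:
    """Return the answer paired with the single question that was identified."""
    return ANSWER_FILE[matched.answer_id]


if __name__ == "__main__":
    print(answer_for(QUERY_FILE[1]))
```

Keeping questions and answers in separate stores linked by an identifier mirrors the separate query and answer files recited in the claims, but any concrete schema is an assumption here.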
773 Citations
17 Claims
1. An interactive system adapted for responding to speech-based queries concerning topics addressed by such interactive system, the system comprising:
a query file for storing a plurality of topic query entries, each topic query entry including a query relating to one or more of the topics covered by the speech-based interactive system; and
an answer file for storing a plurality of topic answer entries, each topic answer entry including an answer to one or more of said plurality of topic query entries, such that each topic query entry has at least one associated topic answer entry; and
a speech recognition system for generating recognized speech data from partially processed speech data associated with a speech-based query concerning one of said topics, said partially processed speech data being received from a remote speech capturing system; and
said speech recognition system further cooperating with a natural language engine which generates recognized speech sentence data corresponding to said speech-based query using both semantic decoding and statistical based processing performed on said recognized speech utterance data;
a query formulation system for converting said recognized speech data into a search query suitable for identifying a topic query entry corresponding to said speech-based query, and for locating at least one topic answer entry best matching said speech-based query.
2. The system of claim 1, wherein said remote speech capturing system is located at a client site, and said speech recognition system is distributed across said client site and a separate server site.
3. The system of claim 1, wherein said speech recognition system is comprised of a first portion at a client based computing system for performing first signal processing operations on a speech input signal to create said partially processed speech data, and a second portion at a server based computing system for performing a second signal processing operation for completing processing of said partially processed speech data.
4. The system of claim 1, wherein said statistical processing is based on calculating noun phrases in said recognized speech utterance data.
5. The system of claim 1, wherein said statistical processing is further based on calculating verb phrases, adjective phrases and adverb phrases in said recognized speech utterance data.
6. The system of claim 1, wherein said semantic decoding is based on a term frequency calculation, which term frequency calculation is based on calculating a lexical distance between each word in said recognized speech utterance data with each word of one or more topic query entries.
7. The system of claim 6, wherein said one or more topic query entries are retrieved by an SQL search.
8. The system of claim 6, wherein semantic decoding further includes a coverage calculation, which coverage calculation is based on identifying a percentage of a number of terms in said recognized speech utterance data which appear in each of said one or more topic query entries.
9. The system of claim 6, wherein semantic decoding further includes a semantic similarity calculation, which semantic similarity calculation is based on identifying a semantic distance between two like parts of speech of said recognized speech utterance data and each of said one or more topic query entries.
10. The system of claim 1, wherein said query formulation system uses context parameters for recognizing said speech-based query.
11. A method of implementing a speech-based interactive query system, including the steps of:
(a) storing a plurality of topic query entries, each topic query entry including a query relating to one or more of topics covered by the speech-based interactive query system; and
(b) storing a plurality of topic answer entries, each topic answer entry including an answer to one or more of said plurality of topic query entries, such that each topic query entry has at least one associated topic answer entry; and
(c) generating recognized speech utterance data associated with a speech-based query concerning one of said topics, such that said recognized speech utterance data is generated by partial recognition processing of said speech-based query by a first signal processing routine executing at a first computing device, and then completing recognition of said speech-based query through processing performed by a second signal processing routine executing at a second computing device; and
(d) converting said recognized speech utterance data with a natural language process into recognized speech sentence data, said recognized speech data being used by a search query suitable for identifying a topic query entry corresponding to said speech-based query; and
(e) locating at least one topic answer entry best matching said speech-based query;
wherein said natural language process includes both semantic decoding and statistical based processing operations performed on said recognized speech utterance data.
12. The method of claim 11, wherein said first computing device is located at a client site, and said second computing device is located at a separate server site.
13. The method of claim 11, wherein said semantic decoding includes a term frequency calculation, which term frequency calculation is based on calculating a lexical distance between each word in said recognized speech utterance data with each word of one or more topic query entries.
14. The method of claim 13, wherein said semantic decoding further includes a coverage calculation, which coverage calculation is based on identifying a percentage of a number of terms in said recognized speech utterance data which appear in each of said one or more topic query entries.
15. The method of claim 14, wherein said semantic decoding further includes a semantic similarity calculation, which semantic similarity calculation is based on identifying a semantic distance between two like parts of speech of said recognized speech utterance data and each of said one or more topic query entries.
16. The method of claim 11, wherein during step (d) context parameters are used for formulating said search query, and said context parameters are used for dynamically determining and loading an appropriate grammar and dictionary file to be used for said speech-based query.
17. The method of claim 11, wherein said response undergoes a text to speech process so that said topic answer entry is expressed in audible form to a user.
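Claims 8 and 14 describe the coverage calculation as the percentage of terms in the recognized utterance that also appear in a stored topic query entry. The arithmetic can be sketched as follows, assuming whitespace tokenization and case folding (neither is specified in the claims). The function names `coverage` and `best_entry` and the sample entries are hypothetical, and ranking by coverage alone is a simplification: the claims combine it with term frequency and semantic similarity calculations that are not shown.

```python
from typing import List, Sequence


def tokenize(text: str) -> List[str]:
    """Split on whitespace after lowercasing (an assumed, simplistic tokenizer)."""
    return text.lower().split()


def coverage(utterance: str, entry: str) -> float:
    """Percentage of utterance terms that also appear in the stored entry."""
    terms = tokenize(utterance)
    if not terms:
        return 0.0  # avoid division by zero for an empty utterance
    entry_terms = set(tokenize(entry))
    hits = sum(1 for term in terms if term in entry_terms)
    return 100.0 * hits / len(terms)


def best_entry(utterance: str, entries: Sequence[str]) -> str:
    """Pick the stored entry with the highest coverage (illustrative ranking only)."""
    return max(entries, key=lambda entry: coverage(utterance, entry))


if __name__ == "__main__":
    stored = ["what is the course fee", "when does the course start"]
    spoken = "when is the fee due"
    for entry in stored:
        print(entry, coverage(spoken, entry))
    print(best_entry(spoken, stored))
```

Dividing by the number of utterance terms, rather than the number of entry terms, follows the claim wording, which counts terms "in said recognized speech utterance data".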
Specification