Processing speech to text queries by optimizing conversion of speech queries to text
First Claim
Patent Images
1. A computing device for improving an accuracy of a generated text query based upon a phrase of an audible natural language query, comprising:
- a storage device;
a processor;
the storage device having instructions that when executed by the processor, cause the computing device to;
receive a plurality of speech to text translation alternatives for the phrase of the audible natural language query;
compare each of the translation alternatives to elements of a static analysis of a known domain including at least one of word lists of the known domain, user matrices of the known domain, and facets, each of which having a limited number of possible values;
identify a respective term in at least one translation alternative from the plurality of translation alternatives based on the comparing;
substitute, for the identified term in each of the at least one translation alternative, a respective known name from the known domain based on a distance measure between the identified term and a closed list of names from the known domain;
tag and parse each of the translation alternatives with generic tags and specific tags for the known domain that is at least partially structured, wherein the tagging and parsing further comprises generating one or more new tags based on previous findings to build up complex expressions incrementally and create new structures from primitive structures, and the tagging and parsing is based on;
the static analysis of the known domain which determines known relationships between known values of the known domain,known tags of the known domain, andcustom rules, each of which includes a respective name of an action to invoke and one or more respective conditions that trigger the action, wherein the one or more generated new tags are available upon creation for use by subsequent rules;
identify a particular translation alternative from the translation alternatives having a section of the phrase that is not tagged and covered by the tagging and parsing;
correct an error due to speech to text translation and associated with the section that is not covered by replacing one or more terms of the section based on a known list of common errors associated with the known domain;
rank the translation alternatives based on the tagging and parsing; and
generate a text query to perform the natural language query by translating the phrase of the natural language query to a machine readable statement based on the ranking, wherein;
the tagging and parsing, the substituting, for each of the identified term in each of the at least one translation alternative, the respective known name from the known domain, the correcting an error due to speech to text translation and the ranking of the translation alternatives improve an accuracy of the generated text query.
2 Assignments
0 Petitions
Accused Products
Abstract
Techniques for processing a speech to text query are described herein. The techniques may include receiving a plurality of speech to text translation alternatives for a phrase of a natural language query, and tagging and parsing each of the translation alternatives based on a static analysis of the known domain that is at least partially structured, known tags of the known domain, and custom rules. The techniques may also include ranking the translation alternatives based on the tagging and parsing and translating the phrase based on the ranking.
52 Citations
12 Claims
-
1. A computing device for improving an accuracy of a generated text query based upon a phrase of an audible natural language query, comprising:
-
a storage device; a processor; the storage device having instructions that when executed by the processor, cause the computing device to; receive a plurality of speech to text translation alternatives for the phrase of the audible natural language query; compare each of the translation alternatives to elements of a static analysis of a known domain including at least one of word lists of the known domain, user matrices of the known domain, and facets, each of which having a limited number of possible values; identify a respective term in at least one translation alternative from the plurality of translation alternatives based on the comparing; substitute, for the identified term in each of the at least one translation alternative, a respective known name from the known domain based on a distance measure between the identified term and a closed list of names from the known domain; tag and parse each of the translation alternatives with generic tags and specific tags for the known domain that is at least partially structured, wherein the tagging and parsing further comprises generating one or more new tags based on previous findings to build up complex expressions incrementally and create new structures from primitive structures, and the tagging and parsing is based on; the static analysis of the known domain which determines known relationships between known values of the known domain, known tags of the known domain, and custom rules, each of which includes a respective name of an action to invoke and one or more respective conditions that trigger the action, wherein the one or more generated new tags are available upon creation for use by subsequent rules; identify a particular translation alternative from the translation alternatives having a section of the phrase that is not tagged and covered by the tagging and parsing; correct an error due to speech to text translation and associated with the section that is not covered by replacing one or more terms of the section based on a known list of common errors associated with the known domain; rank the translation alternatives based on the tagging and parsing; and generate a text query to perform the natural language query by translating the phrase of the natural language query to a machine readable statement based on the ranking, wherein; the tagging and parsing, the substituting, for each of the identified term in each of the at least one translation alternative, the respective known name from the known domain, the correcting an error due to speech to text translation and the ranking of the translation alternatives improve an accuracy of the generated text query. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer program product for improving an accuracy of a generated text query based upon a phrase of an audible natural language query, the computer program product comprising a computer readable storage medium having program code embodied therewith, the program code executable by a processor to perform a method, comprising:
-
receiving a plurality of speech to text translation alternatives for the phrase of the audible natural language query; comparing each of the translation alternatives to elements of a static analysis of a known domain including at least one of word lists of the known domain, user matrices of the known domain, and facets, each of which having a limited number of possible values; identify a respective term in at least one translation alternative from the plurality of translation alternatives based on the comparing; substitute, for the identified term in each of the at least one translation alternative, a respective known name from the known domain based on a distance measure between the identified term and a closed list of names from the known domain; tagging and parsing each of the translation alternatives with generic tags and specific tags for the known domain that is at least partially structured, wherein the tagging and parsing further comprises generating one or more new tags based on previous findings to build up complex expressions incrementally and create new structures from primitive structures, and the tagging and parsing is based on; the static analysis of the known domain which determines known relationships between known values of the known domain, known tags of the known domain, and custom rules, each of which includes a respective name of an action to invoke and one or more respective conditions that trigger the action, wherein the one or more generated new tags are available upon creation for use by subsequent rules; identifying a particular translation alternative from the translation alternatives having a section of the phrase that is not tagged and covered by the tagging and parsing; correcting an error due to speech to text translation and associated with the section that is not covered by replacing one or more terms of the section based on a known list of common errors associated with the known domain; ranking the translation alternatives based on the tagging and parsing; and generating a text query to perform the natural language query by translating the phrase of the natural language query to a machine readable statement based on the ranking, wherein; the tagging and parsing, the substituting, for each of the identified term in each of the at least one translation alternative, the respective known name from the known domain, the correcting an error due to speech to text translation and the ranking of the translation alternatives improve an accuracy of the generated text query. - View Dependent Claims (8, 9, 10, 11, 12)
-
Specification