Mapping and boosting of terms in a format independent data retrieval query
First Claim
Patent Images
1. A method for retrieving data, comprising:
- analyzing each query term of a format-independent query for data retrieval to determine whether the query term corresponds to at least one of;
a domain keyword or a formula designator;
wherein in the case that at least one domain keyword is retrieved as corresponding to the query term, mapping the retrieved domain keyword to the query term such that the at least one retrieved domain keyword is substituted for the query term in the format-independent query,wherein in the case that at least one formula designator is retrieved as corresponding to the query term, mapping a set of formula terms associated with the retrieved formula designator to the query term such that the at least one retrieved formula designator is substituted for the query term in the format-independent query,wherein in the case that both at least one domain keyword and at least one formula designator are retrieved as corresponding to the query term, mapping the set of formula terms associated with the at least one of the retrieved formula designators to the at least one of the retrieved domain keywords of the query term;
creating a unique weighted boosting for each of the retrieved domain keywords, the retrieved formula designators, and the mapped formula terms; and
searching a dataset using a structured search query that includes all weighted boosted terms and any remaining query terms that do not correspond to one of the weighted boosted terms.
1 Assignment
0 Petitions
Accused Products
Abstract
Aspects of the present invention provide a tool for retrieving data in response to a query in any format. In an embodiment, each query term of the query is analyzed to determine whether it corresponds to either a domain keyword or a formula designator. If a formula designator is retrieved, each formula term associated with the formula designator is mapped to a domain keyword. Each of the domain keywords, the formula designators, and the mapped formula terms are boosted. A dataset is searched using a structured search query that includes all boosted terms, as well as any remaining query terms that do not correspond to one of the boosted terms.
46 Citations
25 Claims
-
1. A method for retrieving data, comprising:
-
analyzing each query term of a format-independent query for data retrieval to determine whether the query term corresponds to at least one of;
a domain keyword or a formula designator;wherein in the case that at least one domain keyword is retrieved as corresponding to the query term, mapping the retrieved domain keyword to the query term such that the at least one retrieved domain keyword is substituted for the query term in the format-independent query, wherein in the case that at least one formula designator is retrieved as corresponding to the query term, mapping a set of formula terms associated with the retrieved formula designator to the query term such that the at least one retrieved formula designator is substituted for the query term in the format-independent query, wherein in the case that both at least one domain keyword and at least one formula designator are retrieved as corresponding to the query term, mapping the set of formula terms associated with the at least one of the retrieved formula designators to the at least one of the retrieved domain keywords of the query term; creating a unique weighted boosting for each of the retrieved domain keywords, the retrieved formula designators, and the mapped formula terms; and searching a dataset using a structured search query that includes all weighted boosted terms and any remaining query terms that do not correspond to one of the weighted boosted terms. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system for retrieving data, comprising at least one computer device that performs a method, comprising:
-
analyzing each query term of a format-independent query for data retrieval to determine whether the query term corresponds to at least one of;
a domain keyword or a formula designator;wherein in the case that at least one domain keyword is retrieved as corresponding to the query term, mapping the retrieved domain keyword to the query term such that the at least one retrieved domain keyword is substituted for the query term in the format-independent query, wherein in the case that at least one formula designator is retrieved as corresponding to the query term, mapping a set of formula terms associated with the retrieved formula designator to the query term such that the at least one retrieved formula designator is substituted for the query term in the format-independent query, wherein in the case that both at least one domain keyword and at least one formula designator are retrieved as corresponding to the query term, mapping the set of formula terms associated with the at least one of the retrieved formula designators to the at least one of the retrieved domain keywords of the query term; creating a unique weighted boosting for each of the retrieved domain keywords, the retrieved formula designators, and the mapped formula terms; and searching a dataset using a structured search query that includes all weighted boosted terms and any remaining query terms that do not correspond to one of the weighted boosted terms. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer program product stored on a non-transitory computer readable storage medium, which, when executed performs a method for retrieving data, comprising:
-
analyzing each query term of a format-independent query for data retrieval to determine whether the query term corresponds to at least one of;
a domain keyword or a formula designator;wherein in the case that at least one domain keyword is retrieved as corresponding to the query term, mapping the retrieved domain keyword to the query term such that the at least one retrieved domain keyword is substituted for the query term in the format-independent query, wherein in the case that at least one formula designator is retrieved as corresponding to the query term, mapping a set of formula terms associated with the retrieved formula designator to the query term such that the at least one retrieved formula designator is substituted for the query term in the format-independent query, wherein in the case that both at least one domain keyword and at least one formula designator are retrieved as corresponding to the query term, mapping the set of formula terms associated with the at least one of the retrieved formula designators to the at least one of the retrieved domain keywords of the query term; creating a unique weighted boosting for each of the retrieved domain keywords, the retrieved formula designators, and the mapped formula terms; and searching a dataset using a structured search query that includes all weighted boosted terms and any remaining query terms that do not correspond to one of the weighted boosted terms. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24)
-
-
25. A method for deploying an application for retrieving data, comprising:
providing a computer infrastructure being configured to; analyze each query term of a format-independent query for data retrieval to determine whether the query term corresponds to at least one of;
a domain keyword or a formula designator;wherein in the case that at least one domain keyword is retrieved as corresponding to the query term, map the retrieved domain keyword to the query term such that the at least one retrieved domain keyword is substituted for the query term in the format-independent query, wherein in the case that at least one formula designator is retrieved as corresponding to the query term, map a set of formula terms associated with the retrieved formula designator to the query term such that the at least one retrieved formula designator is substituted for the query term in the format-independent query, wherein in the case that both at least one domain keyword and at least one formula designator are retrieved as corresponding to the query term, map the set of formula terms associated with the at least one of the retrieved formula designators to the at least one of the retrieved domain keywords of the query term; create a unique weighted boosting for each of the retrieved domain keywords, the retrieved formula designator, and the mapped formula terms; and search a dataset using a structured search query that includes all weighted boosted terms and any remaining query terms that do not correspond to one of the weighted boosted terms.
Specification