Expansion of database search queries
Abstract
In some embodiments, a method includes receiving a token that is usable in search queries for data in a database. The database includes items of data that are represented by data strings. The method also includes determining a synonym candidate for the token. The method includes adding the synonym candidate as a synonym for the token into an expansion dictionary in response to a determination that the number of data strings having the synonym candidate exceeds a threshold.
97 Citations
40 Claims
1. A method comprising:
receiving a token that is usable in search queries for data in a database, the database including items of data that are represented by data strings;
determining a synonym candidate for the token; and
adding the synonym candidate as a synonym for the token into an expansion dictionary in response to a determination that the number of data strings having the synonym candidate exceeds a threshold.
Dependent claims: 2, 3, 4, 5, 6, 7, 36.

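The count-and-threshold test in claim 1 can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and does not come from the patent: the function names, the sample item titles, the threshold value, and the choice to treat a data string as "having" the candidate when it contains the candidate as a whole whitespace-delimited word.

```python
def count_strings_with(candidate, data_strings):
    """Count data strings that contain the candidate as a whole word.

    Whole-word, case-insensitive matching is an assumption of this sketch.
    """
    cand = candidate.lower()
    return sum(1 for s in data_strings if cand in s.lower().split())


def maybe_add_synonym(token, candidate, data_strings, dictionary, threshold):
    """Add candidate as a synonym for token only if its count exceeds threshold."""
    if count_strings_with(candidate, data_strings) > threshold:
        dictionary.setdefault(token, set()).add(candidate)
        return True
    return False


# Hypothetical item titles standing in for the database's data strings.
titles = [
    "Apple iPod nano 4GB",
    "ipod shuffle silver",
    "iPod classic 80GB",
    "mp3 player generic",
]

expansion = {}
added = maybe_add_synonym("mp3", "ipod", titles, expansion, threshold=2)
rejected = maybe_add_synonym("mp3", "walkman", titles, expansion, threshold=2)
```

Here "ipod" appears in three titles, which exceeds the hypothetical threshold of 2, so it is recorded under "mp3"; "walkman" appears in none and is skipped.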
8. A method comprising:
receiving a token that is usable in search queries for data in a database, the database including items of data stored in a tree hierarchy of categories;
determining a synonym candidate for the token; and
adding the synonym candidate as a synonym for the token into an expansion dictionary in response to a determination that the token and the synonym candidate are in a same category for a level of the tree hierarchy.
Dependent claims: 9, 10, 11, 12, 13, 37.

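One way to picture the same-category test in claim 8 is sketched below. The representation is an assumption of this sketch, not something the claim specifies: each item carries a category path from the root to its leaf, a term is associated with the categories of the items whose title contains it, and two terms share a category at a level when their path prefixes down to that level intersect. The function names and sample items are hypothetical.

```python
def categories_at_level(term, items, level):
    """Return the category-path prefixes (root..level) of items containing term."""
    term = term.lower()
    return {
        path[: level + 1]
        for title, path in items
        if term in title.lower().split() and len(path) > level
    }


def share_category(token, candidate, items, level):
    """True if token and candidate occur in at least one common category at level."""
    token_cats = categories_at_level(token, items, level)
    candidate_cats = categories_at_level(candidate, items, level)
    return bool(token_cats & candidate_cats)


# Hypothetical items: (title, category path from root to leaf).
items = [
    ("mp3 player 2GB", ("Electronics", "Audio", "Portable Players")),
    ("ipod nano", ("Electronics", "Audio", "Portable Players")),
    ("apple pie recipe book", ("Books", "Cooking")),
]

expansion = {}
if share_category("mp3", "ipod", items, level=1):
    expansion.setdefault("mp3", set()).add("ipod")
```

Comparing whole path prefixes rather than bare category names avoids treating two same-named categories in different branches as one category.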
14. A method comprising:
receiving a token that is usable in search queries for data in a database, the database including items of data stored in a tree hierarchy of categories;
determining a synonym candidate for the token;
determining a probability distribution across leaf categories in the tree hierarchy for the token;
determining a probability distribution across leaf categories in the tree hierarchy for the synonym candidate; and
adding the synonym candidate as a synonym for the token into an expansion dictionary in response to a determination that a similarity between the probability distribution for the token and the probability for the synonym candidate is above a predetermined level.
Dependent claims: 15, 16, 17, 18, 19, 38.

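The two distributions in claim 14 can be estimated from item counts, as in the following sketch. The claim does not name a particular similarity measure, so cosine similarity is used here purely as a stand-in; the function names, sample items, and the estimation of each distribution from relative frequencies of matching items are likewise assumptions.

```python
import math
from collections import Counter


def leaf_distribution(term, items):
    """Estimate P(leaf category | term) from items whose title contains term."""
    term = term.lower()
    counts = Counter(leaf for title, leaf in items if term in title.lower().split())
    total = sum(counts.values())
    return {leaf: n / total for leaf, n in counts.items()} if total else {}


def cosine_similarity(p, q):
    """Cosine similarity between two sparse distributions (a stand-in measure)."""
    dot = sum(p[k] * q.get(k, 0.0) for k in p)
    norm = math.sqrt(sum(v * v for v in p.values())) * math.sqrt(
        sum(v * v for v in q.values())
    )
    return dot / norm if norm else 0.0


# Hypothetical items: (title, leaf category).
items = [
    ("mp3 player", "Portable Players"),
    ("mp3 player case", "Accessories"),
    ("ipod nano", "Portable Players"),
    ("ipod case", "Accessories"),
    ("garden hose", "Garden"),
]

p_token = leaf_distribution("mp3", items)
p_candidate = leaf_distribution("ipod", items)
similarity = cosine_similarity(p_token, p_candidate)

expansion = {}
PREDETERMINED_LEVEL = 0.8  # hypothetical value
if similarity > PREDETERMINED_LEVEL:
    expansion.setdefault("mp3", set()).add("ipod")
```

In this toy data "mp3" and "ipod" are spread identically over the two leaf categories, so the similarity is at its maximum, while a term such as "hose" shares no leaf category with "mp3" and scores zero.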
20. A method comprising:
receiving a token that is usable in search queries for data in a database, the database including items of data stored in a tree hierarchy of categories;
determining a synonym candidate for the token;
determining a probability distribution across leaf categories in the tree hierarchy for the token;
determining a probability distribution across leaf categories in the tree hierarchy for the synonym candidate;
determining a cross entropy for the probability distribution for the token and the probability for the synonym candidate; and
adding the synonym candidate as a synonym for the token into an expansion dictionary in response to a determination that a cross entropy is above a predetermined level.
Dependent claims: 21, 22, 23, 24, 39.

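For reference, the cross entropy named in claim 20 is conventionally defined as H(p, q) = -sum over x of p(x) log q(x). The sketch below computes it for two distributions over leaf categories. The natural-log base, the small floor `eps` that avoids log(0) for categories missing from q, the function name, and the sample distributions are all assumptions of this sketch; the comparison direction simply follows the claim wording.

```python
import math


def cross_entropy(p, q, eps=1e-9):
    """H(p, q) = -sum_x p(x) * log q(x), in nats.

    q is floored at eps so that categories absent from q do not produce log(0).
    """
    return -sum(
        px * math.log(max(q.get(x, 0.0), eps)) for x, px in p.items() if px > 0
    )


# Hypothetical leaf-category distributions for a token and a synonym candidate.
p_token = {"Portable Players": 0.5, "Accessories": 0.5}
p_candidate = {"Portable Players": 0.5, "Accessories": 0.5}

h = cross_entropy(p_token, p_candidate)

PREDETERMINED_LEVEL = 0.5  # hypothetical value
meets_claimed_condition = h > PREDETERMINED_LEVEL  # direction as worded in the claim
```

For two identical uniform distributions over two categories, the cross entropy equals the entropy of either one, log 2 (about 0.693 nats).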
25. A method comprising:
receiving a token that is usable in search queries for data in a database, the database including items of data stored in a tree hierarchy of categories;
determining a synonym candidate for the token;
determining a probability distribution across leaf categories in the tree hierarchy for the token;
determining a probability distribution across leaf categories in the tree hierarchy for the synonym candidate;
determining a KL-divergence measure of the probability distribution for the token and the probability for the synonym candidate; and
determining a cross entropy for the probability distribution for the token and the probability for the synonym candidate; and
adding the synonym candidate as a synonym for the token into an expansion dictionary in response to a determination that the KL-divergence is below a threshold.
Dependent claims: 26, 27, 28, 29, 30, 40.

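Claim 25 mentions both a KL-divergence and a cross entropy. Under the standard definitions the two are related by KL(p || q) = H(p, q) - H(p), which the following sketch uses. The function names, the `eps` floor for categories missing from q, the sample distributions, and the threshold value are assumptions for illustration only.

```python
import math


def entropy(p):
    """H(p) = -sum_x p(x) * log p(x), in nats."""
    return -sum(v * math.log(v) for v in p.values() if v > 0)


def cross_entropy(p, q, eps=1e-9):
    """H(p, q) = -sum_x p(x) * log q(x), with q floored at eps."""
    return -sum(
        px * math.log(max(q.get(x, 0.0), eps)) for x, px in p.items() if px > 0
    )


def kl_divergence(p, q, eps=1e-9):
    """KL(p || q) computed as cross entropy minus entropy."""
    return cross_entropy(p, q, eps) - entropy(p)


# Hypothetical leaf-category distributions.
p_token = {"Portable Players": 0.5, "Accessories": 0.5}
p_close = {"Portable Players": 0.6, "Accessories": 0.4}
p_far = {"Portable Players": 0.9, "Accessories": 0.1}

THRESHOLD = 0.1  # hypothetical value
expansion = {}
for candidate, dist in (("ipod", p_close), ("charger", p_far)):
    if kl_divergence(p_token, dist) < THRESHOLD:
        expansion.setdefault("mp3", set()).add(candidate)
```

A small divergence means the candidate is distributed over leaf categories much like the token, so only the candidate with the closer distribution is added in this example.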
31. A system comprising:
a data store to store auction items for an online auction database, wherein the auction items are represented by data strings; and
an expansion module to determine a synonym candidate for a token that is usable in search queries for the auction items in the online auction database, wherein the expansion module is to add the synonym candidate as a synonym for the token into an expansion dictionary in response to a determination that the number of data strings having the synonym candidate exceeds a threshold.
Dependent claims: 32, 33, 34, 35.

Specification