Protecting information in search queries
First Claim
Patent Images
1. A search method, said method comprising:
- accepting user input of a main query including a plurality of keywords;
accepting user input that identifies at least one combination of said keywords inhibited to be used as a sub-query;
generating a plurality of sub queries based on the user input each including one or more keywords by combining the keywords included in the input main query;
excluding sub-queries that are identified as inhibited combinations;
calculating a co-occurrence probability for the keywords of each sub-query;
sorting keyword combinations in a descending order based on the co-occurrence probabilities;
excluding from use as a sub-query any combination of keywords having a co-occurrence probability below a threshold;
separately sending the generated sub queries to a number of search engines;
re-searching a plurality of search results returned from the search engine in response to the plurality of sub queries, with the input main query; and
displaying search results acquired by the re-searching step.
2 Assignments
0 Petitions
Accused Products
Abstract
A method of protecting information in search queries uses a search apparatus with a user interface that is configured for connection to a computer network that comprises a plurality of search engines on a plurality of servers. The method includes receiving a search query comprising a plurality of keywords; dividing the search query into a number of sub-queries, each sub-query comprising at least one of the keywords; and submitting the sub-queries to different search engines.
19 Citations
23 Claims
-
1. A search method, said method comprising:
-
accepting user input of a main query including a plurality of keywords; accepting user input that identifies at least one combination of said keywords inhibited to be used as a sub-query; generating a plurality of sub queries based on the user input each including one or more keywords by combining the keywords included in the input main query; excluding sub-queries that are identified as inhibited combinations; calculating a co-occurrence probability for the keywords of each sub-query; sorting keyword combinations in a descending order based on the co-occurrence probabilities; excluding from use as a sub-query any combination of keywords having a co-occurrence probability below a threshold; separately sending the generated sub queries to a number of search engines; re-searching a plurality of search results returned from the search engine in response to the plurality of sub queries, with the input main query; and displaying search results acquired by the re-searching step. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A search apparatus implemented in a computer, said apparatus comprising a processor programmed to:
-
receive a search query comprising a plurality of keywords; determine each mathematical combination of the plurality of keywords; divide the search query into a number of sub-queries, in which each sub-query comprising at least one of said keywords and in which each mathematical combination is represented as a separate sub-query; compare each sub-query to an inhibited combinations list; exclude sub-queries that are identified as inhibited combinations; after excluding sub-queries that are identified as inhibited combinations, submit remaining sub-queries to different search engines such that private information from the search query is less discernible at any one of said search engines than if more of the keywords of the search query were provided to that individual search engine; and re-search a plurality of search results returned from the different search engines in response to the submission of said sub-queries, said re-searching being performed using all the keywords of the search query. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15)
-
-
16. A computer program product for protecting information in search queries, the computer program product comprising:
a tangible computer readable storage medium having computer readable program code embodied therewith, the computer readable program code comprising; computer readable program code configured to receive a search query comprising a plurality of keywords; computer readable program code configured to determine each mathematical combination of the plurality of keywords; computer readable program code configured to divide the search query into a number of sub-queries, in which each sub-query comprising at least one, but less than all, of said keywords and in which each mathematical combination is represented as a separate sub-query; computer readable program code configured to exclude sub-queries identified as inhibited combinations and based on a co-occurrence probability, in which the co-occurrence probability is a ratio of percentage chance that multiple keywords will be used in a same search result; computer readable program code configured to submit remaining sub-queries to different search engines after said exclusion such that each search engine receives only a sub-query and less than all of said plurality of keywords such that private information from the search query is less discernible at any one of said search engines than if that search engine received more of said plurality of keywords of said search query; computer readable program code configured to, generate an integrated search results based on a rank of hits returned from the different search engines in response to the submission of said sub-queries, in which hits that share rank are sorted based on a number of rules, in which the number of rules comprise; a first rule indicating a hit returned from a priority search engine is ranked above a hit from another search engine; a second rule indicating a hit returned by more search engines is ranked above a hit returned by fewer search engines; and a third rule indicating a hit returned from a search engine with more total number of hits is ranked above a hit returned from a search engine with fewer total number of hits; in which the first rule is evaluated before the second rule and in which the second rule is evaluated before the third rule; computer readable program code configured to search the integrated search results. - View Dependent Claims (17, 18)
-
19. A method of protecting information in search queries using a search apparatus with a user interface that is configured for connection to a computer network, said network comprising a plurality of search engines on a plurality of servers, said method comprising:
-
receiving a search query comprising a plurality of keywords; dividing the search query into a number of sub-queries, each sub-query comprising at least one of said keywords but less than all of said keywords; comparing each sub-query to an inhibited combinations list that includes a number of keywords from the search query; excluding sub-queries that are identified as inhibited combinations; submitting remaining sub-queries to different search engines such that each search engine receives less than all of said plurality of keywords such that private information from the search query is less discernible at any one of said search engines than if that search engine received more of said plurality of keywords of said search query; and generating an integrated search results based on a rank of hits included in individual search results returned from the different search engines in response to the submission of said sub-queries, in which hits that share rank are sorted based on a number of priority rules; and searching the integrated search results. - View Dependent Claims (20, 21, 22, 23)
-
Specification