Search data processor
DCFirst Claim
1. A method for generating search queries to be sent to a search engine for searching a information management system, comprising the steps of:
- a) receiving an initial search query;
b) converting the initial search query to general boolean language;
c) identifying a level in a respective hierarchy tree for each search query item in the initial search query;
d) formulating additional related search queries by substituting items from the respective hierarchy tree for selected items in the query, the substituted item having a level in the hierarchy tree that is greater than or less than the level of the query item in the initial query; and
e) forwarding the initial search query and the additional search queries in parallel to the search engine.
5 Assignments
Litigations
2 Petitions
Accused Products
Abstract
A tool to be used with a search engine for a information management system includes methods for refining, filtering, and organizing search queries and search results. A query tuner in the tool allows a user to automatically reformulate a query in order to find a reasonable number of matching documents from the search engine by selectively modifying individual search terms to be weaker or stronger and concurrently requesting a plurality of searches, each with a respectively different modified query. The tool also uses a dynamic filter which employs a dynamic set of record tokens to restrict the results of an arbitrary search query to selectively include or exclude records which correspond to the set of record tokens. The tool also includes a results organizer which aids the user in understanding and visualizing a large number of matching documents returned in response to a search query by clustering like items returned from the search. The query tuner, dynamic filter and results organizer may be used individually or in conjunction. The searched information management system may be consolidated or distributed and may span a global information network such as the Internet.
626 Citations
14 Claims
-
1. A method for generating search queries to be sent to a search engine for searching a information management system, comprising the steps of:
-
a) receiving an initial search query;
b) converting the initial search query to general boolean language;
c) identifying a level in a respective hierarchy tree for each search query item in the initial search query;
d) formulating additional related search queries by substituting items from the respective hierarchy tree for selected items in the query, the substituted item having a level in the hierarchy tree that is greater than or less than the level of the query item in the initial query; and
e) forwarding the initial search query and the additional search queries in parallel to the search engine. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method for organizing a set of records into clusters comprising the steps of:
-
a) receiving the set of records;
b) analyzing at least one text field from each record of the set of records to determine any patterns;
c) partitioning each record by the text field analyzed in step (b) into clusters based on a shared pattern, wherein the step of partitioning a set of records consists of the steps of;
c1) hashing all the single words, all the pairs of consecutive words and all long sequences of words for each record in the set of records; and
c2) partitioning a set of records by the text field hashed in step (c1) into clusters d) analyzing the partitions to identify the clusters for display; and
e) displaying the clusters to the user.
-
-
9. A method for organizing a set of records into clusters, comprising the steps of:
-
a) receiving the set of records;
b) analyzing at least one text field from each record of the set of records to determine any patterns;
c) partitioning each record by the text field analyzed in step (b) into clusters based on a shared pattern;
d) analyzing the partitions to identify the clusters for display; and
e) displaying the clusters to the user;
wherein the step of analyzing the partitions to identify the clusters for display includes analyzing a best cluster for display by weighing both the number of documents that contain the shared pattern and the length of the shared pattern. - View Dependent Claims (10)
-
-
11. A system for generating search queries to be sent to a search engine for searching a information management system, comprising:
-
a) means for receiving an initial search query;
b) means for converting the initial search query to general Boolean language;
c) means for placing each search query item in the initial search query into a hierarchy tree;
d) means for formulating additional related search queries by replacing at least one search query item with an item either above or below the search query item in its hierarchy tree; and
e) means for forwarding the initial search query and the additional search queries in parallel to the search engine.
-
-
12. A system for organizing a search engine'"'"'s results including a set of documents each document including a plurality of fields, comprising:
-
a) means for receiving the set of documents;
b) means for analyzing several fields from each document to determine patterns c) means for partitioning each document by the fields analyzed in step (b) into clusters based on a shared pattern;
d) means for analyzing the partitions to identify clusters for display;
e) means for displaying the clusters of documents to the user; and
f) means for user selection of the clusters for viewing;
wherein the means for analyzing the partitions to identify the clusters for display includes means for analyzing a best cluster for display by weighing both a number of documents that contain the shared pattern and a length of the shared pattern.
-
-
13. A carrier including a computer program which, when executed by a processor, causes the processor to generate search queries to be sent to a search engine for searching a information management system, the computer program causing the computer to perform the steps of:
-
a) receiving an initial search query;
b) converting the initial search query to general Boolean language;
c) placing each search query item in the initial search query into a hierarchy tree;
d) means for formulating additional related search queries by replacing at least one search query item with an item either above or below the search query item in its hierarchy tree; and
e) forwarding the initial search query and the additional search queries in parallel to the search engine.
-
-
14. A carrier including a computer program which, when executed by a processor, causes the processor to organize a set of documents into clusters, by causing the computer to perform the steps of:
-
a) receiving the set of documents;
b) analyzing at least one field from each document to determine a pattern;
c) partitioning each document by the fields analyzed in step (b) into clusters based on a shared pattern;
d) analyzing the partitions to identify the clusters for display;
e) displaying the clusters to the user; and
f) allowing the user to select one or more of the clusters for viewing;
wherein the step of analyzing the partitions to identify the clusters for display includes the step of analyzing a best cluster for display by weighing both a number of documents that contain the shared pattern and a length of the shared pattern.
-
Specification