Algorithm for fast disk based text mining
First Claim
1. A method of executing a query for at least one document similar to a specified document, the method comprising:
- receiving the query;
forming a reduced query document based on ranks of terms in the specified document;
generating a modified query based on the query and the reduced query document;
executing the modified query on a data repository to generate a set of results; and
providing a result to a user interface.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods and apparatus, including computer systems and program products, for executing a query, for example, a query for a document similar to another document. In one general aspect, the techniques feature a method of executing a query for at least one document similar to a specified document. That method includes receiving the query; forming a reduced query document based on ranks of terms in the specified document; generating a modified query based on the query and the reduced query document; executing the modified query on a data repository to generate a set of results; and, providing a result to a user interface.
-
Citations
18 Claims
-
1. A method of executing a query for at least one document similar to a specified document, the method comprising:
-
receiving the query;
forming a reduced query document based on ranks of terms in the specified document;
generating a modified query based on the query and the reduced query document;
executing the modified query on a data repository to generate a set of results; and
providing a result to a user interface. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. An information management system, the system comprising:
-
a data repository, wherein the data repository is configured to store documents; and
a program for executing queries on the data repository, wherein the program is operative to;
receive a query for at least one document similar to a specified document;
form a reduced query document based on ranks of terms in the specified document;
generate a modified query based on the query and the reduced query document;
execute the modified query on the data repository to generate a set of results; and
provide a result to a user interface. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification