Query Correction Using Indexed Content on a Desktop Indexer Program
First Claim
1. A computerized method of searching computer files, comprising:
- applying a query to a database of indexed keywords extracted from computer files, the query including a query keyword;
obtaining, by said application of the query, a query result score representative of the number of computer files associated with indexed keywords that satisfy the query;
identifying a similarity list from the database of indexed keywords, the similarity list including a similar indexed keyword that is similar to the query keyword;
substituting the similar indexed keyword for the query keyword in the query to create a similar query;
applying the similar query to the indexed keyword database to obtain a similar query result score representative of the number of computer files associated with indexed keywords that satisfy the similar query; and
suggesting the similar query to a user if the similar query result score exceeds a threshold score.
2 Assignments
0 Petitions
Accused Products
Abstract
Apparatus and methods useful for desktop searching are disclosed. According to one embodiment of the invention, for each query, the system evaluates if similar ones would give more results. If so, the system suggests the user with one or several similar queries. Similar queries are based on indexed content so corrections conduct to (that is, lead to) at least one result. According to another embodiment of the invention, for each keyword of the query the system computes a similarity value for a subset of all keywords of the index database. The subset is composed of all keywords minus the ones too short and too lengthy to be similar. Therefore, each keyword from the query gets a list of similar indexed keywords. According to yet another embodiment of the invention, the system replaces each query keyword, one after another, by a member of its similarity list until all combinations of substitutions are listed. The number of results for each similar query is calculated. Corrected queries giving significantly more results than the original query are suggested to the user.
-
Citations
31 Claims
-
1. A computerized method of searching computer files, comprising:
-
applying a query to a database of indexed keywords extracted from computer files, the query including a query keyword;
obtaining, by said application of the query, a query result score representative of the number of computer files associated with indexed keywords that satisfy the query;
identifying a similarity list from the database of indexed keywords, the similarity list including a similar indexed keyword that is similar to the query keyword;
substituting the similar indexed keyword for the query keyword in the query to create a similar query;
applying the similar query to the indexed keyword database to obtain a similar query result score representative of the number of computer files associated with indexed keywords that satisfy the similar query; and
suggesting the similar query to a user if the similar query result score exceeds a threshold score. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A computerized method of searching computer files, comprising:
-
applying a query to a database of indexed keywords extracted from computer files, the query including one or more query keywords;
obtaining, by said application of the query, a query result score representative of the number of computer files associated with indexed keywords that satisfy the query;
for each query keyword, creating a similarity list from the database of indexed keywords, the similarity list including one or more similar indexed keywords, each similar indexed keyword being similar to the query keyword;
repeatedly replacing each query keyword with each of the similar indexed keywords in the similarity list associated therewith until all combinations of query keywords and similar indexed keywords in the query have been exhausted, each combination representing a similar query;
applying each similar query to the indexed keyword database to obtain, for each similar query, a similar query result score representative of the number of computer files associated with indexed keywords that satisfy the similar query; and
suggesting to a user each similar query having a similar query result score that exceeds a threshold score. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21)
-
-
22. A method of query correction for desktop searching, comprising:
-
indexing content to create a database of indexed keywords, the content including computer files;
applying a query to the database of indexed keywords, the query including a query keyword;
obtaining, by said application of the query, a first list of computer files associated with indexed keywords that satisfy the query;
identifying a similarity list from the database of indexed keywords, the similarity list including a similar indexed keyword similar to the query keyword;
replacing the similar indexed keyword for the query keyword in the query to create a similar query;
applying the similar query to the indexed keyword database to obtain a second list of computer files associated with indexed keywords that satisfy the similar query; and
suggesting the similar query to a user if the number of computer files on the second list exceeds a threshold number. - View Dependent Claims (23, 24, 25, 26, 27, 28, 29, 30, 31)
-
Specification