Method for search result clustering
First Claim
1. A method for clustering a set of documents that are obtained as the search results in response to a search query from a searcher using a computer or computer network, said search results are selected, based on the relevance to the search query, from a plurality of documents that are indexed with a set of keywords, comprising:
a. prior to processing the search query, recording the classes of each indexed document when the document is searched with one or several of keywords, for at least some of the index keywords and some of the indexed documents; and
b. grouping the search results according to said classes of each result document with respect to the keyword or keywords included in the search query.
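Steps (a) and (b) can be illustrated with a minimal sketch. The table layout, the function names `record_classes` and `group_results`, and the sample keywords, documents and classes are all hypothetical; the claim does not specify how classes are stored or looked up.

```python
from collections import defaultdict


def record_classes(class_index, keyword, doc_id, classes):
    """Step (a): before any query arrives, store the classes a document
    belongs to when it is searched with a given index keyword."""
    class_index[(keyword, doc_id)] = set(classes)


def group_results(class_index, query_keywords, result_doc_ids):
    """Step (b): group result documents by the classes recorded for the
    keywords that appear in the query. No classification runs at query time."""
    clusters = defaultdict(list)
    for doc_id in result_doc_ids:  # results arrive in relevance order
        for keyword in query_keywords:
            for cls in sorted(class_index.get((keyword, doc_id), ())):
                if doc_id not in clusters[cls]:
                    clusters[cls].append(doc_id)
    return dict(clusters)


# Hypothetical data: three documents indexed under the keyword "jaguar".
class_index = {}
record_classes(class_index, "jaguar", "d1", ["animals"])
record_classes(class_index, "jaguar", "d2", ["cars"])
record_classes(class_index, "jaguar", "d3", ["cars", "sports"])
record_classes(class_index, "speed", "d3", ["sports"])

clusters = group_results(class_index, ["jaguar"], ["d1", "d2", "d3"])
```

Here `clusters` maps each class to the result documents recorded under it for the query keyword, so a document with several recorded classes appears in several clusters.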
Abstract
Methods and systems are presented to predetermine and record the classes of each indexed document with respect to each of its index keywords, and to provide high quality and relevant classification of the document when it is searched with said keyword. Document classes, recorded in advance, are used as the clustering information of each document in the search results to realize efficient, large-scale and high quality search result clustering. One embodiment provides a method for search result clustering, which includes recording the classes of each indexed document when the document is searched with each of its index keywords. This method further includes grouping the search results according to the classes of each result document with respect to the keyword or keywords contained in the search query. By prerecording the classes of each document with respect to each index keyword, the classes of each document in the search results in response to a search query can be directly determined via the keywords included in the search query. Each result document is put into each of its classes associated with each of the search keywords, and the union of all the classes of the result documents is used to construct the final document clusters for the search results. The clusters are ranked according to the ranks of documents included in each cluster and the weights of the clustered documents in the corresponding cluster. The clustered search results are presented to the user in such a way that clusters with higher ranks, and documents with higher ranks in each cluster are preferentially presented. Each cluster can be displayed and navigated in an independent framed subarea of the output window.
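The cluster ranking described in the abstract can be sketched as follows. The abstract says only that clusters are ranked by the ranks and weights of their documents; it gives no formula. The scoring rule used here (sum of weight divided by 1-based document rank) is an assumption chosen for illustration, and the function name `rank_clusters` and the sample data are hypothetical.

```python
def rank_clusters(clusters, doc_rank):
    """Order clusters, and the documents inside each cluster.

    clusters: class name -> {doc_id: weight of the document in that class}
    doc_rank: doc_id -> 1-based rank in the original result list (1 is best)
    """
    def score(members):
        # Assumed rule, not taken from the source: highly ranked,
        # heavily weighted documents contribute most to a cluster.
        return sum(weight / doc_rank[doc] for doc, weight in members.items())

    # Higher score first; ties broken by class name for a stable order.
    ordered = sorted(clusters.items(), key=lambda kv: (-score(kv[1]), kv[0]))
    # Inside each cluster, better-ranked documents come first.
    return [
        (name, sorted(members, key=lambda doc: doc_rank[doc]))
        for name, members in ordered
    ]


# Hypothetical ranks and per-class weights for three result documents.
doc_rank = {"d1": 1, "d2": 2, "d3": 3}
weighted_clusters = {
    "animals": {"d1": 0.5},
    "cars": {"d2": 1.0, "d3": 0.9},
    "sports": {"d3": 0.3},
}

ranked = rank_clusters(weighted_clusters, doc_rank)
```

With this rule a cluster holding two well-weighted documents can outrank a cluster holding only the top result, which is one way to read "clusters with higher ranks ... are preferentially presented"; other scoring rules would be equally consistent with the abstract.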
350 Citations
10 Claims
Claim 1 is recited above under First Claim. Claims 2, 3, 4, 5, 6, 7, 8, 9 and 10 depend on claim 1.
Specification