Query-based snippet clustering for search result grouping
First Claim
Patent Images
1. A system that facilitates the processing of search results, comprising:
- an input component that receives search results;
an analysis component that extracts content of the search results; and
a grouping component that groups the search results according to a ranked saliency parameter.
2 Assignments
0 Petitions
Accused Products
Abstract
A clustering architecture that dynamically groups the search result documents into clusters labeled by phrases extracted from the search result snippets. Documents related to the same topic usually share a common vocabulary. The words are first clustered based on their co-occurrences and each cluster forms a potentially interesting topic. Keywords are chosen and then clustered by counting co-occurrences of pairs of keywords. Documents are assigned to relevant topics based on the feature vectors of the clusters.
340 Citations
33 Claims
-
1. A system that facilitates the processing of search results, comprising:
-
an input component that receives search results;
an analysis component that extracts content of the search results; and
a grouping component that groups the search results according to a ranked saliency parameter. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system that facilitates the processing of search results, comprising:
-
an input component that receives search results of a search engine in the form of at least one of titles and snippets;
an analysis component that extracts words from the at least one of titles and snippets, and identifies co-occurring words as keywords; and
a clustering component that groups the search results in a cluster according to a saliency score, and outputs the search results as a ranked list of associated documents. - View Dependent Claims (12, 13, 14)
-
-
15. A computer-readable medium having computer-executable instructions for performing a method for processing search results, the method comprising:
-
receiving the search results;
extracting phrases from the search results;
ranking the phrases to obtain salient phrases; and
dynamically grouping the search results as a cluster that is labeled by the salient phrases. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. A method of processing search results, comprising:
-
receiving the search results as a list that includes at least one of titles and snippets of information;
parsing the at least one of titles and snippets;
extracting phrases from content of the at least one of titles and snippets;
calculating properties for each of the phrases;
applying a regression model to combine the properties of a phrase into a salient score;
ranking the phrases according to the salient score to obtain salient phrases;
assigning documents to the salient phrases to generate candidate clusters; and
generating final clusters by merging the candidates clusters. - View Dependent Claims (29, 30)
-
-
31. A system that facilitates the processing of search results, comprising:
-
means for receiving the search results that contain at least one of titles and snippets of information;
means for extracting phrases from the search results;
means for calculating one or more properties for each of the phrases;
means for processing the one or more properties into a salient score for each phrase;
means for ranking the phrases according to the salient scores to obtain salient phrases; and
means for dynamically grouping the search results as a cluster that is labeled by the salient phrases. - View Dependent Claims (32, 33)
-
Specification