×

Clustering of search results

  • US 9,443,008 B2
  • Filed: 07/14/2010
  • Issued: 09/13/2016
  • Est. Priority Date: 07/14/2010
  • Status: Active Grant
First Claim
Patent Images

1. A method, comprising:

  • clustering a plurality of documents to obtain one or more first sets of clusters, wherein a first cluster of the one or more first sets of clusters comprises at least two first individual documents of the plurality of documents;

    accessing a search query after the clustering the plurality of documents;

    identifying a search result in response to the search query, wherein the search result comprises the at least two first individual documents of the plurality of documents; and

    clustering the search result to obtain a second set of clusters, wherein second individual documents of the search result belong to one second cluster of the second set of clusters, the clustering the search result comprising;

    for a unique pair of the second individual documents, computing a similarity measure for the second individual documents with respect to the search query based, at least in part, on the one or more first sets of clusters, wherein the similarity measure for the second individual documents is computed based, at least in part, on a weighted sum of a clustering similarity between the second individual documents with respect to the one or more first sets of clusters and a query-based similarity between the second individual documents with respect to the search query; and

    clustering the second individual documents based, at least in part, on the similarity measure;

    wherein the query-based similarity between the second individual documents is based, at least in part, on a fraction of a sum of;

    a textual match between the search query and the second individual documents to the textual match between the query, andthe intersection of the documents; and

    wherein the clustering similarity between the second individual documents with respect to the one or more first sets of clusters is based, at least in part, on a weighted combination of agreements between the one or more first sets of clusters and the second individual documents.

View all claims
  • 9 Assignments
Timeline View
Assignment View
    ×
    ×