×

Automatic taxonomy generation in search results using phrases

  • US 7,426,507 B1
  • Filed: 07/26/2004
  • Issued: 09/16/2008
  • Est. Priority Date: 07/26/2004
  • Status: Active Grant
First Claim
Patent Images

1. A method of presenting documents in response to a search of a document collection, the method comprising:

  • retrieving a plurality of documents in response to a query, the query comprising at least one query phrase;

    determining related phrases that are related to the query phrase, wherein for each query phrase gj, gk is a related phrase of phrase gj where an information gain I of gk with respect to gj exceeds a predetermined threshold, the information gain I being a function of A(j,k) and E(j,k), where A(j,k) is a measure of an actual co-occurrence rate of gj and gk, and E(j,k) is an expected co-occurrence rate gj and gk;

    determining a plurality of clusters, each cluster associated with one of the related phrases, and having a cluster name corresponding to the related phrase; and

    for each cluster, presenting a number of documents containing the related phrase associated with the cluster, along with the cluster name.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×