Method and apparatus for characterizing documents based on clusters of related words

  • US 8,688,720 B1
  • Filed: 06/02/2008
  • Issued: 04/01/2014
  • Est. Priority Date: 10/03/2002
  • Status: Active Grant
First Claim

1. A computer-implemented method comprising:

    receiving a resource that includes a set of words;

    identifying a set of candidate clusters from a probabilistic model that are classified as likely to be active in generating the set of words;

    generating a vector that characterizes the resource, wherein, for each cluster of the set of candidate clusters, a component of the vector indicates a degree to which the cluster was active in generating the set of words; and

    using the vector in performing an operation related to the resource.
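
The four steps of the claim can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the toy `MODEL`, the `BACKGROUND` probability, the log-likelihood-ratio scoring, the threshold, and the choice of cosine similarity as the "operation related to the resource". The patent's actual probabilistic model and inference procedure are not described in this claim.

```python
from math import log, sqrt

# Hypothetical toy model: each cluster maps words to P(word | cluster active).
MODEL = {
    "cooking": {"recipe": 0.30, "oven": 0.20, "flour": 0.20},
    "travel": {"flight": 0.30, "hotel": 0.25, "oven": 0.01},
    "finance": {"stock": 0.30, "bond": 0.20},
}
BACKGROUND = 0.001  # assumed probability of a word when no cluster is active


def cluster_score(words, cluster):
    """Sum of log-likelihood ratios of each word under the cluster vs. background."""
    return sum(log(cluster.get(w, BACKGROUND) / BACKGROUND) for w in words)


def characterize(words, model=MODEL, threshold=1.0):
    """Return a sparse vector {cluster name: degree of activity} for the words."""
    # Step 2: keep only clusters classified as likely to be active.
    scores = {name: cluster_score(words, c) for name, c in model.items()}
    candidates = {n: s for n, s in scores.items() if s > threshold}
    # Step 3: one component per candidate cluster, normalized to sum to 1.
    total = sum(candidates.values())
    return {n: s / total for n, s in candidates.items()} if total else {}


def similarity(u, v):
    """Step 4 (one possible operation): cosine similarity of two sparse vectors."""
    dot = sum(u[k] * v[k] for k in u.keys() & v.keys())
    norm = sqrt(sum(x * x for x in u.values())) * sqrt(sum(x * x for x in v.values()))
    return dot / norm if norm else 0.0


# Step 1: receive a resource that includes a set of words.
recipe_page = ["recipe", "oven", "flour"]
vector = characterize(recipe_page)
```

In this sketch the vector is sparse: clusters that fall below the threshold have no component at all, so comparing two resources only touches the clusters they share.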
