Method and apparatus for characterizing documents based on clusters of related words

  • US 20040068697A1
  • Filed: 09/30/2003
  • Published: 04/08/2004
  • Est. Priority Date: 10/03/2002
  • Status: Active Grant
First Claim
1. A method for characterizing a document with respect to clusters of conceptually related words, comprising:

  • receiving the document, wherein the document contains a set of words;

  • selecting candidate clusters of conceptually related words that are related to the set of words;

  • wherein the candidate clusters are selected using a model that explains how sets of words are generated from clusters of conceptually related words; and

  • constructing a set of components to characterize the document, wherein the set of components includes components for candidate clusters, wherein each component indicates a degree to which a corresponding candidate cluster is related to the set of words.
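
The three steps of the claim (receive a document, select candidate clusters, construct per-cluster components) can be illustrated with a minimal sketch. The claim does not specify the model or the scoring rule, so everything below is an assumption made for illustration: the `CLUSTERS` table, its probabilities, the overlap-based candidate test, and the normalized-sum scoring in `characterize` are all hypothetical and are not taken from the patent.

```python
from typing import Dict, List, Set

# Hypothetical toy model: each cluster maps a word to the probability that the
# cluster generates that word. Names and numbers are illustrative only.
CLUSTERS: Dict[str, Dict[str, float]] = {
    "pets": {"cat": 0.4, "dog": 0.4, "leash": 0.2},
    "cars": {"engine": 0.5, "wheel": 0.3, "jaguar": 0.2},
    "big_cats": {"jaguar": 0.5, "lion": 0.3, "cat": 0.2},
}


def tokenize(document: str) -> Set[str]:
    """Step 1: reduce the received document to its set of lowercase words."""
    words = {w.strip(".,;:!?").lower() for w in document.split()}
    words.discard("")
    return words


def select_candidates(words: Set[str],
                      clusters: Dict[str, Dict[str, float]]) -> List[str]:
    """Step 2 (assumed rule): keep clusters that can generate at least one word."""
    return [name for name, probs in clusters.items()
            if any(w in probs for w in words)]


def characterize(document: str,
                 clusters: Dict[str, Dict[str, float]] = CLUSTERS) -> Dict[str, float]:
    """Step 3 (assumed rule): one component per candidate cluster.

    Each component is the cluster's summed word probability over the
    document's words, normalized so the components sum to 1.
    """
    words = tokenize(document)
    candidates = select_candidates(words, clusters)
    raw = {name: sum(clusters[name].get(w, 0.0) for w in words)
           for name in candidates}
    total = sum(raw.values())
    return {name: score / total for name, score in raw.items()} if total else {}


if __name__ == "__main__":
    print(characterize("The cat chased the dog."))
```

In this sketch the returned dictionary plays the role of the "set of components": its keys are the candidate clusters and its values indicate how strongly each cluster relates to the document's words. A real system would replace the hand-written table with a learned generative model.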
  • 2 Assignments