System and method for performing efficient document scoring and clustering

  • US 20050022106A1
  • Filed: 07/25/2003
  • Published: 01/27/2005
  • Est. Priority Date: 07/25/2003
  • Status: Active Grant
First Claim

1. A system for grouping clusters of semantically scored documents, comprising:

  • a scoring module determining a score assigned to at least one concept extracted from a plurality of documents based on at least one of a frequency of occurrence of the at least one concept within at least one such document, a concept weight, a structural weight, and a corpus weight; and

  • a clustering module forming clusters of the documents by applying the score for the at least one concept to a best fit criterion for each such document.
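
The two modules recited in the claim can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration, not the patent's method: the claim does not say how the four factors are combined (a simple product is assumed), and it does not define the best fit criterion (cosine similarity to a running cluster centroid, with a hypothetical `min_similarity` threshold, is assumed). The function names `score_concepts` and `cluster_documents` are likewise hypothetical.

```python
import math
from collections import Counter


def score_concepts(concepts, concept_weight=None, structural_weight=None,
                   corpus_weight=None):
    """Score each concept in one document.

    ASSUMPTION: the score is the product of frequency and the three weights;
    any weight that is not supplied defaults to 1.0.
    """
    concept_weight = concept_weight or {}
    structural_weight = structural_weight or {}
    corpus_weight = corpus_weight or {}
    scores = {}
    for concept, frequency in Counter(concepts).items():
        scores[concept] = (frequency
                           * concept_weight.get(concept, 1.0)
                           * structural_weight.get(concept, 1.0)
                           * corpus_weight.get(concept, 1.0))
    return scores


def cosine(a, b):
    """Cosine similarity between two sparse score vectors (dicts)."""
    dot = sum(value * b.get(key, 0.0) for key, value in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster_documents(score_vectors, min_similarity=0.5):
    """Group documents by an assumed best fit criterion.

    ASSUMPTION: a document joins the existing cluster whose centroid is most
    similar to it, provided the similarity reaches min_similarity; otherwise
    it starts a new cluster. Returns lists of document indices.
    """
    clusters = []  # each entry: {"centroid": dict, "members": [indices]}
    for index, vector in enumerate(score_vectors):
        best, best_similarity = None, min_similarity
        for cluster in clusters:
            similarity = cosine(vector, cluster["centroid"])
            if similarity >= best_similarity:
                best, best_similarity = cluster, similarity
        if best is None:
            clusters.append({"centroid": dict(vector), "members": [index]})
            continue
        best["members"].append(index)
        count = len(best["members"])
        # Update the centroid as the running mean of its members' scores.
        keys = set(best["centroid"]) | set(vector)
        best["centroid"] = {
            key: (best["centroid"].get(key, 0.0) * (count - 1)
                  + vector.get(key, 0.0)) / count
            for key in keys
        }
    return [cluster["members"] for cluster in clusters]


# Usage with three toy documents, each given as a list of extracted concepts.
documents = [["cat", "pet", "cat"], ["cat", "pet"], ["stock", "market"]]
vectors = [score_concepts(doc) for doc in documents]
groups = cluster_documents(vectors)
```

In this toy run the first two documents share concepts and fall into one cluster, while the third shares none and starts its own. A different combination rule or fit criterion would be equally consistent with the claim language.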
