×

System and method for performing efficient document scoring and clustering

  • US 7,610,313 B2
  • Filed: 07/25/2003
  • Issued: 10/27/2009
  • Est. Priority Date: 07/25/2003
  • Status: Active Grant
First Claim
Patent Images

1. A system for providing efficient document scoring of concepts within and clustering of documents in an electronically-stored document set, comprising:

  • a database electronically storing a document set;

    a scoring module scoring a document in the electronically-stored document set, comprising;

    a frequency submodule determining a frequency of occurrence of at least one concept within a document;

    a concept weight submodule analyzing a concept weight reflecting a specificity of meaning for the at least one concept within the document, wherein the concept weight is based on a number of terms for the at least one concept;

    a structural weight submodule analyzing a structural weight reflecting a degree of significance based on structural location within the document for the at least one concept;

    a corpus weight submodule analyzing a corpus weight inversely weighing a reference count of occurrences for the at least one concept within the document;

    a scoring evaluation submodule evaluating a score to be associated with the at least one concept as a function of a summation of the frequency, concept weight, structural weight, and corpus weight in accordance with the formula;

View all claims
  • 13 Assignments
Timeline View
Assignment View
    ×
    ×