
System and method for scoring concepts in a document set

  • US 8,626,761 B2
  • Filed: 10/26/2009
  • Issued: 01/07/2014
  • Est. Priority Date: 07/25/2003
  • Status: Active Grant
First Claim

1. A system for scoring concepts in a document set, comprising:

  • a database to maintain a set of documents;

  • a concept identification module to identify concepts comprising two or more terms extracted from the document set and to designate each document having one or more of the concepts as a candidate seed document;

  • a value module to determine for each of the concepts identified within each candidate seed document, values for a frequency of occurrence of that concept within that candidate seed document, a concept weight reflecting a specificity of meaning for that concept within that candidate seed document, a structural weight reflecting a degree of significance based on a location of that concept within that candidate seed document, and a corpus weight inversely weighing a reference count of the occurrence for that concept within the document set according to the equation;
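
The first two inputs named in the claim elements above can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the whitespace tokenizer, the sample documents, and the fixed list of candidate concepts are all hypothetical. It also assumes that "reference count" means the number of documents in which a concept occurs. The concept weight, the structural weight, and the corpus weight equation are not reproduced in this excerpt, so the sketch deliberately stops short of computing them.

```python
from collections import Counter


def find_concepts(text, concepts):
    """Count occurrences of each multi-term concept in one document.

    Hypothetical helper: tokenizes on whitespace and matches case-insensitively.
    """
    tokens = text.lower().split()
    counts = Counter()
    for concept in concepts:
        terms = concept.lower().split()
        if len(terms) < 2:
            continue  # the claim defines a concept as two or more terms
        n = len(terms)
        hits = sum(
            1 for i in range(len(tokens) - n + 1) if tokens[i:i + n] == terms
        )
        if hits:
            counts[concept] = hits
    return counts


def score_inputs(documents, concepts):
    """Return candidate seed documents and per-concept reference counts.

    A document is a candidate seed document if it contains at least one concept.
    Assumption: the reference count is the number of documents containing the concept.
    """
    per_doc = {doc_id: find_concepts(text, concepts)
               for doc_id, text in documents.items()}
    seeds = {doc_id: counts for doc_id, counts in per_doc.items() if counts}
    reference_count = Counter()
    for counts in seeds.values():
        reference_count.update(counts.keys())
    return seeds, reference_count


# Hypothetical sample data, for illustration only.
documents = {
    "d1": "the stock option plan and the stock option grant",
    "d2": "a quarterly earnings report",
    "d3": "stock option pricing in the earnings report",
    "d4": "meeting notes for tuesday",
}
concepts = ["stock option", "earnings report", "plan"]

seeds, reference_count = score_inputs(documents, concepts)
for doc_id, counts in seeds.items():
    print(doc_id, dict(counts))
print(dict(reference_count))
```

In this sketch the single-term entry "plan" is skipped and "d4" is not designated a candidate seed document, since it contains none of the concepts. A fuller version would attach the three weights to each (document, concept) pair once their definitions are available.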

  • 11 Assignments