Systems and methods for employing an orthogonal corpus for document indexing

  • US 7,275,061 B1
  • Filed: 04/13/2000
  • Issued: 09/25/2007
  • Est. Priority Date: 04/13/2000
  • Status: Expired due to Fees
First Claim

1. A method for topical indexing a document collection that is initially unconnected with a body of textual reference material, comprising:

  • processing the body of textual reference material into a plurality of text portions, each text portion being associated with a single topic from a plurality of topics;

    processing said plurality of text portions to derive keywords for each topic;

    assigning a weight to each keyword in a text portion;

    associating a keyword with a corresponding text portion if the weight of said keyword in said corresponding text portion is equal to or greater than a weight of said keyword in the text portions other than the corresponding text portion, or is equal to or greater than a predetermined threshold value;

    forming first keyword-weight pairs of said associated keywords;

    applying the associated keywords to at least one document from said initially unconnected document collection and forming second keyword-weight pairs associated with said at least one document;

    forming a numeric score between the first and second keyword-weight pairs; and

    based on said score, associating said at least one document from said initially unconnected document collection and said single topic from said plurality of topics.
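
The steps of claim 1 can be illustrated with a minimal Python sketch. It is one possible reading of the claim, not the patented implementation. The claim does not say how weights or the numeric score are computed, so relative term frequency and a dot product are assumptions made here, as are the threshold value, the sample texts, and every function name (`tokenize`, `weights`, `topic_keywords`, `score`, `index_document`). The sketch also reads "the text portions other than the corresponding text portion" as all other portions.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def weights(text):
    """Weight each word by relative term frequency (an assumed weighting)."""
    tokens = tokenize(text)
    counts = Counter(tokens)
    total = len(tokens) or 1
    return {word: count / total for word, count in counts.items()}


def topic_keywords(portions, threshold=0.2):
    """Form the first keyword-weight pairs for each topic.

    `portions` maps a topic name to its text portion. A keyword is kept for
    a portion if its weight there is >= its weight in every other portion,
    or >= `threshold` (a hypothetical value).
    """
    all_weights = {topic: weights(text) for topic, text in portions.items()}
    first_pairs = {}
    for topic, topic_weights in all_weights.items():
        kept = {}
        for keyword, weight in topic_weights.items():
            others = [w.get(keyword, 0.0)
                      for t, w in all_weights.items() if t != topic]
            if all(weight >= o for o in others) or weight >= threshold:
                kept[keyword] = weight
        first_pairs[topic] = kept
    return first_pairs


def score(first, second):
    """Numeric score between two sets of pairs (assumed: dot product)."""
    return sum(weight * second.get(keyword, 0.0)
               for keyword, weight in first.items())


def index_document(document, first_pairs):
    """Associate a document with the best-scoring topic."""
    doc_weights = weights(document)
    scores = {}
    for topic, first in first_pairs.items():
        # Second keyword-weight pairs: the topic's keywords applied to the document.
        second = {keyword: doc_weights.get(keyword, 0.0) for keyword in first}
        scores[topic] = score(first, second)
    best_topic = max(scores, key=scores.get)
    return best_topic, scores


# Made-up reference material: one text portion per topic.
portions = {
    "astronomy": "planet star orbit telescope star",
    "cooking": "recipe oven flour bake oven",
}
first_pairs = topic_keywords(portions)
best_topic, scores = index_document(
    "the telescope tracked a star in orbit", first_pairs)
```

Here the document collection never has to share structure with the reference material: the only link between them is the set of keyword-weight pairs derived from the reference text, which matches the "initially unconnected" language of the claim.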
