Method for document comparison and selection

  • US 7,113,943 B2
  • Filed: 12/05/2001
  • Issued: 09/26/2006
  • Est. Priority Date: 12/06/2000
  • Status: Expired due to Term
First Claim

1. A computer-based method for representing latent semantic content of a plurality of documents, each document containing a plurality of terms, the method comprising:

  • identifying at least one idiom among the documents, each idiom containing at least one idiom term;

    replacing at least one identified idiom with a corresponding idiom elaboration, each elaboration comprising at least one elaboration term,

    forming a two-dimensional matrix, each matrix column corresponding to a document;

    each matrix row corresponding to a term;

    each matrix element representing a number of occurrences of the term corresponding to the element's row in the document corresponding to element's column, at least one matrix element corresponding to the number of occurrences of an elaboration term in a document corresponding to a matrix column;

    performing singular value decomposition and dimensionality reduction on the matrix to form a reduced matrix and storing the reduced matrix in an electronic form accessible to a user.
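The steps recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the idiom table, the sample documents, the tokenizer, the choice of k = 2, and every function name here are assumptions made for illustration, and idiom matching is done by naive substring replacement. NumPy's `numpy.linalg.svd` is used for the decomposition.

```python
import re
import numpy as np

# Hypothetical idiom table (idiom -> elaboration); illustrative only.
IDIOM_ELABORATIONS = {
    "kick the bucket": "die",
    "piece of cake": "easy task",
}

def elaborate_idioms(text, idioms=IDIOM_ELABORATIONS):
    """Lowercase the text and replace each known idiom with its elaboration."""
    text = text.lower()
    for idiom, elaboration in idioms.items():
        text = text.replace(idiom, elaboration)  # naive substring match
    return text

def term_document_matrix(documents):
    """Build a count matrix: one row per term, one column per document."""
    tokenized = [re.findall(r"[a-z']+", elaborate_idioms(d)) for d in documents]
    vocabulary = sorted({t for tokens in tokenized for t in tokens})
    row = {term: i for i, term in enumerate(vocabulary)}
    matrix = np.zeros((len(vocabulary), len(documents)))
    for col, tokens in enumerate(tokenized):
        for t in tokens:
            matrix[row[t], col] += 1
    return matrix, vocabulary

def reduce_rank(matrix, k):
    """Truncated SVD: keep only the k largest singular values."""
    u, s, vt = np.linalg.svd(matrix, full_matrices=False)
    return u[:, :k], s[:k], vt[:k, :]

def store_reduced(reduced, path):
    """Persist the reduced matrix in NumPy's .npy format."""
    np.save(path, reduced)

documents = [
    "The exam was a piece of cake",
    "The old farmer may kick the bucket soon",
    "An easy task for the farmer",
]
matrix, vocabulary = term_document_matrix(documents)
u_k, s_k, vt_k = reduce_rank(matrix, k=2)
reduced = u_k @ np.diag(s_k) @ vt_k  # rank-2 approximation of the count matrix
```

In this sketch the elaboration terms ("easy", "task", "die") receive rows in the matrix in place of the idiom terms, so the first and third documents share terms they would not otherwise share. Calling `store_reduced(reduced, "reduced_matrix.npy")` would write the result to disk; the file name is likewise a placeholder.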

  • 11 Assignments