×

Method for document comparison and selection

  • US 20020103799A1
  • Filed: 12/05/2001
  • Published: 08/01/2002
  • Est. Priority Date: 12/06/2000
  • Status: Active Grant
First Claim
Patent Images

1. A method for representing the latent semantic content of a plurality of documents, each document containing a plurality of terms, the method comprising:

  • deriving at least one n-tuple term from the plurality of terms;

    forming a two-dimensional matrix, each matrix column c corresponding to a document, each matrix row r corresponding to a term occurring in at least one document corresponding to a matrix column, each matrix element (r, c) related to the number of occurrences of the term corresponding to the row r in the document corresponding to column c, at least one matrix element related to the number of occurrences of one at least one n-tuple term occurring in the at least one document, and performing singular value decomposition and dimensionality reduction on the matrix to form a latent semantic indexed vector space.

View all claims
  • 11 Assignments
Timeline View
Assignment View
    ×
    ×