×

DOCUMENT SIMILARITY CALCULATION DEVICE

  • US 20120330955A1
  • Filed: 05/15/2012
  • Published: 12/27/2012
  • Est. Priority Date: 06/27/2011
  • Status: Abandoned Application
First Claim
Patent Images

1. A document similarity calculation device for calculating a similarity indicating a degree of how much a plurality of documents are similar to one another, the document similarity calculation device comprising:

  • a unit of storing associative word group for storing an associative word group composed of words associated with one another;

    a unit of generating matrix of word frequency in document for generating a matrix of word frequency in document which is a matrix each element of which is the frequency of a word present in a document with respect to each combination of the word and the document;

    a unit of transforming matrix of word frequency in document for transforming the generated matrix of word frequency in document based on the stored associative word group so as to reduce the number of dimensions of the matrix of word frequency in document; and

    a unit of calculating similarity for calculating the similarity based on the transformed matrix of word frequency in document.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×