×

Method and system for detection of authors

  • US 7,752,208 B2
  • Filed: 04/11/2007
  • Issued: 07/06/2010
  • Est. Priority Date: 04/11/2007
  • Status: Active Grant
First Claim
Patent Images

1. A method for processing information, comprising:

  • calculating, using a computer, a compression distance between a pair of different documents that do not contain duplicated content, comprising measuring how much a respective compression of each of the documents is improved by using information included in the other of the documents, wherein said measuring comprises;

    compressing each of the documents to create first and second compressed files;

    concatenating the documents to generate a concatenated document and compressing the concatenated document to create a third compressed file;

    finding respective first and second differences in size between the first and second compressed files and the third compressed file; and

    computing a product of the first and second differences; and

    responsively to the compression distance, identifying the pair of documents as having a common author, wherein identifying the pair comprises grouping at least two of the documents between which the compression distance is below a specified threshold as belonging to the common author.

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×