×

Document Comparison Method And Apparatus

  • US 20090198677A1
  • Filed: 12/12/2008
  • Published: 08/06/2009
  • Est. Priority Date: 02/05/2008
  • Status: Abandoned Application
First Claim
Patent Images

1. A document comparison and identification method, the method comprising the steps of:

  • identifying, in a source document, words of a predetermined number of characters or greater;

    generating a list containing the identified words, and excluding identified words from said list that occur with a predetermined frequency or greater in a set of documents to be searched;

    searching each of the plurality of documents in the set of documents for occurrences of the identified words stored in the list;

    for each of the plurality of documents, determining how many identified words from the list occur in the document; and

    calculating a similarity of each of the plurality of documents to the source document based on the total number of identified words in the list, the number of identified words in the list occurring in the document, and a predetermined minimum required number of matches.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×