×

Automatic identification of document versions

  • US 8,315,997 B1
  • Filed: 08/28/2008
  • Issued: 11/20/2012
  • Est. Priority Date: 08/28/2007
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method for document management, the method comprising:

  • receiving an input document containing an input spreadsheet;

    computing a respective measure of similarity between the input spreadsheet and each of a plurality of stored spreadsheets contained in a group of stored documents;

    identifying one or more of the stored spreadsheets as versions of the input spreadsheet responsively to the measure of the similarity; and

    outputting an identification of the stored documents that are versions of the input document responsively to having identified the one or more of the stored spreadsheets as versions of the input spreadsheet,wherein computing the respective measure of the similarity comprises extracting respective formulas from the cells of the input and stored spreadsheets and computing respective data values of the cells of the input and stored spreadsheets, and comparing both the formulas and the data values in order to compute the respective measure of the similarity,wherein comparing both the formulas and the data values comprises computing a first association rate with respect to the formulas and computing a second association rate with respect to the data values, and finding the measure of the similarity as a weighted sum of the first and second association rates.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×