METHODS, APPARATUS AND SYSTEM FOR IDENTIFYING A DOCUMENT
First Claim
Patent Images
1. A computerized method for document processing, the method comprising:
- obtaining one or more counts of remarkable characteristics of a first document and one or more counts of remarkable characteristics of a second document;
comparing at least one of the one or more counts of remarkable characteristics of the first document and at least one of one or more corresponding counts of remarkable characteristics of the second document; and
determining a document similarity score between the first and second document.
3 Assignments
0 Petitions
Accused Products
Abstract
A computerized method for identifying a document. A signature may be determined for a first document and compared with a signature for each of one or more additional documents. A document similarity score may be determined and one or more similar documents may be identified based on the document similarity score.
-
Citations
20 Claims
-
1. A computerized method for document processing, the method comprising:
-
obtaining one or more counts of remarkable characteristics of a first document and one or more counts of remarkable characteristics of a second document; comparing at least one of the one or more counts of remarkable characteristics of the first document and at least one of one or more corresponding counts of remarkable characteristics of the second document; and determining a document similarity score between the first and second document. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 15)
-
- 10. The computerized method of claim 9, further comprising repeating the selecting and determining the absolute value steps for each of a plurality of cells of the first document.
-
16. A computerized method for document processing, the method comprising:
-
capturing an image of a printed document; determining one or more counts of remarkable characteristics of the printed document; comparing at least one of the one or more counts of remarkable characteristics of the first document and at least one of one or more corresponding counts of remarkable characteristics of a second document of a plurality of stored documents; determining a document similarity score; repeating the comparing and determining steps for each of the plurality of stored documents; and identifying one or more of the plurality of stored documents corresponding to one or more lowest document similarity scores. - View Dependent Claims (17, 18)
-
-
19. Apparatus for document processing, the apparatus comprising:
-
a processor; memory to store instructions that, when executed by the processor cause the processor to; obtain one or more counts of remarkable characteristics of a first document and one or more counts of remarkable characteristics of a second document; compare at least one of the one or more counts of remarkable characteristics of the first document and at least one of one or more corresponding counts of remarkable characteristics of the second document; and determine a document similarity score between the first and second document.
-
-
20. A computer-readable medium embodying instructions that, when executed by a processor perform operations comprising:
-
obtaining one or more counts of remarkable characteristics of a first document and one or more counts of remarkable characteristics of a second document; comparing at least one of the one or more counts of remarkable characteristics of the first document and at least one of one or more corresponding counts of remarkable characteristics of the second document; and determining a document similarity score between the first and second document.
-
Specification