Validating aggregate documents
First Claim
Patent Images
1. A method for preparing an aggregate document comprising:
- attempting to retrieve data pages of the aggregate document;
generating an instance signature for a first instance of a data page retrieved for inclusion in the aggregate document;
comparing the instance signature to a baseline signature associated with a second instance of the data page;
calculating a similarity value in response to the comparing, the similarity value indicating a degree of similarity between the first instance and the second instance of the data page; and
determining whether to include, delete or bypass the data page in the aggregate document based on the similarity value.
6 Assignments
0 Petitions
Accused Products
Abstract
Embodiments described herein are directed to validating an aggregate document. An instance signature can be generated for a first instance of a data page retrieved for inclusion in the aggregate document and can be compared to a baseline signature associated with a second instance of the data page. A similarity value can be calculated in response to the comparison. The similarity value indicates a degree of similarity between the first instance and the second instance of the data page. Based on the similarity value it can be determined whether to delete or bypass the data page in the aggregate document.
21 Citations
20 Claims
-
1. A method for preparing an aggregate document comprising:
-
attempting to retrieve data pages of the aggregate document; generating an instance signature for a first instance of a data page retrieved for inclusion in the aggregate document; comparing the instance signature to a baseline signature associated with a second instance of the data page; calculating a similarity value in response to the comparing, the similarity value indicating a degree of similarity between the first instance and the second instance of the data page; and determining whether to include, delete or bypass the data page in the aggregate document based on the similarity value. - View Dependent Claims (2, 3, 4, 5, 6, 7, 20)
-
-
8. A non-transitory computer readable medium storing instructions executable by a computing system including at least one computing device, wherein execution of the instructions implements a method for preparing an aggregate document comprising:
-
attempting to retrieve data pages of the aggregate document; generating an instance signature for a first instance of a data page retrieved for inclusion in the aggregate document; comparing the instance signature to a baseline signature associated with a second instance of the data page; calculating a similarity value in response to the comparing, the similarity value indicating a degree of similarity between the first instance and the second instance of the data page; and determining whether to include, delete or bypass the data page in the aggregate document based on the similarity value. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A system for preparing an aggregate document comprising:
a computer system including at least one computing device, the computing system configured to; attempt to retrieve data pages of the aggregate document; generate an instance signature for a first instance of a data page retrieved for inclusion in the aggregate document; compare the instance signature to a baseline signature associated with a second instance of the data page; calculate a similarity value in response to the comparing, the similarity value indicating a degree of similarity between the first instance and the second instance of the data page; and determine whether to include, delete or bypass the data page in the aggregate document based on the similarity value. - View Dependent Claims (16, 17, 18, 19)
Specification