Public Electronic Document Dating List
First Claim
1. A computer implemented method of scoring a plurality of documents, comprising:
- identifying a plurality of linked documents;
identifying linking documents that link to the linked documents;
determining a score for each of the linked documents based on scores of the linking documents that link to the linked document; and
processing the linked document according to the determined scores;
wherein the improvement comprises;
generating a first integrity verification code (IVC) for each of the linked documents;
identifying at least one set of duplicates, using the IVCs; and
for a first linked document in the set of duplicates, adjusting at least one search result list generation parameter selected from the list consisting of;
the score, anda ranking of the document in a search result list.
0 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods are disclosed which enable the establishment of file dates and the absence of tampering, even for documents held in secrecy and those stored in uncontrolled environments, but which does not require trusting a timestamping authority or document archival service. A trusted timestamping authority (TTSA) may be used, but even if the TTSA loses credibility or a challenger refuses to acknowledge the validity of a timestamp, a date for an electronic document may still be established. Systems and methods are disclosed which enable detection of file duplication in large collections of documents, which can improve searching for documents within the large collection.
34 Citations
20 Claims
-
1. A computer implemented method of scoring a plurality of documents, comprising:
-
identifying a plurality of linked documents; identifying linking documents that link to the linked documents; determining a score for each of the linked documents based on scores of the linking documents that link to the linked document; and processing the linked document according to the determined scores; wherein the improvement comprises; generating a first integrity verification code (IVC) for each of the linked documents; identifying at least one set of duplicates, using the IVCs; and for a first linked document in the set of duplicates, adjusting at least one search result list generation parameter selected from the list consisting of; the score, and a ranking of the document in a search result list. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer program embodied on a computer executable medium and configured to be executed by a processor, the program comprising:
-
code for identifying a plurality of linked documents; code for identifying linking documents that link to the linked documents; code for determining a score for each of the linked documents based on scores of the linking documents that link to the linked document; code for identifying, within the plurality of linked documents, at least one set of duplicates; and code for adjusting at least one search result list generation parameter responsive to identifying the set of duplicates.
-
-
20. An apparatus for scoring a plurality of documents, the apparatus comprising:
-
a processor; a computer readable medium comprising; a database correlating locations of each of a plurality of linked documents with keywords, importance scores, and indicia of content duplication, and a search module configured to adjusting at least one search result list generation parameter selected from the list consisting of; the importance score correlated with a location of a linked document, and a ranking of the document in a search result list.
-
Specification