Obfuscating document stylometry
First Claim
Patent Images
1. A method, implementable at least in part by a computing device, comprising:
- comparing indicators of distinctive stylometry in a document with corresponding indicators of stylometry in a stylometric reference; and
providing one or more alterations to the document that alter the indicators of distinctive stylometry compared to the stylometric reference.
2 Assignments
0 Petitions
Accused Products
Abstract
A new system has been invented that can obfuscate the stylometry of a document. This may be used to anonymize a document and make it resistant to forensic stylometry analysis, or to mimic the style of an existing set of documents, for example. A system may compare indicators of distinctive stylometry in a document with corresponding indicators of distinctive stylometry in a stylometric reference, and provide one or more alterations to the document that alter the indicators of distinctive stylometry compared to the stylometric reference, according to one illustrative embodiment.
47 Citations
20 Claims
-
1. A method, implementable at least in part by a computing device, comprising:
-
comparing indicators of distinctive stylometry in a document with corresponding indicators of stylometry in a stylometric reference; and providing one or more alterations to the document that alter the indicators of distinctive stylometry compared to the stylometric reference. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A method, implementable at least in part by a computing device, comprising:
-
measuring a value of one or more linguistic features in a target corpus; measuring a value of one or more linguistic features in an input document; comparing the value of the linguistic features in the target corpus with the value of the linguistic features in the input document; and replacing one or more of the linguistic features in the input document with one or more of the linguistic features in the target corpus. - View Dependent Claims (19)
-
-
20. A medium, readable by a computing device and comprising executable instructions that are executable by the computing device, wherein the executable instructions enable a computing device to:
-
evaluate linguistic features in a document compared with a reference corpus; determine one or more of the linguistic features in the document that are stylometrically distinctive relative to the reference corpus; and modify one or more of the linguistic features in the document to make them less stylometrically distinctive relative to the reference corpus.
-
Specification