SYSTEMS AND METHODS FOR EFFICIENT DETECTION OF FINGERPRINTED DATA AND INFORMATION
First Claim
Patent Images
1. A system for compacting fingerprints of electronic content, the system comprising:
- a compact representation module having a processor, the processor configured to hash the contents of the electronic content to create an electronic fingerprint of the content.
18 Assignments
0 Petitions
Accused Products
Abstract
The disclosed embodiments provide systems, methods, and apparatus for efficient detection of fingerprinted content and relate generally to the field of information (or data) leak prevention. Particularly, a compact and efficient repository of fingerprint ingredients is used to analyze content and determine the content'"'"'s similarity to previously fingerprinted content. Some embodiments employ probabilistic indications regarding the existence of fingerprint ingredients in the repository.
-
Citations
33 Claims
-
1. A system for compacting fingerprints of electronic content, the system comprising:
a compact representation module having a processor, the processor configured to hash the contents of the electronic content to create an electronic fingerprint of the content. - View Dependent Claims (2, 3, 4, 5, 6)
-
7. A system for applying a transmission policy on electronic content stored in an electronic network, the system comprising:
-
a scanning engine having a processor and configured to identify electronic content; a fingerprinting module in electronic communication with the scanning engine and configured to hash the electronic content in order to generate a compact fingerprint of the electronic content identified by the scanning engine; at least one compact fingerprint repository in electronic communication with the fingerprinting module and configured to store fingerprints generated by the fingerprinting module; a content processor in electronic communication with the electronic network, the content processor configured to hash new electronic content of the electronic network; a match estimator in electronic communication with the fingerprint repository and the content processor, the match estimator configured to determine probabilistic matches between the fingerprint of the new electronic content and a fingerprint stored in the fingerprint repository; and a decision module in electronic communication with the match estimator and the electronic network, the decision module configured to apply the transmission policy to the new electronic content based upon an output from the match estimator. - View Dependent Claims (8, 9)
-
-
10. A method of transmitting electronic content over an electronic network, the method comprising the steps:
-
identifying electronic content on the electronic network with a scanning engine; fingerprinting the electronic content with at least one hash to generate a compact fingerprint of the electronic content; storing at least one compact fingerprint in a compact fingerprint repository; fingerprinting new electronic content with at least one hash, the new electronic content to be transmitted over the electronic network; comparing the fingerprint of the new electronic content with fingerprints stored in the compact fingerprint repository; determining whether to transmit the new electronic content over the electronic network in response to the comparison of the fingerprints. - View Dependent Claims (11, 12, 13)
-
-
14. A system for compacting fingerprints of an electronic content, the system comprising:
one or more processors, wherein; at least one of the one or more processors is configured to receive at least a portion of a first electronic fingerprint; and at least one of the one or more processors is configured to flag one or more addresses in an array, the one or more addresses associated with the at least a portion of the first electronic fingerprint. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21)
-
22. A system for applying a transmission policy on electronic content stored in an electronic network, the system comprising:
-
a scanning engine processor configured to identify electronic content; a hashing processor configured to receive at least a portion of an electronic fingerprint of the electronic content identified by the scanning engine processor, the hashing processor further configured to flag a plurality of addresses associated with the at least a portion of an electronic fingerprint in an array, the hashing processor in electrical communication with the scanning engine processor; at least one compact fingerprint repository in electrical communication with the hashing processor and configured to store the array associated with the at least a portion of an electronic fingerprint; a content processor in electrical communication with the electronic network, the content processor configured to generate fingerprints of new electronic content of the electronic network; a match estimation processor in electrical communication with the fingerprint repository and the content processor, the match estimation processor configured to determine probabilistic matches between the fingerprint of the new electronic content and the fingerprints stored in the fingerprint repository; and a decision processor in electrical communication with the match estimation processor and the electronic network, the decision processor configured to apply the transmission policy to the new electronic content based upon an output from the match estimation processor. - View Dependent Claims (23, 24, 25, 28)
-
-
26. A method of transmitting electronic content over an electronic network comprising:
performing the following on one or more electronic processors; identifying electronic content on the electronic network; generating a new fingerprint of the electronic content; comparing the new fingerprint of the electronic content with fingerprints stored in a compact fingerprint repository, comprising determining the one or more addresses flagged by new fingerprint when applied to a hash function and comparing these addresses with the one or more addresses flagged by the fingerprint of the repository when applied to the hash function; and determining whether to transmit the new electronic content over the electronic network in response to the comparison of the fingerprints. - View Dependent Claims (27, 29)
-
30. A computer-readable medium comprising code configured to cause one or more processors to perform the following:
-
receiving at least a portion of a first electronic fingerprint; flagging one or more addresses in an array, the one or more addresses associated with the at least a portion of the first electronic fingerprint; storing the array; receiving at least a portion of a second electronic fingerprint; determining if an address associated with the at least a portion of the second electronic fingerprint is the same as an address associated with the first electronic fingerprint. - View Dependent Claims (31, 32, 33)
-
Specification