System and method for string filtering
First Claim
Patent Images
1. A computer-based method for scanning data for one or more known data strings, the method comprising:
- (a) compressing a data string comprised of a plurality of characters to obtain a data string signature wherein compressing the data string comprises compressing each character in a manner independent of other characters in the data string and concatenating the compressed representations of each character;
(b) determining if the data-string signature matches a stored string signature; and
(c) if the signature is determined to have a match, then marking the data string with an indicator identifying the data string as a known data string.
1 Assignment
0 Petitions
Accused Products
Abstract
A computer-based method for scanning data using string filtering includes compressing a data string using a hashing algorithm to obtain a data-string signature, and then determining that the data-string signature matches a known data-string signatures stored in a lookup table if the signature is determined to have a match in the lookup table.
29 Citations
32 Claims
-
1. A computer-based method for scanning data for one or more known data strings, the method comprising:
-
(a) compressing a data string comprised of a plurality of characters to obtain a data string signature wherein compressing the data string comprises compressing each character in a manner independent of other characters in the data string and concatenating the compressed representations of each character; (b) determining if the data-string signature matches a stored string signature; and (c) if the signature is determined to have a match, then marking the data string with an indicator identifying the data string as a known data string. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A computer-based method for scanning data for known strings of data, the method comprising:
-
(a) receiving a stream of data strings, each data string comprising a plurality of eight-bit characters; (b) determining a three-bit hash code for each of the plurality of characters in the data string with the determination of the hash code for each character being independent of other characters in the data string; (c) concatenating each determined three-bit hash code to create a string signature; (d) determining that a data string contains a known data string if the string signature matches a string signature in a predetermined list of string signatures; and (e) if the data string is determined to contain a known data string, then marking the data string with an indicator identifying the data string as a known data string. - View Dependent Claims (15, 16, 17, 18)
-
-
19. A computer-based method of scanning data for known strings of data, the method comprising:
-
(a) receiving a stream of data strings at a filter, the stream of data comprising a plurality of characters; (b) determining a first hash code for the first character in the stream of data, (c) determining a second hash code for the second character in the stream of data in a manner independent of the first character; (d) concatenating the first hash code with the second hash code; (e) comparing the concatenated hash codes to a list of known concatenated hash codes; and (f) if the concatenated hash codes match known concatenated hash codes, then marking the data string with an indicator identifying the data string as a known string of data. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26)
-
-
27. A system for scanning data for known strings of data, the system comprising:
-
(a) a lookup table in a memory operable to store known data-string signatures; and (b) a string-filtering unit connected to a network and coupled to the memory, the string filtering unit operable to;
(i) compress a data string comprised of a plurality of characters to obtain a data string signature;
(ii) determine if the data string signature matches a known data-string signatures in the lookup table; and
(iii) if the data string signature is determined to match a known data-string signature, then marking the data string with an indicator identifying the data string as a known string of data,wherein the string-filtering unit is operable to compress the data string by compressing each character in a manner independent of other characters in the data string and concatenating the compressed representations of each character. - View Dependent Claims (28, 29, 30, 31, 32)
-
Specification