Similarity string filtering
First Claim
1. A method for optimizing similarity string filtering, the method comprising:
- selecting a first data string comprising one or more data characters;
selecting a second data string comprising one or more data characters;
applying at least one of a defined set of shapes to the first data string to generate one or more patterns associated with the first data string, each of the shapes in the defined set of shapes capable of being configured to apply an operation, where the operation is a skip operation, a transposition operation, or a stop operation;
applying at least one of the defined set of shapes to the second data string to generate one or more patterns associated with the second data string;
comparing the one or more patterns associated with the first data string with the one or more patterns associated with the second data string to determine if one or more matching patterns exist; and
linking the first data string and the second data string if one or more matching patterns exist.
2 Assignments
0 Petitions
Accused Products
Abstract
A method, computer program and system for optimizing similarity string filtering are disclosed. A first data string comprising one or more data characters and selecting a second data string comprising one or more data characters are selected. At least one of a defined set of shapes is applied to the first data string to generate one or more patterns associated with the first data string. At least one of the defined set of shapes is applied to the second data string to generate one or more patterns associated with the second data string. The one or more patterns associated with the first data string are compared with the one or more patterns associated with the second data string to determine if one or more matching patterns exist. The first data string and the second data string are linked if one or more matching patterns exist.
-
Citations
52 Claims
-
1. A method for optimizing similarity string filtering, the method comprising:
-
selecting a first data string comprising one or more data characters; selecting a second data string comprising one or more data characters; applying at least one of a defined set of shapes to the first data string to generate one or more patterns associated with the first data string, each of the shapes in the defined set of shapes capable of being configured to apply an operation, where the operation is a skip operation, a transposition operation, or a stop operation; applying at least one of the defined set of shapes to the second data string to generate one or more patterns associated with the second data string; comparing the one or more patterns associated with the first data string with the one or more patterns associated with the second data string to determine if one or more matching patterns exist; and linking the first data string and the second data string if one or more matching patterns exist. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A method for optimizing similarity string filtering, the method comprising:
-
selecting a first set of data strings, each data string comprising one or more data characters; applying at least one of a defined set of shapes to the first set of data strings to generate a first set of patterns associated with the first set of data strings, each of the shapes in the defined set of shapes capable of being configured to apply an operation, where the operation is a skip operation, a transposition operation, or a stop operation; selecting a second set of data strings, each data string comprising one or more data characters; applying at least one of the defined set of shapes to the second set of data strings to generate a second set of patterns associated with the second set of data strings; comparing the first set of patterns with the second set of patterns to determine at least one matching pattern, a matching pattern comprising at least one pattern from the first set of patterns and at least one pattern from the second set of patterns; and linking a first data string associated with the at least one pattern from the first set of patterns in the matching pattern and a second data string associated with the at least one pattern from the second set of patterns in the matching patterns. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
-
-
27. A computer program, stored on a tangible storage medium, for optimizing similarity string filtering, the program including executable instructions that cause a computer to:
-
select a first data string comprising one or more data characters; select a second data string comprising one or more data characters; apply at least one of a defined set of shapes to the first data string to generate one or more patterns associated with the first data string; apply at least one of the defined set of shapes to the second data string to generate one or more patterns associated with the second data string, each of the shapes in the defined set of shapes capable of being configured to apply an operation, where the operation is a skip operation, a transposition operation, or a stop operation; compare the one or more patterns associated with the first data string with the one or more patterns associated with the second data string to determine if one or more matching patterns exist; and link the first data string and the second data string if one or more matching patterns exist. - View Dependent Claims (28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39)
-
-
40. A computer program, stored on a tangible storage medium, for optimizing similarity string filtering, the program including executable instructions that cause a computer to:
-
select a first set of data strings, each data string comprising one or more data characters; apply at least one of a defined set of shapes to the first set of data strings to generate a first set of patterns associated with the first set of data strings, each of the shapes in the defined set of shapes capable of being configured to apply an operation, where the operation is a skip operation, a transposition operation, or a stop operation; select a second set of data strings, each data string comprising one or more data characters; apply at least one of the defined set of shapes to the second set of data strings to generate a second set of patterns associated with the second set of data strings; compare the first set of patterns with the second set of patterns to determine at least one matching pattern, a matching pattern comprising at least one pattern from the first set of patterns and at least one pattern from the second set of patterns; and link a first data string associated with the at least one pattern from the first set of patterns in the matching pattern and a second data string associated with the at least one pattern from the second set of patterns in the matching patterns. - View Dependent Claims (41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52)
-
Specification