System, method and software for providing persistent entity identification and linking entity information in a data repository
First Claim
1. A computer-implemented method of creating a data structure stored in one or more data storage devices, wherein the data structure provides information to link records that are related to a same entity of a plurality of entities, for database searching and record matching operations, the method comprising:
- (a) associating a first record, of a plurality of records, with a first identifier of a plurality of identifiers that identifies at least one entity of the plurality of entities, wherein each identifier is unique and persistent for a predetermined period of time;
(b) associating a second record, of the plurality of records, with a second identifier that identifies at least one entity of the plurality of entities;
(c) determining a plurality of potentially matching records, and from the plurality of potentially matching records, differentially weighting one or more records having at least one match variation;
(d) from the plurality of potentially matching records, and using the differential weighting, eliminating a potentially matching record which does not match a selected or predetermined element of the first or second records within a selected or predetermined confidence level;
(e) determining that the first identifier and the second identifier correspond to a same entity by determining that a third record, of the plurality of potentially matching records, matches with a first element of a first plurality of elements of the first record within the selected or predetermined confidence level and further matches with a second element of a second plurality of elements of the second record within the selected or predetermined confidence level;
(f) linking the first identifier and the second identifier in a data structure that respectively references the first and second records without merging the first and second records under either the first or second identifiers and without merging the first and second identifiers; and
(g) using the link between the first and second identifiers in the data structure, grouping and providing retrieved information that is separately maintained in both the first and second records in response to a request for information about the same entity.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention provides a system, method and software for linking persistent and unique business identifiers, associated with business entities, for associating corresponding business records without a loss of historical records, regardless of subsequent changes, updates, and mergers. In the various embodiments, a first business record, of the plurality of business records, is associated with a first business identifier; and a second business record, which does not match the first record, is associated with a second business identifier. Subsequently, when the first business identifier and the second business identifier are determined to correspond to a singular business entity, the first business identifier and the second business identifier are linked, and respective associations of the first business record and the second business record with the linked first business identifier and second business identifier are maintained, with corresponding source references, and collectively utilized in subsequent search and match determinations.
-
Citations
27 Claims
-
1. A computer-implemented method of creating a data structure stored in one or more data storage devices, wherein the data structure provides information to link records that are related to a same entity of a plurality of entities, for database searching and record matching operations, the method comprising:
-
(a) associating a first record, of a plurality of records, with a first identifier of a plurality of identifiers that identifies at least one entity of the plurality of entities, wherein each identifier is unique and persistent for a predetermined period of time; (b) associating a second record, of the plurality of records, with a second identifier that identifies at least one entity of the plurality of entities; (c) determining a plurality of potentially matching records, and from the plurality of potentially matching records, differentially weighting one or more records having at least one match variation; (d) from the plurality of potentially matching records, and using the differential weighting, eliminating a potentially matching record which does not match a selected or predetermined element of the first or second records within a selected or predetermined confidence level; (e) determining that the first identifier and the second identifier correspond to a same entity by determining that a third record, of the plurality of potentially matching records, matches with a first element of a first plurality of elements of the first record within the selected or predetermined confidence level and further matches with a second element of a second plurality of elements of the second record within the selected or predetermined confidence level; (f) linking the first identifier and the second identifier in a data structure that respectively references the first and second records without merging the first and second records under either the first or second identifiers and without merging the first and second identifiers; and (g) using the link between the first and second identifiers in the data structure, grouping and providing retrieved information that is separately maintained in both the first and second records in response to a request for information about the same entity. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A system for creating a data structure to link records that are related to a same entity of a plurality of entities, for data repository searching and for record matching operations, the system comprising:
-
one or more data storage devices to store one or more data repositories having a plurality of records and a plurality of identifiers, each identifier of the plurality of identifiers identifying an entity of the plurality of entities, wherein each identifier is unique and persistent for a predetermined period of time; and one or more processors coupled to the one or more data storage devices, the one or more processors adapted, configured or programmed to associate a first record of the plurality of records with a first identifier that identifies at least one entity of the plurality of entities;
to associate a second record of the plurality of records with a second identifier that identifies at least one entity of the plurality of entities;
to determine a plurality of potentially matching records, and from the plurality of potentially matching records, to differentially weight one or more records having at least one match variation, and using the differential weighting, to eliminate a potentially matching record which does not match a selected or predetermined element of the first or second records within a selected or predetermined confidence level;
to determine that the first identifier and the second identifier correspond to the same entity by determining that a third record, of the plurality of potentially matching records, matches with a first element of a first plurality of elements of the first record within the selected or predetermined confidence level and further matches with a second element of a second plurality of elements of the second record within the selected or predetermined confidence level;
to link in the one or more data repositories the first identifier and the second identifier in a data structure that respectively references the first and second records without merging the first and second records under either the first or second identifiers and without merging the first and second identifiers; and
using the link between the first and second identifiers in the data structure, to retrieve and group information that is separately maintained in both the first and second records in response to a request for information about the same entity. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
-
-
27. A tangible computer-readable medium storing computer-readable software for creating a data structure in one or more data repositories stored in one or more data storage devices, wherein the data structure provides information to link records that are related to a same entity of a plurality of entities, for database searching and record matching operations, the tangible computer-readable medium storing computer-readable software comprising:
-
executable code for associating a first record, of a plurality of records of the one or more data repositories stored in the one or more data storage devices, with a first identifier of a plurality of identifiers that identifies at least one entity of the plurality of entities, wherein each identifier is unique and persistent for a predetermined period of time; executable code for associating a second record, of the plurality of records, with a second identifier that identifies at least one entity of the plurality of entities; executable code for determining a plurality of potentially matching records, and from the plurality of potentially matching records, differentially weighting one or more records having at least one match variation; executable code for determining a plurality of potentially matching records, and from the plurality of potentially matching records and using the differential weighting, eliminating a potentially matching record which does not sufficiently match a selected or predetermined element of the first or second records within a selected or predetermined confidence level; executable code for determining that the first identifier and the second identifier correspond to the same entity by determining that a third record, of the plurality of potentially matching records, matches with a first element of a first plurality of elements of the first record within the selected or predetermined confidence level and further matches with a second element of a second plurality of elements of the second record within a selected or predetermined confidence level; executable code for linking the first identifier and the second identifier in at least one data structure that respectively references the first and second records without merging the first and second records under either the first or second identifiers and without merging the first and second identifiers; and executable code for using the link between the first and second identifiers in the data structure and grouping and providing retrieved information that is separately maintained in both the first and second records in response to a request for information about the same entity.
-
Specification