Method and apparatus for identifying matching record candidates
First Claim
1. A method implemented by a health information infrastructure, the method comprising:
- receiving, via a communication interface, a plurality of records, each record having a plurality of demographic attributes associated with an individual, wherein receiving the plurality of records comprises receiving only a portion of a plurality of patient records created by one or more healthcare facilities by receiving, from the one or more healthcare facilities for each of the plurality of patient records created by the one or more healthcare facilities, information defining the demographic attributes associated with the individual, but not receiving information associated with encounters of the individual with a healthcare facility and not receiving documents included in the patient record;
for each record, determining, with processing circuitry, a digest by determining a fuzzy representation of one or more of the plurality of demographic attributes for the respective individual and combining by concatenating into a single string representations of one or more of the plurality of demographic attributes associated with the respective individual including the fuzzy representation of one or more of the plurality of demographic attributes associated with the respective individual;
receiving a query relating to a record for a person and demographic attributes associated with the person;
determining a digest based upon the demographic attributes associated with the person who is a subject of the query, wherein determining the digest comprises determining a fuzzy representation of one or more demographic attributes of the person who is the subject of the query and combining by concatenating into a single string representations of one or more of the demographic attributes of the person who is the subject of the query including the fuzzy representation of one or more demographic attributes of the person who is the subject of the query;
in response to the query, identifying one or more records that are associated with respective individuals who are candidates to match the person based upon a comparison of representations of the digests of the records and a representation of the digest of the person;
for each record that was identified and, as a result, for only a subset of the plurality of records, determining a confidence score by comparing the plurality of demographic attributes associated with the respective individuals to corresponding demographic attributes of the person;
identifying one or more records that are associated with respective individuals who match the person based upon the confidence scores; and
causing at least some of the one or more records that were identified based upon the confidence scores to be associated with respective individuals who are candidates to match the person to be provided via the communication interface.
9 Assignments
0 Petitions
Accused Products
Abstract
A method, computing device and computer program product are provided to identify records that are associated with same person, even in instances in which the records are created and stored by different entities. In a method, a plurality of records are received, each having attributes associated with a person. For each record, the method determines a digest by determining a fuzzy representation of one or more of the attributes for the person and then combining representations of the attributes. The method also receives a query relating to a record for the person and determines a digest based upon the attributes of the person. In response to the query, the method identifies one or more records that are associated with respective individuals who are candidates to match the person based upon a comparison of representations of the digests of the records and the person.
93 Citations
18 Claims
-
1. A method implemented by a health information infrastructure, the method comprising:
-
receiving, via a communication interface, a plurality of records, each record having a plurality of demographic attributes associated with an individual, wherein receiving the plurality of records comprises receiving only a portion of a plurality of patient records created by one or more healthcare facilities by receiving, from the one or more healthcare facilities for each of the plurality of patient records created by the one or more healthcare facilities, information defining the demographic attributes associated with the individual, but not receiving information associated with encounters of the individual with a healthcare facility and not receiving documents included in the patient record; for each record, determining, with processing circuitry, a digest by determining a fuzzy representation of one or more of the plurality of demographic attributes for the respective individual and combining by concatenating into a single string representations of one or more of the plurality of demographic attributes associated with the respective individual including the fuzzy representation of one or more of the plurality of demographic attributes associated with the respective individual; receiving a query relating to a record for a person and demographic attributes associated with the person; determining a digest based upon the demographic attributes associated with the person who is a subject of the query, wherein determining the digest comprises determining a fuzzy representation of one or more demographic attributes of the person who is the subject of the query and combining by concatenating into a single string representations of one or more of the demographic attributes of the person who is the subject of the query including the fuzzy representation of one or more demographic attributes of the person who is the subject of the query; in response to the query, identifying one or more records that are associated with respective individuals who are candidates to match the person based upon a comparison of representations of the digests of the records and a representation of the digest of the person; for each record that was identified and, as a result, for only a subset of the plurality of records, determining a confidence score by comparing the plurality of demographic attributes associated with the respective individuals to corresponding demographic attributes of the person; identifying one or more records that are associated with respective individuals who match the person based upon the confidence scores; and causing at least some of the one or more records that were identified based upon the confidence scores to be associated with respective individuals who are candidates to match the person to be provided via the communication interface. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computing device of a health information infrastructure, the computing device comprising a processing circuitry configured to:
-
receive a plurality of records, each record having a plurality of demographic attributes associated with an individual, wherein the plurality of records are received by receiving only a portion of a plurality of patient records created by one or more healthcare facilities by receiving, from the one or more healthcare facilities for each of the plurality of patient records created by the one or more healthcare facilities, information defining the demographic attributes associated with the individual, but not receiving information associated with encounters of the individual with a healthcare facility and not receiving documents included in the patient record; for each record, determine a digest by determining a fuzzy representation of one or more of the plurality of demographic attributes for the respective individual and combining by concatenating into a single string representations of one or more of the plurality of demographic attributes associated with the respective individual including the fuzzy representation of one or more of the plurality of demographic attributes associated with the respective individual; receive a query relating to a record for a person and demographic attributes associated with the person; determine a digest based upon the demographic attributes associated with the person who is a subject of the query, wherein the digest is determined by determining a fuzzy representation of one or more demographic attributes of the person who is the subject of the query and combining by concatenating into a single string representations of one or more of the demographic attributes of the person who is the subject of the query including the fuzzy representation of one or more demographic attributes of the person who is the subject of the query; in response to the query, identify one or more records that are associated with respective individuals who are candidates to match the person based upon a comparison of representations of the digests of the records and a representation of the digest of the person; for each record that was identified and, as a result, for only a subset of the plurality of records, determine a confidence score by comparing the plurality of demographic attributes associated with the respective individuals to corresponding demographic attributes of the person; identify one or more records that are associated with respective individuals who match the person based upon the confidence scores; and cause at least some of the one or more records that were identified based upon the confidence scores to be associated with respective individuals who are candidates to match the person to be provided via a communication interface. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. A computer program product of a health information infrastructure, the computer program product comprising a non-transitory computer readable storage medium having program code portions stored thereon, the program code portions configured, upon execution, to:
-
receive a plurality of records, each record having a plurality of demographic attributes associated with an individual, wherein the plurality of records are received by receiving only a portion of a plurality of patient records created by one or more healthcare facilities by receiving, from the one or more healthcare facilities for each of the plurality of patient records created by the one or more healthcare facilities, information defining the demographic attributes associated with the individual, but not receiving information associated with encounters of the individual with a healthcare facility and not receiving documents included in the patient record; for each record, determine a digest by determining a fuzzy representation of one or more of the plurality of demographic attributes for the respective individual and combining by concatenating into a single string representations of one or more of the plurality of demographic attributes associated with the respective individual including the fuzzy representation of one or more of the plurality of demographic attributes associated with the respective individual; receive a query relating to a record for a person and demographic attributes associated with the person; determine a digest based upon the demographic attributes associated with the person who is a subject of the query, wherein the digest is determined by determining a fuzzy representation of one or more demographic attributes of the person who is the subject of the query and combining by concatenating into a single string representations of one or more of the demographic attributes of the person who is the subject of the query including the fuzzy representation of one or more demographic attributes of the person who is the subject of the query; in response to the query, identify one or more records that are associated with respective individuals who are candidates to match the person based upon a comparison of representations of the digests of the records and a representation of the digest of the person; for each record that was identified and, as a result, for only a subset of the plurality of records, determine a confidence score by comparing the plurality of demographic attributes associated with the respective individuals to corresponding demographic attributes of the person; identify one or more records that are associated with respective individuals who match the person based upon the confidence scores; and cause at least some of the one or more records that were identified based upon the confidence scores to be associated with respective individuals who are candidates to match the person to be provided via a communication interface. - View Dependent Claims (16, 17, 18)
-
Specification