Systems and methods for managing a master patient index including duplicate record detection
First Claim
1. A method comprising:
receiving, by a processor, a plurality of electronic healthcare records each having a plurality of different fields;
determining, by the processor, a list of distinct values appearing in each of the plurality of different fields wherein each of the distinct values on the list is associated with one or more of the plurality of electronic healthcare records;
arranging, by the processor, values appearing in each of the plurality of different fields into an inverted index format to reduce CPU resources when performing real-time searches across all of the plurality of electronic healthcare records to find duplicate matches among the plurality of electronic healthcare records wherein the inverted index format includes the list of the distinct values and for each distinct value in the list at least one pointer to a healthcare record among the plurality of electronic healthcare records;
storing to a memory device, by the processor, the list of the distinct values and their associated pointers as a master patient index database arranged according to the inverted index format;
generating on a display, by the processor, an interface configured to receive a plurality of search input terms and a first duplicate probability score weight associated with a first search input term among the plurality of search input terms wherein the first duplicate probability score weight is used to determine a contribution to a duplicate probability score for each of the plurality of electronic healthcare records which include a match to the first search input term;
based upon the plurality of search input terms including the first search input term, searching, by the processor, across all of the plurality of electronic healthcare records in the master patient index database;
determining, by the processor, a first plurality of electronic healthcare records which include one or more of the plurality of search input terms;
based upon the first duplicate probability score weight, determining, by the processor, first duplicate probability scores for each of the first plurality of electronic healthcare records;
based upon the first duplicate probability scores, determining, by the processor, whether two or more of the first plurality of electronic healthcare records are duplicates;
receiving, via the interface, a second duplicate probability score weight associated with the first input term;
based upon the second duplicate probability score weight, determining, by the processor, second duplicate probability scores for each of the first plurality of electronic healthcare records; and
based upon the second duplicate probability scores, determining, by the processor, whether two or more of the first plurality of electronic healthcare records are duplicates.
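The steps of claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the record contents, the field names, the integer weights, the threshold values, and the use of a plain weighted sum as the "duplicate probability score" are all assumptions made for illustration, since the claim does not specify a scoring formula.

```python
from collections import defaultdict

# Hypothetical sample records; field names and values are illustrative only.
RECORDS = {
    1: {"last_name": "smith", "first_name": "john", "dob": "1980-01-02"},
    2: {"last_name": "smith", "first_name": "jon", "dob": "1980-01-02"},
    3: {"last_name": "jones", "first_name": "mary", "dob": "1975-06-30"},
}


def build_inverted_index(records):
    """Map each field to {distinct value -> set of record ids (the pointers)}."""
    index = defaultdict(lambda: defaultdict(set))
    for record_id, fields in records.items():
        for field, value in fields.items():
            index[field][value].add(record_id)
    return index


def score_candidates(index, search_terms, weights):
    """Add a field's weight to every record that matches that search term.

    A weighted sum is an assumed stand-in for the duplicate probability score.
    Only records matching at least one term appear in the result.
    """
    scores = defaultdict(float)
    for field, value in search_terms.items():
        for record_id in index.get(field, {}).get(value, ()):
            scores[record_id] += weights.get(field, 0)
    return dict(scores)


def flag_duplicates(scores, threshold):
    """Return record ids at or above the threshold, if at least two qualify."""
    hits = sorted(rid for rid, s in scores.items() if s >= threshold)
    return hits if len(hits) >= 2 else []


index = build_inverted_index(RECORDS)
terms = {"last_name": "smith", "dob": "1980-01-02", "first_name": "john"}

# First pass: initial weight for the first_name term.
first = score_candidates(index, terms, {"last_name": 30, "dob": 50, "first_name": 20})
print(flag_duplicates(first, threshold=75))    # [1, 2]

# Second pass: the user raises the first_name weight and a stricter
# (assumed) threshold is applied to the rescored candidates.
second = score_candidates(index, terms, {"last_name": 30, "dob": 50, "first_name": 90})
print(flag_duplicates(second, threshold=100))  # []
```

Because each search term is resolved by a dictionary lookup rather than a scan of every record, only records that share at least one value with the query are ever scored, which is the saving the claim attributes to the inverted index format.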
Abstract
A system for managing a master patient index is described. The master patient index database is constructed using inverted indices. The inverted index formulation enables faster, more complete and more flexible duplicate detection as compared to traditional master patient database management techniques. A master patient index management system including a remote user interface configured to leverage the inverted index formulation is described. The user interface includes features for managing records in an MPI database including identifying, efficiently comparing, updating and merging duplicate records across a heterogeneous healthcare organization.
56 Citations
19 Claims
17. A method comprising:
receiving, by a processor, a plurality of electronic healthcare records each having a plurality of different fields;
determining, by the processor, a list of distinct values appearing in each of the plurality of different fields wherein each of the distinct values on the list is associated with one or more of the plurality of electronic healthcare records;
arranging, by the processor, values appearing in each of the plurality of different fields into an inverted index format to reduce CPU resources when performing real-time searches across all of the plurality of electronic healthcare records to find duplicate matches among the plurality of electronic healthcare records wherein the inverted index format includes the list of the distinct values and for each distinct value in the list at least one pointer to a healthcare record among the plurality of electronic healthcare records;
storing to a memory device, by the processor, the list of the distinct values and their associated pointers as a master patient index database arranged according to the inverted index format;
receiving, by the processor, a plurality of search input terms and a selection of a first set of duplicate probability score weights wherein the first set of duplicate probability score weights are used to determine a contribution to a duplicate probability score for each of the plurality of electronic healthcare records which match the plurality of search input terms;
based upon the plurality of search input terms, searching, by the processor, across all of the plurality of electronic healthcare records in the master patient index database;
determining, by the processor, a first plurality of electronic healthcare records which include one or more of the plurality of search input terms;
based upon the first set of duplicate probability score weights, determining, by the processor, first duplicate probability scores for each of the first plurality of electronic healthcare records;
based upon the first duplicate probability scores, determining, by the processor, whether two or more of the first plurality of electronic healthcare records are duplicates;
receiving, by the processor, a selection of a second set of duplicate probability score weights;
based upon the second set of duplicate probability score weights, determining, by the processor, second duplicate probability scores for each of the first plurality of electronic healthcare records; and
based upon the second duplicate probability scores, determining, by the processor, whether two or more of the first plurality of electronic healthcare records are duplicates.
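Claim 17 differs from claim 1 in that whole sets of weights are selected rather than a single weight being adjusted. The following sketch shows one way that could look. The set names, the field names (including "ssn"), the weights, and the threshold are hypothetical, and the matched-field data is assumed to come from an earlier lookup in the inverted index.

```python
# Hypothetical named weight sets; names and numbers are illustrative assumptions.
WEIGHT_SETS = {
    "strict_identity": {"ssn": 70, "dob": 20, "last_name": 10},
    "demographic": {"ssn": 20, "dob": 40, "last_name": 40},
}

# Fields on which each candidate record matched the search terms
# (assumed output of an earlier inverted-index search).
MATCHED_FIELDS = {
    "rec-A": {"ssn", "dob", "last_name"},
    "rec-B": {"dob", "last_name"},
    "rec-C": {"last_name"},
}


def rescore(matched_fields, weight_set):
    """Score each candidate as the sum of the weights of its matched fields."""
    return {
        record_id: sum(weight_set.get(field, 0) for field in fields)
        for record_id, fields in matched_fields.items()
    }


def likely_duplicates(scores, threshold):
    """Return the ids of candidates whose score reaches the threshold."""
    return sorted(rid for rid, score in scores.items() if score >= threshold)


# The same candidate set is scored twice, once per selected weight set.
first_scores = rescore(MATCHED_FIELDS, WEIGHT_SETS["strict_identity"])
second_scores = rescore(MATCHED_FIELDS, WEIGHT_SETS["demographic"])

print(likely_duplicates(first_scores, threshold=75))   # ['rec-A']
print(likely_duplicates(second_scores, threshold=75))  # ['rec-A', 'rec-B']
```

Note that the search itself is not repeated: only the scoring step is rerun over the first plurality of records, so switching weight sets is cheap relative to the initial lookup.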
18. A method comprising:
receiving, by a processor, a plurality of electronic healthcare records each having a plurality of different fields;
determining, by the processor, a list of distinct values appearing in each of the plurality of different fields wherein each of the distinct values on the list is associated with one or more of the plurality of electronic healthcare records;
arranging, by the processor, values appearing in each of the plurality of different fields into an inverted index format to reduce CPU resources when performing real-time searches across all of the plurality of electronic healthcare records to find duplicate matches among the plurality of electronic healthcare records wherein the inverted index format includes the list of the distinct values and for each distinct value in the list at least one pointer to a healthcare record among the plurality of electronic healthcare records;
storing to a memory device, by the processor, the list of the distinct values and their associated pointers as a master patient index database arranged according to the inverted index format;
receiving, by the processor, a plurality of search input terms and user identification information;
based upon the user identification information, selecting a set of duplicate probability score weights wherein the set of duplicate probability score weights are used to determine a contribution to a duplicate probability score for each of the plurality of electronic healthcare records which match the plurality of search input terms;
based upon the plurality of search input terms, searching, by the processor, across all of the plurality of electronic healthcare records in the master patient index database;
determining, by the processor, a first plurality of electronic healthcare records which include one or more of the plurality of search input terms;
based upon the set of duplicate probability score weights, determining, by the processor, first duplicate probability scores for each of the first plurality of electronic healthcare records; and
based upon the first duplicate probability scores, determining, by the processor, whether two or more of the first plurality of electronic healthcare records are duplicates.
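The distinguishing step of claim 18, selecting weights from user identification information, might be sketched as a simple lookup with a fallback. The user identifier, the field names, the weight values, and the existence of a default set are all assumptions; the claim does not say how the selection is made or what happens for an unknown user.

```python
# Hypothetical default weights used when a user has no stored preference.
DEFAULT_WEIGHTS = {"last_name": 40, "dob": 40, "zip": 20}

# Hypothetical per-user weight sets keyed by user identification information.
USER_WEIGHTS = {
    "registrar-01": {"last_name": 30, "dob": 60, "zip": 10},
}


def weights_for_user(user_id):
    """Select the weight set tied to the user, falling back to the default."""
    return USER_WEIGHTS.get(user_id, DEFAULT_WEIGHTS)


def duplicate_score(matched_fields, weights):
    """Sum the weights of the fields on which a record matched the search."""
    return sum(weights.get(field, 0) for field in matched_fields)


# The same match pattern scores differently depending on who is searching.
matched = {"dob", "zip"}
print(duplicate_score(matched, weights_for_user("registrar-01")))  # 70
print(duplicate_score(matched, weights_for_user("unknown-user")))  # 60
```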
Specification