Systems and methods for managing a master patient index including duplicate record detection
First Claim
Patent Images
1. A method of managing electronic protected healthcare information comprising:
- receiving, by a processor, a first plurality of healthcare records and a second plurality of healthcare records each of the healthcare records having electronic protected healthcare information;
arranging, by the processor, the electronic protected healthcare information in each of the healthcare records within the first and the second plurality of healthcare records into a plurality of different fields wherein each of the plurality of different fields accept at least one value;
determining, by the processor, a list of distinct values appearing in each of the plurality of different fields wherein each of the distinct values on the list is associated with at least one of the first and the second plurality of healthcare records;
arranging, by the processor, values appearing in each of the plurality of different fields into an inverted index format to reduce CPU resources when performing real-time searches across all of the first plurality and the second plurality of healthcare records to find duplicate matches among the first plurality of healthcare records and the second plurality of healthcare records in accordance with a search query wherein the inverted index format includes the list of the distinct values and for each distinct value in the list at least one pointer to a healthcare record among the first and the second plurality of healthcare records;
storing to a memory device, by the processor, the distinct values and their associated pointers as a master patient index database arranged according to the inverted index format wherein the master patient index database includes a first identifier which identifies each of the healthcare records as belonging to either the first plurality of healthcare records or the second healthcare records;
storing to the memory device, by the processor, a first set of duplicate scoring parameters associated with the first plurality of healthcare records and a second set of duplicate scoring parameters, different from the first set of duplicate scoring parameters, associated with the second plurality of healthcare records;
receiving, by the processor, one or more search input terms;
based upon the one or more search inputs terms, generating, in the processor, the search query;
based upon the search query and a search, in accordance with the search query, of all the distinct values in the list of distinct values associated with one or more of the plurality of different fields, by the processor, determining one or more distinct values which match the search query;
based upon the one or more distinct values determined to match the search query, retrieving a first healthcare record and a second healthcare record using the pointers to the healthcare records associated with the one or more matched distinct values;
using the first identifier stored with the first healthcare record, determining, by the processor, the first healthcare record is associated with the first plurality of healthcare records;
using the first identifier stored with the second healthcare record, determining, by the processor, the second healthcare record is associated with the second plurality of healthcare records;
determining, by the processor, a first duplicate probability score using the first set of duplicate scoring parameters and the healthcare information in the first healthcare record;
determining, by the processor, a second duplicate probability score using the second set of duplicate scoring parameters and the healthcare information in the second healthcare record;
determining, by the processor, using the first duplicate probability score and the second duplicate probability score, the first healthcare record and the second healthcare record are likely duplicate healthcare records; and
outputting, by the processor, to a display an indication that the first healthcare record and the second healthcare record are likely duplicate healthcare records and the healthcare information associated with each of the first healthcare record and the second healthcare record.
0 Assignments
0 Petitions
Accused Products
Abstract
A system for managing a master patient index is described. The master patient index database is constructed using inverted indices. The inverted index formulation enables faster, more complete and more flexible duplicate detection as compared to traditional master patient database management techniques. A master patient index management system including a remote user interface configured to leverage the inverted index formulation is described. The user interface includes features for managing records in an MPI database including identifying, efficiently comparing, updating and merging duplicate records across a heterogeneous healthcare organization.
-
Citations
20 Claims
-
1. A method of managing electronic protected healthcare information comprising:
-
receiving, by a processor, a first plurality of healthcare records and a second plurality of healthcare records each of the healthcare records having electronic protected healthcare information; arranging, by the processor, the electronic protected healthcare information in each of the healthcare records within the first and the second plurality of healthcare records into a plurality of different fields wherein each of the plurality of different fields accept at least one value; determining, by the processor, a list of distinct values appearing in each of the plurality of different fields wherein each of the distinct values on the list is associated with at least one of the first and the second plurality of healthcare records; arranging, by the processor, values appearing in each of the plurality of different fields into an inverted index format to reduce CPU resources when performing real-time searches across all of the first plurality and the second plurality of healthcare records to find duplicate matches among the first plurality of healthcare records and the second plurality of healthcare records in accordance with a search query wherein the inverted index format includes the list of the distinct values and for each distinct value in the list at least one pointer to a healthcare record among the first and the second plurality of healthcare records; storing to a memory device, by the processor, the distinct values and their associated pointers as a master patient index database arranged according to the inverted index format wherein the master patient index database includes a first identifier which identifies each of the healthcare records as belonging to either the first plurality of healthcare records or the second healthcare records; storing to the memory device, by the processor, a first set of duplicate scoring parameters associated with the first plurality of healthcare records and a second set of duplicate scoring parameters, different from the first set of duplicate scoring parameters, associated with the second plurality of healthcare records; receiving, by the processor, one or more search input terms; based upon the one or more search inputs terms, generating, in the processor, the search query; based upon the search query and a search, in accordance with the search query, of all the distinct values in the list of distinct values associated with one or more of the plurality of different fields, by the processor, determining one or more distinct values which match the search query; based upon the one or more distinct values determined to match the search query, retrieving a first healthcare record and a second healthcare record using the pointers to the healthcare records associated with the one or more matched distinct values; using the first identifier stored with the first healthcare record, determining, by the processor, the first healthcare record is associated with the first plurality of healthcare records; using the first identifier stored with the second healthcare record, determining, by the processor, the second healthcare record is associated with the second plurality of healthcare records; determining, by the processor, a first duplicate probability score using the first set of duplicate scoring parameters and the healthcare information in the first healthcare record; determining, by the processor, a second duplicate probability score using the second set of duplicate scoring parameters and the healthcare information in the second healthcare record; determining, by the processor, using the first duplicate probability score and the second duplicate probability score, the first healthcare record and the second healthcare record are likely duplicate healthcare records; and outputting, by the processor, to a display an indication that the first healthcare record and the second healthcare record are likely duplicate healthcare records and the healthcare information associated with each of the first healthcare record and the second healthcare record. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method of managing electronic protected healthcare information comprising:
-
receiving, by a processor, a first plurality of healthcare records associated with a first healthcare organization and a second plurality of healthcare records associated with a second healthcare organization each of the healthcare records having electronic protected healthcare information; arranging, by the processor, the electronic protected healthcare information in each of the healthcare records within the first and the second plurality of healthcare records into a plurality of different fields wherein each of the plurality of different fields accept at least one value; determining, by the processor, a list of distinct values appearing in each of the plurality of different fields wherein each of the distinct values on the list is associated with at least one of the first and the second plurality of healthcare records; arranging, by the processor, values appearing in each of the plurality of different fields into an inverted index format to reduce CPU resources when performing real-time searches across all of the first plurality and the second plurality of healthcare records to find duplicate matches among the first plurality of healthcare records and the second plurality of healthcare records in accordance with a search query wherein the inverted index format includes the list of the distinct values and for each distinct value in the list at least one pointer to a healthcare record among the first and the second plurality of healthcare records; storing to a memory device, by the processor, the distinct values and their associated pointers as a master patient index database arranged according to the inverted index format wherein the master patient index database includes a first identifier which identifies each of the healthcare records as belonging to either the first healthcare organization or the second healthcare organization; storing to the memory device, by the processor, a first set of duplicate scoring parameters associated with the first healthcare organization and a second set of duplicate scoring parameters, different from the first set of duplicate scoring parameters, associated with the second healthcare organization; receiving, by the processor, one or more search input terms; based upon the one or more search inputs terms, generating, in the processor, the search query; based upon the search query and a search, in accordance with the search query, of all the distinct values in the list of distinct values associated with one or more of the plurality of different fields, by the processor, determining one or more distinct values which match the search query; based upon the one or more distinct values determined to match the search query, determining a first healthcare and a second healthcare record in the first plurality of healthcare records match the search query and a third healthcare record in the second plurality of healthcare records match the search query; determining the search query is associated with the first healthcare organization; determining the first healthcare record and the second healthcare record is associated with the first healthcare organization and the third healthcare record is associated with a second healthcare organization; retrieving only first healthcare record and the second healthcare record; determining, by the processor, a first duplicate probability score and a second duplicate probability score using the first set of duplicate scoring parameters and the healthcare information in the first healthcare record and the second healthcare record; determining, by the processor, using the first duplicate probability score and the second duplicate probability score, the first healthcare record and the second healthcare record are likely duplicate healthcare records; and outputting, by the processor, to a display an indication that the first healthcare record and the second healthcare record are likely duplicate healthcare records and the healthcare information associated with each of the first healthcare record and the second healthcare record. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A method of managing electronic protected healthcare information comprising:
-
receiving, by a processor, a plurality of healthcare records each of the healthcare records having electronic protected healthcare information; arranging, by the processor, the electronic protected healthcare information in each of the healthcare records within the plurality of healthcare records into a plurality of different fields wherein each of the plurality of different fields accept at least one value; determining, by the processor, a list of distinct values appearing in each of the plurality of different fields wherein each of the distinct values on the list is associated with at least one of the plurality of healthcare records; arranging, by the processor, values appearing in each of the plurality of different fields into an inverted index format to reduce CPU resources when performing real-time searches across all of the plurality of healthcare records to find duplicate matches among the plurality of healthcare records in accordance with a search query wherein the inverted index format includes the list of the distinct values and for each distinct value in the list at least one pointer to a healthcare record among the plurality of healthcare records; storing to a memory device, by the processor, the distinct values and their associated pointers as a master patient index database arranged according to the inverted index format; receiving, by the processor, one or more search input terms and a first value of a first duplicate scoring parameter associated with one of the plurality of different fields; based upon the one or more search inputs terms, generating, in the processor, the search query; based upon the search query and a search, in accordance with the search query, of all the distinct values in the list of distinct values associated with one or more of the plurality of different fields, by the processor, determining one or more distinct values which match the search query; based upon the one or more distinct values determined to match the search query, retrieving a first healthcare record and a second healthcare record using the pointers to the healthcare records associated with the one or more matched distinct values; determining, by the processor, a first duplicate probability score and a second duplicate probability score using at least the first value of the duplicate scoring parameter and the healthcare information in the first healthcare record and in the second healthcare record; receiving, by the processor, the one or more search input terms and a second value of the first duplicate scoring parameter; determining, by the processor, a third duplicate probability score and a fourth duplicate probability score using at least the second value of the first duplicate scoring parameter and the healthcare information in the first healthcare record and in the second healthcare record; and based upon the first duplicate probability score, the second duplicate probability score, the third duplicate probability and the fourth duplicate probability score, outputting, by the processor, to a display an indication that the first healthcare record and the second healthcare record are likely duplicate healthcare records and the healthcare information associated with each of the first healthcare record and the second healthcare record. - View Dependent Claims (18, 19, 20)
-
Specification