SYSTEM FOR ANONYMIZING AND AGGREGATING PROTECTED HEALTH INFORMATION

  • US 20150149208A1
  • Filed: 11/27/2013
  • Published: 05/28/2015
  • Est. Priority Date: 11/27/2013
  • Status: Active Grant
First Claim

1. A system for anonymizing and aggregating protected health information (PHI) from a plurality of data sources, the system comprising:

  • a plurality of data hashing appliances each operatively coupled to a respective data source, each hashing appliance configured to receive, from the respective data source, one or more patient medical records, each patient medical record containing at least one data element corresponding to confidential protected health information (PHI), and a master record number (MRN) assigned by the respective data source;

    each data hashing appliance configured to:

    append a salt value to each data element corresponding to confidential PHI in the patient medical record;

    generate a hash value for each data element corresponding to salted confidential PHI;

    replace the data element corresponding to confidential PHI with the corresponding generated hash value to generate an anonymized patient medical record;

    a master patient index server coupled to a data repository, configured to aggregate a plurality of anonymized patient medical records received from the plurality of data hashing appliances under a unique patient identifier;

    a vector and cluster matching engine operatively coupled to the master patient index server and the data repository, and configured to determine if the anonymized patient medical record received from respective hashing appliances matches the unique patient identifier corresponding to at least a second anonymized patient medical record stored in the data repository, the matching determined by:

    generating a comparison vector by comparing the hash values corresponding to the confidential PHI in the received anonymized patient medical record with corresponding hash values in the second anonymized patient medical record;

    generating a confidence vector by assigning weights based on predetermined match conditions;

    crossing the comparison vector with the confidence vector to obtain a match confidence level;

    comparing the match confidence level to a predetermined threshold; and

    mapping the received anonymized patient medical record to the unique patient identifier if the confidence level is greater than the predetermined threshold.
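
The hashing and matching steps recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation, and everything specific in it is an assumption made for illustration: the field names in PHI_FIELDS, the use of SHA-256, the lowercase/strip normalization, the single salt shared by all appliances (needed here so that equal values hash identically across sources), the weights, the threshold, and the reading of "crossing" the two vectors as a dot product.

```python
import hashlib

# Hypothetical PHI field names; the claim does not enumerate them.
PHI_FIELDS = ("first_name", "last_name", "dob", "ssn")

# Assumption: every appliance uses the same salt, so identical PHI values
# produce identical hashes regardless of which data source sent them.
SALT = "example-shared-salt"

# Assumed weights (the "confidence vector") and threshold.
WEIGHTS = {"first_name": 0.15, "last_name": 0.25, "dob": 0.25, "ssn": 0.35}
THRESHOLD = 0.7


def hash_element(value, salt=SALT):
    """Append the salt to a normalized PHI value and hash the result."""
    normalized = value.strip().lower()  # normalization is an assumption
    return hashlib.sha256((normalized + salt).encode("utf-8")).hexdigest()


def anonymize(record, salt=SALT):
    """Replace each PHI element with its salted hash; other fields (e.g. MRN) are kept."""
    anonymized = dict(record)
    for field in PHI_FIELDS:
        if field in anonymized:
            anonymized[field] = hash_element(anonymized[field], salt)
    return anonymized


def comparison_vector(received, stored):
    """1 where both records carry the same hash for a field, else 0."""
    return [
        1 if received.get(f) is not None and received.get(f) == stored.get(f) else 0
        for f in PHI_FIELDS
    ]


def match_confidence(received, stored):
    """Combine the comparison vector with the weights (dot product, by assumption)."""
    comparison = comparison_vector(received, stored)
    confidence = [WEIGHTS[f] for f in PHI_FIELDS]
    return sum(c * w for c, w in zip(comparison, confidence))


def is_match(received, stored, threshold=THRESHOLD):
    """Map to the stored patient only if the confidence exceeds the threshold."""
    return match_confidence(received, stored) > threshold


if __name__ == "__main__":
    source_a = anonymize({"mrn": "A-001", "first_name": "Jon", "last_name": "Doe",
                          "dob": "1970-01-01", "ssn": "000-00-0000"})
    source_b = anonymize({"mrn": "B-917", "first_name": "Jonathan", "last_name": "Doe",
                          "dob": "1970-01-01", "ssn": "000-00-0000"})
    print(comparison_vector(source_a, source_b), is_match(source_a, source_b))
```

Because only hashes are compared, the matching engine never sees plaintext PHI. The trade-off in this sketch is that matching is exact per field: any spelling variation in a field zeroes that entry of the comparison vector, which is why the weights spread confidence across several fields.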
