Early detection of disease outbreak using electronic patient data to reduce public health threat from bio-terrorism
First Claim
1. A method for automated identification of disease outbreak, the method comprising:
- obtaining patient records for a plurality of patients, said records including structured and unstructured data;
for each patient of the plurality of patients, mining data from the unstructured data of the patient record using domain-specific knowledge contained in a knowledge base to produce one or more probabilistic assertions;
for each patient of the plurality of patients, correlating the structured data and the probabilistic assertions with disease indicia from at least one disease to determine a correlation value between the patient and each of the at least one disease;
outputting an alert when at least one correlation value exceeds a first predetermined threshold;
calculating a joint probability that at least one patient of the plurality of patients has the at least one disease based on each of the correlation values and a measure of geographic proximity between each of the plurality of patients; and
outputting an alert when the calculated joint probability exceeds a second predetermined threshold.
2 Assignments
0 Petitions
Accused Products
Abstract
A method is provided for automatically identifying a disease outbreak indicative of a potential bio-terror attack. Patient records are mined from structured and unstructured clinical sources. The patient records are then analyzed by correlating selected patient data contained in the patient records with disease indicia for each of a plurality of diseases. A probability of a disease outbreak is estimated at least in part based on these correlations. Suspicion may also be indicated if anomalous disease clusters are found. If any of the estimated probabilities exceeds a threshold value, a disease outbreak alert is outputted. The disease indicia may be defined by disease progression models, which may be stored in a disease knowledge base.
-
Citations
37 Claims
-
1. A method for automated identification of disease outbreak, the method comprising:
-
obtaining patient records for a plurality of patients, said records including structured and unstructured data; for each patient of the plurality of patients, mining data from the unstructured data of the patient record using domain-specific knowledge contained in a knowledge base to produce one or more probabilistic assertions; for each patient of the plurality of patients, correlating the structured data and the probabilistic assertions with disease indicia from at least one disease to determine a correlation value between the patient and each of the at least one disease; outputting an alert when at least one correlation value exceeds a first predetermined threshold; calculating a joint probability that at least one patient of the plurality of patients has the at least one disease based on each of the correlation values and a measure of geographic proximity between each of the plurality of patients; and outputting an alert when the calculated joint probability exceeds a second predetermined threshold. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 33, 35)
-
-
26. A program storage device readable by a machine, the program storage device tangibly embodying a program of instructions executable on the machine to perform a method for automated identification of disease outbreaks, the method comprising:
-
obtaining patient records for a plurality of patients, said records including structured and unstructured data; for each patient of the plurality of patients, mining data from the unstructured data of the patient record using domain-specific knowledge contained in a knowledge base to produce one or more probabilistic assertions; for each patient of the plurality of patients, correlating the structured data and the probabilistic assertions with disease indicia from at least one disease to determine a correlation value between the patient and each of the at least one disease; outputting an alert when at least one correlation value exceeds a first predetermined threshold; calculating a joint probability that at least one patient of the plurality of patients has the at least one disease based on each of the correlation values and a measure of geographic proximity between each of the plurality of patients; and outputting an alert when the calculated joint probability exceeds a second predetermined threshold.
-
-
27. A method for automated identification of disease outbreak, the method comprising:
-
data mining of unstructured data from selected patient records using domain-specific knowledge contained in a knowledge base; determining a correlation value between the selected patients and indicia of a disease based on the data mined from the unstructured data; determining a joint probability that at least one patient of the selected patients has the disease based on each of the correlation values; indicating a disease outbreak when the joint probability exceeds a threshold value; and outputting a disease outbreak alert for the disease outbreak. - View Dependent Claims (28, 29, 30, 31, 32, 34, 36)
-
-
37. A method for automated identification of disease outbreak, the method comprising:
-
obtaining patient records for a plurality of patients, said records including structured and unstructured data; for each patient of the plurality of patients, mining data from the unstructured data of the patient record using domain-specific knowledge contained in a knowledge base to produce one or more probabilistic assertions; for each patient of the plurality of patients, correlating the structured data and the probabilistic assertions with disease early-stage indicia and disease late-stage indicia from at least one disease to determine an whether there is at least a partial match between the early-stage indicia of each patient and the disease and to determine whether there is a match between the late-stage indicia of each patient and the disease; and outputting an alert when the early-stage indicia for a particular patient only partially matches the disease and the late-stage indicia for the particular patient matches the disease.
-
Specification