Patient data mining with population-based analysis
First Claim
1. A method for analyzing patient records, the method comprising:
data mining, by a processor, a plurality of first patient records including unstructured data, the data mining using a domain knowledge base relating to a disease of interest and mining the unstructured data, the data mining comprising:
mining from a plurality of different data sources for each of the first patient records, each of the first patient records being for a respective patient;
for each of a plurality of variables for each patient, extracting different values from multiple of the different data sources such that multiple of the values are provided for each of the variables representing the patient at a same time;
for each of the plurality of variables for each patient, assigning a probabilistic assertion to each of the values from the different data sources such that the multiple values for each variable are each assigned probability, resulting in different values and respective probabilistic assertions for each variable of the plurality of variables representing each patient; and
for each of the plurality of variables for each patient, combining by inference the probabilities for each of the values from the different data sources into a unified probability for the respective variable, the unified probability providing a factoid representing a final value for the respective variable such that final values are provided for the respective variables for each patient;
the mining resulting in a separate factoid for each variable of the plurality of variables, the separate factoids for the plurality of variables being provided by the mining for each of the patients from the different data sources;
compiling, with the processor, the factoids into a plurality of structured patient records of a structured computerized patient record database and associated with patient variables for each of the patients, each structured patient record for respective patients comprising the factoids; and
correlating, with a system, an outcome with the factoids associated with patient variables;
wherein correlating comprises correlating the outcome from the factoids.
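The extract, assign, and combine steps recited above can be illustrated with a minimal Python sketch. The claim does not say which inference rule is used to combine probabilities; the noisy-OR pooling and normalization below are assumptions chosen for illustration only, and the names Assertion, Factoid, and combine_assertions are hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass
from math import prod


@dataclass(frozen=True)
class Assertion:
    """One value for a variable, extracted from one data source (hypothetical type)."""
    variable: str
    value: str
    source: str
    probability: float


@dataclass(frozen=True)
class Factoid:
    """The final value chosen for a variable, with its unified probability."""
    variable: str
    value: str
    probability: float


def combine_assertions(assertions):
    """Combine per-source probabilities into one factoid per variable.

    Assumption: sources are treated as independent and pooled with a
    noisy-OR rule; the claim itself only says "combining by inference".
    """
    grouped = defaultdict(lambda: defaultdict(list))
    for a in assertions:
        grouped[a.variable][a.value].append(a.probability)

    factoids = {}
    for variable, values in grouped.items():
        # Noisy-OR: a value is supported if at least one source is right.
        scores = {v: 1.0 - prod(1.0 - p for p in ps) for v, ps in values.items()}
        total = sum(scores.values())
        best = max(scores, key=scores.get)
        # Normalize so the competing values for one variable sum to 1.
        unified = scores[best] / total if total else 0.0
        factoids[variable] = Factoid(variable, best, unified)
    return factoids


if __name__ == "__main__":
    mined = [
        Assertion("smoker", "yes", "physician_note", 0.8),
        Assertion("smoker", "yes", "billing_code", 0.5),
        Assertion("smoker", "no", "intake_form", 0.3),
    ]
    print(combine_assertions(mined)["smoker"])
```

In this sketch each patient would have one call to combine_assertions, and the resulting dictionary of factoids would form that patient's structured record.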
Abstract
A system and method for analyzing population-based patient information is provided. The method includes the steps of data mining a plurality of patient records using a domain knowledge base relating to a disease of interest; compiling the mined data into a plurality of structured patient records; inputting at least one patient criteria relating to the disease of interest; and extracting at least one structured patient record matching the at least one patient criteria. The system includes a data miner for mining information from the plurality of patient records using a domain knowledge base relating to a disease of interest and for compiling the mined data into a plurality of structured patient records; an interface for inputting at least one patient criteria relating to the disease of interest; and a processor adapted for extracting at least one of the structured patient records matching the at least one patient criteria.
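The criteria-matching step described in the abstract can be sketched as a simple filter over structured records. This is only an illustration under stated assumptions: the record layout (a "factoids" mapping from variable name to a value and probability pair), the function name match_records, and the min_probability threshold are all hypothetical and do not come from the patent text.

```python
def match_records(records, criteria, min_probability=0.5):
    """Return the structured records whose factoids satisfy every criterion.

    records: list of dicts, each with a "factoids" mapping of
             variable -> (value, probability)   (assumed layout)
    criteria: mapping of variable -> required value
    min_probability: assumed confidence cutoff for trusting a factoid
    """
    matches = []
    for record in records:
        factoids = record["factoids"]
        if all(
            variable in factoids
            and factoids[variable][0] == wanted
            and factoids[variable][1] >= min_probability
            for variable, wanted in criteria.items()
        ):
            matches.append(record)
    return matches


if __name__ == "__main__":
    structured = [
        {"patient_id": 1, "factoids": {"diagnosis": ("diabetes", 0.9), "smoker": ("yes", 0.7)}},
        {"patient_id": 2, "factoids": {"diagnosis": ("diabetes", 0.4), "smoker": ("yes", 0.8)}},
        {"patient_id": 3, "factoids": {"diagnosis": ("asthma", 0.95)}},
    ]
    for record in match_records(structured, {"diagnosis": "diabetes", "smoker": "yes"}):
        print(record["patient_id"])
```

A threshold is one way to keep low-confidence factoids from producing matches; a real system might instead rank records by probability.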
17 Claims
1. A method for analyzing patient records, as recited in full under First Claim above.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 13, 15, 16, 17
9. A system for analyzing a plurality of patient records, the plurality of patient records being stored in structured and unstructured sources, the system comprising:
a data miner for mining information from the plurality of first patient records including the unstructured sources using a domain knowledge base relating to a disease of interest, the data miner configured to:
mine from a plurality of different data sources including the unstructured sources for each of the first patient records, each of the first patient records being for a respective patient;
for each of a plurality of variables for each patient, extract different values from multiple of the different data sources such that multiple of the values are provided for each of the variables representing the patient at a same time;
for each of the plurality of variables for each patient, assign a probabilistic assertion to each of the values from the different data sources such that the multiple values for each variable are each assigned a probability resulting in different values and respective probabilistic assertions for each variable of the plurality of variables representing each patient; and
for each of the plurality of variables for each patient, combine by inference the probabilities for each of the values from the different data sources into a unified probability for the respective variable, the unified probability providing a factoid representing a final value for the respective variable such that final values are provided for the respective variables for each patient;
the mining resulting in a separate factoid for each variable of the plurality of variables, the separate factoids for the plurality of variables being provided by the mining for each of the patients from the different data sources;
the data miner compiling mined data as the factoids, the mined data being from the unstructured sources, into a plurality of structured patient records of a structured computerized patient record database and associated with patient variables for each of the patients, each structured patient record for respective patients comprising the factoids;
a processor operable to correlate an outcome with the mined data associated with the patient variables as a function of at least one of the structured patient records;
wherein the processor is operable to correlate the outcome from the factoids.
Dependent claims: 10, 11, 12, 14
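The correlation element of the claim can be illustrated with a short Python sketch that relates an outcome to one factoid variable across structured records. The claim does not name a statistical measure; the per-value outcome rate below is an assumed stand-in, and the record layout and the function name outcome_rate_by_value are hypothetical.

```python
from collections import defaultdict


def outcome_rate_by_value(records, variable):
    """Fraction of patients with a positive outcome, per value of one variable.

    records: list of dicts with a "factoids" mapping of variable -> final value
             and a boolean "outcome" field (assumed layout).
    Records that lack the variable are skipped.
    """
    counts = defaultdict(lambda: [0, 0])  # value -> [positive outcomes, total]
    for record in records:
        value = record["factoids"].get(variable)
        if value is None:
            continue
        counts[value][0] += 1 if record["outcome"] else 0
        counts[value][1] += 1
    return {value: positive / total for value, (positive, total) in counts.items()}


if __name__ == "__main__":
    population = [
        {"factoids": {"treatment": "A"}, "outcome": True},
        {"factoids": {"treatment": "A"}, "outcome": True},
        {"factoids": {"treatment": "B"}, "outcome": True},
        {"factoids": {"treatment": "B"}, "outcome": False},
    ]
    print(outcome_rate_by_value(population, "treatment"))
```

Comparing rates across values gives only a rough view of association; it does not adjust for confounding variables or sample size.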
Specification