Patient data mining for automated compliance
Abstract
A technique is provided for automatically generating performance measurement information. At least some of the obtained performance measurement information may be derived from unstructured data sources, such as free text physician notes, medical images, and waveforms. The performance measurement may be sent to a health care accreditation organization. The health care accreditation organization can use the performance measurement to evaluate a health care provider for its quality of patient care. Alternatively, performance measurement information can be provided directly to consumers.
56 Claims
1. A method for automatically generating performance measurement information for health care organizations, the method comprising:
mining, with a machine, free text, the mining on the free text comprising mining for health care data related to a health care guideline for a patient, the mining using medical knowledge, the medical knowledge associated with the health care guideline, the free text being stored physician notes, the mining comprising:
gleaning, as part of the mining for the patient, a plurality of pieces of evidence about the patient including at least one of the pieces being from the free text;
using, as part of the mining, probabilistic information, the probabilistic information comprising chances of occurrence for possible values of a variable being mined for the patient, the possible values being derived from the pieces of evidence such that a plurality of possible values and respective chances are provided for the variable;
calculating, as part of the mining, the chances from probabilities assigned to the pieces of evidence from the free text for the patient, the probabilities being less than 100% and greater than 0%;
as part of the mining, assigning, as the health care data for the variable for the patient, a value for the variable with the chance of occurrence greater than other chances for the possible values for the variable, the value being different than the respective chance;
populating, as part of the mining, a data source with at least some health care data mined from the free text, a structure of the mined data in the data source being different from a structure of the free text so that compliance querying may be performed using the data source, the structure of the mined data in the data source comprising the at least some health care data for the variable separate from the free text;
querying, with the machine executing a query script defining a plurality of constraints and formulated by the machine, the data source having the at least some data populated from the free text of the stored physician notes, the querying based on the health care guideline; and
outputting performance measurement information indicating a level of compliance with the health care guideline based on the querying.
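
The probabilistic steps recited in claim 1 (gleaning evidence, calculating chances for each possible value, and assigning the value with the greatest chance) can be illustrated with a minimal Python sketch. The claim does not say how chances are calculated from the evidence probabilities; the normalized-sum rule, the `Evidence` type, and the sample "smoker" values below are assumptions chosen only for illustration, not the patented method.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Evidence:
    """One piece of evidence about a variable (hypothetical structure)."""
    variable: str       # variable being mined, e.g. "smoker"
    value: str          # possible value this evidence supports
    probability: float  # confidence, strictly between 0 and 1
    source: str         # e.g. "free_text" or "structured"


def chances_for(variable, evidence):
    """Return {possible value: chance} for one variable.

    Assumed rule: sum the probabilities per value, then normalize so
    the chances over all possible values add up to 1.
    """
    totals = defaultdict(float)
    for e in evidence:
        if e.variable != variable:
            continue
        if not 0.0 < e.probability < 1.0:
            raise ValueError("probability must be strictly between 0 and 1")
        totals[e.value] += e.probability
    norm = sum(totals.values())
    return {value: p / norm for value, p in totals.items()} if norm else {}


def assign_value(chances):
    """Pick the possible value whose chance exceeds all the others."""
    return max(chances, key=chances.get)


# Made-up evidence for one patient; two pieces come from physician notes.
notes_evidence = [
    Evidence("smoker", "yes", 0.8, "free_text"),
    Evidence("smoker", "no", 0.3, "free_text"),
    Evidence("smoker", "yes", 0.6, "structured"),
]
chances = chances_for("smoker", notes_evidence)
assigned = assign_value(chances)
print(chances, assigned)
```

Note that the assigned item is a value of the variable (here a label such as "yes"), not the chance itself, which mirrors the claim language that the value is different from its respective chance.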

26. A method for automatically generating performance measurement information for health care organizations, the method comprising:
mining, with a machine, medical information related to a health care guideline from a computerized patient record;
combining, with the machine, evidence from the mining, the evidence being combined referring to different values of a same variable and being probabilistic such that a probability is provided for each piece of evidence, the probability for each piece of evidence indicating a confidence in the respective value, at least some of the probabilities being less than 100% and greater than 0%, the combined evidence being a unified probability calculated from the probabilities for the evidence, the unified probability being less than 100% and greater than 0%, the combining being pursuant to a mathematical operation such that the unified probability is a numerical value that is based on the probabilities for the evidence applied as input to the mathematical operation;
assigning a final value for the variable as a function of the probabilities for each piece of evidence, the final value being different than the unified probability;
querying with the machine, the machine executing a query script formulated by the machine, a data source having the final value for the combined evidence, the querying based on the health care guideline; and
outputting probabilistic performance measurement information indicating a level of compliance with the health care guideline as a function of the combined evidence.
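
Claim 26 requires only that the unified probability result from "a mathematical operation" applied to the per-evidence probabilities; it does not name the operation. As one hedged possibility, the sketch below combines evidence for a two-valued variable with a normalized product, which treats the pieces of evidence as independent. The function name, the "yes"/"no" encoding, the 0.5 threshold, and the sample numbers are all assumptions.

```python
import math


def unified_probability(evidence):
    """Combine (value, probability) pairs into one probability of "yes".

    Assumed operation: a normalized product. Evidence supporting "no"
    with probability p is treated as supporting "yes" with 1 - p.
    """
    q = []
    for value, p in evidence:
        if not 0.0 < p < 1.0:
            raise ValueError("probability must be strictly between 0 and 1")
        q.append(p if value == "yes" else 1.0 - p)
    pro = math.prod(q)                     # joint support for "yes"
    con = math.prod(1.0 - x for x in q)    # joint support for "no"
    return pro / (pro + con)


def final_value(p_yes, threshold=0.5):
    """Map the unified probability to a value of the variable."""
    return "yes" if p_yes > threshold else "no"


# Made-up evidence referring to different values of the same variable.
evidence = [("yes", 0.8), ("no", 0.3), ("yes", 0.6)]
unified = unified_probability(evidence)
value = final_value(unified)
print(round(unified, 4), value)
```

Because every input lies strictly between 0 and 1, the unified probability also stays strictly between 0% and 100%, consistent with the claim language.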

48. A system for automatically generating performance measurement information for health care organizations, the system comprising:
a machine configured as a data miner, the data miner configured to mine free text for health care data related to a health care guideline for a patient, the free text being stored physician notes, the mining using probabilistic information, the probabilistic information comprising a chance of occurrence for possible values of a variable being mined for the patient, the chances calculated from probabilities assigned to pieces of evidence extracted from the free text for the patient, the probabilities being less than 100% and greater than 0%, the possible value for the variable with the chance of occurrence greater than chances for other possible values for the variable being assigned for the patient, as part of the mining, as the health care data for the variable, and the mining being a function of a health care domain-specific knowledge, the health care domain-specific knowledge associated with the health care guideline, the mining populating a data source with at least some health care data mined from the free text, a structure of the mined data in the data source being different from a structure of the free text so that compliance querying may be performed using the data source, the structure of the mined data in the data source comprising the at least some health care data for the variable separate from the free text;
the data source having the at least some data populated from the free text; and
the machine configured to query, by executing a script defining a plurality of constraints and formulated by the machine, the data source, the querying based on health care guideline, and operable to output performance measurement information indicating a level of compliance with the health care guideline based on the querying.
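
The populate-then-query arrangement of claim 48 can be sketched with Python's standard `sqlite3` module: mined values are stored in a table separate from the free text, and the machine formulates a query from a list of guideline constraints. The table layout, the `build_query` helper, and the sample aspirin guideline are hypothetical and serve only as an illustration.

```python
import sqlite3

# Structured data source, separate from the free-text notes (assumed layout).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE mined (patient_id TEXT, variable TEXT, value TEXT)")
conn.executemany(
    "INSERT INTO mined VALUES (?, ?, ?)",
    [
        ("p1", "diagnosis", "AMI"), ("p1", "aspirin_at_arrival", "yes"),
        ("p2", "diagnosis", "AMI"), ("p2", "aspirin_at_arrival", "no"),
        ("p3", "diagnosis", "pneumonia"), ("p3", "aspirin_at_arrival", "yes"),
        ("p4", "diagnosis", "AMI"), ("p4", "aspirin_at_arrival", "yes"),
    ],
)


def build_query(constraints):
    """Formulate SQL selecting patients that satisfy every constraint.

    Each constraint is a (variable, value) pair; the per-constraint
    selections are intersected so all of them must hold.
    """
    if not constraints:
        raise ValueError("at least one constraint is required")
    sql = " INTERSECT ".join(
        "SELECT patient_id FROM mined WHERE variable = ? AND value = ?"
        for _ in constraints
    )
    params = [item for pair in constraints for item in pair]
    return sql, params


def patients_matching(constraints):
    """Run the formulated query and return the matching patient ids."""
    sql, params = build_query(constraints)
    return {row[0] for row in conn.execute(sql, params)}


# Hypothetical guideline: AMI patients should receive aspirin at arrival.
eligible = patients_matching([("diagnosis", "AMI")])
compliant = patients_matching(
    [("diagnosis", "AMI"), ("aspirin_at_arrival", "yes")]
)
rate = len(compliant) / len(eligible)
print(f"compliance: {len(compliant)}/{len(eligible)} = {rate:.0%}")
```

Passing the values as bound parameters rather than splicing them into the SQL string keeps the machine-formulated query safe even when constraint values originate from mined text.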

53. A system for automatically generating performance measurement information for health care organizations, the system comprising:
a machine configured as a data miner, the data miner configured to mine a computerized patient record for medical information related to a health care guideline, the mining based on a health care domain-specific knowledge, the health care domain-specific knowledge associated with the health care guideline, and combine evidence from the mining, the evidence being combined referring to different values of a same variable and being probabilistic such that a probability is provided for each piece of evidence, the probability for each piece of evidence indicating a confidence in the respective value, at least some of the probabilities being less than 100% and greater than 0%, the combined evidence being a unified probability calculated from the probabilities for the evidence, the unified probability being less than 100% and greater than 0%, the combining being pursuant to a mathematical operation such that the unified probability is a numerical value that is based on the probabilities for the evidence applied as input to the mathematical operation, the data miner configured to assign an element value for the variable as a function of the probabilities for each piece of evidence, the element value being different than the unified probability;
a data source having the element value for the combined evidence stored in a tangible media; and
the machine operable to query the data source, the query being performed by executing a query script formulated by the machine, the querying based on the health care guideline, and operable to output probabilistic performance measurement information indicating a level of compliance with the health care guideline based on the combined evidence.

54. A program storage device readable by a machine, tangibly embodying a program of instructions executable on the machine to perform method steps for automatically generating performance measurement information for health care organizations, the method steps comprising:
mining, with the machine, free text, the mining of the free text comprising mining for data related to a health care guideline, the mining using medical knowledge, the medical knowledge associated with the health care guideline, the free text being stored physician notes, the mining comprising:
gleaning, as part of the mining for a patient, a plurality of pieces of evidence about a variable for the patient, at least one of the pieces extracted from the free text;
using, as part of the mining, probabilistic information, the probabilistic information comprising chances of occurrence for possible values of the variable being mined for the patient, the possible values being derived from the pieces of evidence such that a plurality of possible values and respective chances are provided for the variable;
calculating, as part of the mining, the chances from probabilities assigned to the pieces of evidence from the free text for the patient, the probabilities being less than 100% and greater than 0%;
as part of the mining, assigning, as the health care data for the variable for the patient, a value for the variable with the chance of occurrence greater than the chances of other possible values for the variable, the value being different than the respective chance;
populating a data source with at least some data mined from the free text, a structure of the mined data in the data source being different from a structure of the free text from which the data is mined, the more structure comprising the variable separate from the free text;
querying, with the machine, by executing a query script defining a plurality of constraints and formulated by the machine, a data source having the at least some data populated from the free text representing physician notes, the querying based on the health care guideline; and
outputting, by the machine, performance measurement information indicating a level of compliance with the health care guideline based on the querying.

55. A program storage device readable by a machine, tangibly embodying a program of instructions executable on the machine to perform method steps for automatically generating performance measurement information for health care organizations, the method steps comprising:
mining, with the machine, medical information related to a health care guideline from a computerized patient record, the mining based on a health care domain-specific knowledge, the health care domain-specific knowledge associated with the health care guideline;
combining, with the machine, evidence from the mining, the evidence being combined referring to different values of a same variable and being probabilistic such that a probability is provided for each piece of evidence, the probability for each piece of evidence indicating a confidence in the respective value, at least some of the probabilities being less than 100% and greater than 0%, the combined evidence being a unified probability calculated from the probabilities for the evidence, the unified probability being less than 100% and greater than 0%, the combining being pursuant to a mathematical operation such that the unified probability is a numerical value that is based on the probabilities for the evidence applied as input to the mathematical operation;
assigning an element value for the variable as a function of the probabilities for each piece of evidence, the element value being different than the unified probability;
querying, with the machine, the machine executing a query script formulated by the machine, a data source having the element value for the combined evidence, the querying based on the health care guideline; and
outputting, by the machine, probabilistic performance measurement information indicating a level of compliance with the health care guideline based on the combined evidence.
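
Claims 26, 53, and 55 recite "probabilistic performance measurement information" without defining its form. One possible reading, offered strictly as an assumption, is a compliance rate reported with its uncertainty: if each patient has a unified probability of being compliant and patients are treated as independent, the expected rate and its standard deviation follow directly. The function name and the sample probabilities are hypothetical.

```python
import math


def probabilistic_compliance(p_compliant):
    """Summarize per-patient compliance probabilities for a population.

    Assumes patients are independent, so the compliant count follows a
    Poisson binomial distribution with mean sum(p) and variance
    sum(p * (1 - p)).
    """
    n = len(p_compliant)
    if n == 0:
        raise ValueError("need at least one patient")
    expected = sum(p_compliant)
    variance = sum(p * (1.0 - p) for p in p_compliant)
    return {
        "expected_rate": expected / n,
        "std_dev_rate": math.sqrt(variance) / n,
    }


# Made-up unified probabilities that each patient met the guideline.
report = probabilistic_compliance([0.93, 0.10, 0.75, 0.60])
print(report)
```

Reporting the spread alongside the rate lets a reader see how much of the compliance figure rests on uncertain evidence rather than on definite values.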

56. A method for automatically generating performance measurement information for health care organizations, the method comprising:
extracting, with a machine, multiple pieces of evidence for each variable of a plurality of variables for a first patient, at least one of the pieces of evidence extracted from free text for the first patient based on a domain-knowledge base;
assigning, with the machine, a degree of confidence to each of the pieces of evidence, at least one of the degrees of confidence for each variable being greater than 0% and less than 100%, the degrees of confidence each indicating relative probability of at least two different values for the variable;
combining, with the machine, the degrees of confidence for the multiple pieces of evidence for each variable into a unified probability;
assigning, with the machine, one of the different values of each of the variables as a function of the respective unified probability;
repeating the extracting, assigning the degree of confidence, combining, and assigning one of the different values for a plurality of other patients;
storing in a computerized patient record the assigned ones of the different values for each of the variables for each of the patients;
querying the computerized patient record for each of the patients based on the health care guideline; and
outputting performance measurement information indicating a level of compliance with the health care guideline across a patient population of the patients based on the querying.
Specification