Method for Automatic Labeling of Unstructured Data Fragments From Electronic Medical Records
First Claim
1. A method for automatically labeling unstructured data from electronic medical records using a computer-based medical data processing system, comprising:
- selecting a data pattern based on a desired medical finding;
searching for the selected data pattern within source data including patient records;
identifying a context of a predetermined range around each data pattern match found within the source data;
training a classifier based on an association between the identified contexts and the desired medical finding; and
using the trained classifier to automatically identify likely instances of the desired medical finding from within subsequent data including patient records.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for automatically labeling unstructured data from electronic medical records using a computer-based medical data processing system includes selecting a data pattern based on a desired medical finding. The selected data pattern is searched for within source data including patient records to find one or more matches. A context of a predetermined range around each data pattern match found is identified within the source data and the found contexts are associated with a particular medical finding. The medical finding can be at the patient level or document level, not necessarily at the context level. Associations between contexts and medical findings are identified. A classifier based on an association between the identified contexts and the desired medical finding is trained. The trained classifier is used to automatically identify likely instances of passages, documents or patients related to the desired medical finding from within subsequent data including patient records.
58 Citations
24 Claims
-
1. A method for automatically labeling unstructured data from electronic medical records using a computer-based medical data processing system, comprising:
-
selecting a data pattern based on a desired medical finding; searching for the selected data pattern within source data including patient records; identifying a context of a predetermined range around each data pattern match found within the source data; training a classifier based on an association between the identified contexts and the desired medical finding; and using the trained classifier to automatically identify likely instances of the desired medical finding from within subsequent data including patient records. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method for automatically labeling unstructured data from electronic medical records using a computer-based medical data processing system, comprising:
-
receiving patient medical data that does not include structured data indicating whether or not a desired medical finding is present; searching for a data pattern indicative of the desired medical finding from within the patient medical data; identifying a context of a predetermined range around each data pattern match found within the patient medical data; and using a trained classifier to automatically identify whether the patient medical data has the desired medical finding based on the identified contexts, wherein the trained classifier was generated based on an association between identified contexts and the desired medical finding within training data. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A computer system comprising:
-
a processor; and a program storage device readable by the computer system, embodying a program of instructions executable by the processor to perform method steps for automatically labeling unstructured data from electronic medical records, the method comprising; selecting a data pattern based on a desired medical finding; searching for the selected data pattern within source data including patient records; identifying a context of a predetermined range around each data pattern match found within the source data; training a classifier based on an association between the identified contexts and the desired medical finding using a machine learning technique; and using the trained classifier to automatically identify likely instances of the desired medical finding from within subsequent data including patient records. - View Dependent Claims (22, 23)
-
-
24. A method for determining contextual phrases that are indicative of a particular medical finding using a computer-based medical data processing system, comprising:
-
selecting a data pattern based on a desired medical finding; searching for the selected data pattern within source data including patient records; identifying a context of a predetermined range around each data pattern match found within the source data; and generating a set of associations between the contexts identified around each of the plurality of data pattern matches of the source data and the desired medical finding.
-
Specification