Automated systems and methods for textual extraction of relevant data elements from an electronic clinical document

  • US 10,789,461 B1
  • Filed: 01/15/2020
  • Issued: 09/29/2020
  • Est. Priority Date: 10/24/2019
  • Status: Active Grant
First Claim

1. A computer-implemented method for extracting relevant data elements from an electronic file for conversion to tabular format, the method comprising:

    receiving, in a computing device, an Extensible Markup Language (XML) format file, the XML file having at least one loop with nested blocks, wherein each of the nested blocks has at least one data element, the at least one data element having an unstructured or semi-structured format;

    extracting features from the data elements;

    processing, with a processor of the computing device, the extracted features using a machine learning algorithm to estimate a column header value for the data elements relative to a data schema;

    classifying, by the processor, the data elements from the XML file using the extracted features;

    generating, by the processor, a configuration file which maps the column header value to the data elements of the XML file;

    parsing the XML file using the configuration file to extract unstructured or semi-structured alphanumeric data values of the data elements from the XML file and convert the data elements to a structured tabular format; and

    ingesting the structured tabular format of the data elements into a data analytics processing system.
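
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the sample XML, the tag names (`entries`, `entry`, `f1`, `f2`), the two toy features, and the helper names are all hypothetical, and a simple rule-based function stands in for the machine learning algorithm that estimates column headers. The configuration "file" is kept as an in-memory dictionary for brevity.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical input: one loop (<entries>) containing nested blocks (<entry>),
# each holding semi-structured data elements with uninformative tags.
SAMPLE_XML = """
<document>
  <entries>
    <entry><f1>2019-10-24</f1><f2>Metformin 500 mg</f2></entry>
    <entry><f1>2020-01-15</f1><f2>Lisinopril 10 mg</f2></entry>
  </entries>
</document>
"""


def extract_features(text):
    """Compute toy features from the text of one data element."""
    return {
        "is_date": bool(re.fullmatch(r"\d{4}-\d{2}-\d{2}", text)),
        "has_dose": bool(re.search(r"\d+\s*mg\b", text)),
    }


def estimate_header(features):
    """Rule-based stand-in (assumption) for the claim's machine learning step."""
    if features["is_date"]:
        return "date"
    if features["has_dose"]:
        return "medication"
    return "unknown"


def build_config(root, block_tag):
    """Map each element tag in the first nested block to a column header."""
    first_block = root.find(f".//{block_tag}")
    return {
        child.tag: estimate_header(extract_features((child.text or "").strip()))
        for child in first_block
    }


def parse_to_rows(root, block_tag, config):
    """Use the configuration mapping to convert each block into one table row."""
    rows = []
    for block in root.iter(block_tag):
        rows.append(
            {config[c.tag]: (c.text or "").strip() for c in block if c.tag in config}
        )
    return rows


root = ET.fromstring(SAMPLE_XML)
config = build_config(root, "entry")
rows = parse_to_rows(root, "entry", config)
```

Here `config` plays the role of the configuration file that maps column header values to data elements, and `rows` is the structured tabular form that a downstream analytics system could ingest, for example by writing it out with `csv.DictWriter`.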
