Intelligent extraction and organization of data from unstructured documents

  • US 10,296,578 B1
  • Filed: 02/20/2018
  • Issued: 05/21/2019
  • Est. Priority Date: 02/20/2018
  • Status: Active Grant
First Claim

1. A system comprising a computer configured with instructions encoded on a non-transitory computer readable medium, the instructions operable to, when executed, cause the computer to perform acts comprising:

  • a) organizing a plurality of data elements from an input document into a set of groups by repeatedly, until satisfaction of a completion condition, performing a grouping process comprising;

    i) performing a matching determination of whether a subject data element satisfies a matching criterion for an existing group;

    ii) performing an overlap determination of whether the subject data element is associated with a horizontal extent that overlaps a horizontal neighbor of one of the existing group's data elements; and

    iii) selectively adding the subject data element to the existing group based on the matching determination and the overlap determination, wherein when the subject data element is associated with a horizontal extent that overlaps a horizontal neighbor of one of the existing group's data elements the subject data element is excluded from the existing group;

    b) assigning each group from the set of groups to a column based on horizontal positions of data elements from groups that have already been assigned to columns;

    c) for each data element, assigning that data element to a row based on that data element's vertical position; and

    d) for each data element, adding that data element to a structured document in the row assigned to that data element and in the column assigned to the group which comprises that data element;

    wherein each data element has data comprising one or more non-whitespace characters.
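
One way to read steps a) through d) is sketched below in Python. This is an illustrative interpretation only, not the patented implementation: the claim does not define the matching criterion, the completion condition, or what counts as a horizontal neighbor, so the sketch assumes left-edge alignment within a tolerance, a single pass over all elements, and "any other element on the same line" respectively. All names (`Element`, `group_elements`, `assign_columns`, `assign_rows`, `to_table`) and the tolerance values are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Element:
    text: str   # one or more non-whitespace characters
    x0: float   # left edge of the horizontal extent
    x1: float   # right edge of the horizontal extent
    y: float    # vertical position


def overlaps(a_x0, a_x1, b_x0, b_x1):
    """True when two horizontal extents share interior points."""
    return a_x0 < b_x1 and b_x0 < a_x1


def row_neighbors(member, elements, row_tol):
    """Assumed meaning of 'horizontal neighbor': other elements on the member's line."""
    return [e for e in elements if e is not member and abs(e.y - member.y) <= row_tol]


def group_elements(elements, align_tol=2.0, row_tol=2.0):
    """Step a): place each element in the first group it matches and does not overlap."""
    groups = []
    for subject in elements:  # assumed completion condition: each element visited once
        for group in groups:
            # (i) assumed matching criterion: left edge aligns with the group's first element
            matches = abs(subject.x0 - group[0].x0) <= align_tol
            # (ii) does the subject overlap a neighbor of any element already in the group?
            blocked = any(
                overlaps(subject.x0, subject.x1, n.x0, n.x1)
                for m in group
                for n in row_neighbors(m, elements, row_tol)
                if n is not subject
            )
            # (iii) add only on a match with no overlap
            if matches and not blocked:
                group.append(subject)
                break
        else:
            groups.append([subject])  # no existing group accepted the subject
    return groups


def assign_columns(groups):
    """Step b): map group index -> column index using extents of earlier groups."""
    columns = []  # each entry: [x0, x1, [group indices]], extent set by the first group
    for gi, group in enumerate(groups):
        g_x0 = min(e.x0 for e in group)
        g_x1 = max(e.x1 for e in group)
        for col in columns:
            if overlaps(g_x0, g_x1, col[0], col[1]):
                col[2].append(gi)
                break
        else:
            columns.append([g_x0, g_x1, [gi]])
    columns.sort(key=lambda c: c[0])  # number columns left to right
    return {gi: ci for ci, col in enumerate(columns) for gi in col[2]}


def assign_rows(elements, row_tol=2.0):
    """Step c): map element -> row index by clustering vertical positions."""
    rows, row_index, last_y = {}, -1, None
    for e in sorted(elements, key=lambda e: e.y):
        if last_y is None or e.y - last_y > row_tol:
            row_index += 1
        last_y = e.y
        rows[e] = row_index
    return rows


def to_table(elements):
    """Step d): write each element into its row and its group's column."""
    if not elements:
        return []
    groups = group_elements(elements)
    col_of = assign_columns(groups)
    row_of = assign_rows(elements)
    n_rows = max(row_of.values()) + 1
    n_cols = max(col_of.values()) + 1
    table = [["" for _ in range(n_cols)] for _ in range(n_rows)]
    for gi, group in enumerate(groups):
        for e in group:
            r, c = row_of[e], col_of[gi]
            table[r][c] = (table[r][c] + " " + e.text).strip()
    return table


# Hypothetical input: a three-column table plus a wide title spanning two columns.
sample = [
    Element("Name", 0, 30, 0), Element("Qty", 50, 70, 0), Element("Price", 100, 130, 0),
    Element("Apple", 0, 35, 10), Element("3", 50, 55, 10), Element("1.20", 100, 120, 10),
    Element("Pear", 0, 28, 20), Element("10", 50, 60, 20), Element("0.80", 100, 120, 20),
    Element("Inventory", 0, 120, -10),
]
table = to_table(sample)
```

In this sample the wide "Inventory" element is left-aligned with the "Name" group, but its extent overlaps "Qty", a horizontal neighbor of "Name", so the exclusion in step a) iii) keeps it out of that group and it forms a group of its own. The result depends on the order in which elements are visited, which is one of the simplifications of this sketch.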
