System and method for automated processing of electronic documents

  • US 10,339,376 B2
  • Filed: 09/09/2014
  • Issued: 07/02/2019
  • Est. Priority Date: 06/25/2014
  • Status: Active Grant
First Claim

1. A system for automatically processing electronic documents, the system comprises:

  • a memory comprising programming instructions;

    a processor configured to execute the programming instructions stored in the memory and configured to:

    receive an electronic document comprising at least one of: a structured section or an unstructured section;

    convert the electronic document into a textual equivalent;

    scan the textual equivalent and demarcate those sections that correspond to one or more predetermined structural attributes;

    separate the one or more demarcated sections from the textual equivalent and retrieve the one or more demarcated sections corresponding to the structured sections and a remaining textual equivalent corresponding to the unstructured sections as distinct inputs;

    receive the one or more demarcated sections and the remaining textual equivalent as the distinct inputs;

    identify one or more master triggers within the received distinct inputs;

    generate one or more potential zones with the identified one or more master triggers, wherein the generated one or more potential zones is defined by at least one geometric shape formed by geometrically coupling the master triggers and co-triggers proximate to the master triggers into the geometric shape such that the master triggers and the co-triggers form one or more vertices of the geometric shape;

    generate one or more rules of extraction to determine at least one extraction type from a plurality of extraction types, wherein each of the plurality of extraction types represent a particular method of extraction, based on the type of electronic document, wherein the type of electronic document is ascertainable based on identification of a template type of the electronic document associated with the demarcated section; and

    capture the business relevant data contained in the generated one or more potential zones within the one or more demarcated sections and the remaining textual equivalent based on co-ordinates of the vertices of the geometric shape formed by the one or more master triggers and the co-triggers by applying the determined at least one extraction type.
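
The zone-generation and capture steps of the claim can be illustrated with a minimal sketch. It is not the patented implementation: the `Token` type, the function names, the proximity `radius`, and the trigger words are all hypothetical, and the "geometric shape" is simplified to the axis-aligned rectangle spanned by the trigger vertices. Tokens are assumed to carry page co-ordinates from an earlier text-conversion step.

```python
from dataclasses import dataclass
from math import hypot


@dataclass(frozen=True)
class Token:
    """A word from the textual equivalent with assumed page co-ordinates."""
    text: str
    x: float
    y: float


def build_zone(master, tokens, co_trigger_words, radius):
    """Collect the master trigger and every co-trigger within `radius` of it.

    The returned tokens act as the vertices of the potential zone.
    """
    vertices = [master]
    for tok in tokens:
        if tok is master:
            continue
        is_co_trigger = tok.text.lower() in co_trigger_words
        is_near = hypot(tok.x - master.x, tok.y - master.y) <= radius
        if is_co_trigger and is_near:
            vertices.append(tok)
    return vertices


def bounding_box(vertices):
    """Simplifying assumption: the shape is the rectangle spanned by the vertices."""
    xs = [v.x for v in vertices]
    ys = [v.y for v in vertices]
    return min(xs), min(ys), max(xs), max(ys)


def capture(tokens, vertices):
    """Return the text of non-trigger tokens lying inside the zone (edges included)."""
    x0, y0, x1, y1 = bounding_box(vertices)
    return [
        tok.text
        for tok in tokens
        if tok not in vertices and x0 <= tok.x <= x1 and y0 <= tok.y <= y1
    ]


# Hypothetical invoice fragment: "Invoice" is the master trigger,
# "Due" and "Total" are co-triggers close to it.
tokens = [
    Token("Invoice", 10, 10),
    Token("INV-001", 120, 10),
    Token("Due", 300, 10),
    Token("Total", 10, 100),
    Token("450.00", 120, 100),
    Token("Thanks", 10, 400),
]
master = next(t for t in tokens if t.text.lower() == "invoice")
zone = build_zone(master, tokens, {"due", "total"}, radius=300)
captured = capture(tokens, zone)
print(captured)  # ['INV-001', '450.00']
```

In this sketch the rectangle stands in for whatever shape the triggers form. A real system would presumably test containment against the actual polygon and apply the extraction type chosen for the document's template.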
