Coordinate-based document processing and data entry system and method

  • US 9,740,995 B2
  • Filed: 10/28/2013
  • Issued: 08/22/2017
  • Est. Priority Date: 10/28/2013
  • Status: Active Grant
First Claim

1. A computer-implemented method for document processing and data extraction, over a network, the method comprising the steps of:

  • receiving a first document at a processor, the first document containing data for extraction, wherein the processor performs at least the following:

    outputting the document in a preferred document format;

    receiving a selection of a first portion of the document;

    recognizing a type of data contained in the selection, wherein the type of data includes at least one type of text and at least one type of image;

    processing the first portion to extract a first layer of data and a second layer of data, wherein the first layer of data includes a plurality of text entries, and the second layer of data includes an image;

    automatically generating, based on the step of processing, a plurality of coordinate sets for the plurality of text entries;

    extracting, based on the plurality of coordinate sets, the plurality of text entries from the first layer;

    extracting the image from the second layer, wherein each of the one or more layers is extracted separately based on the at least one user preference;

    automatically generating and storing in computer memory, a structured data set that includes the extracted text data, the extracted text data being structured in the structured data set based on coordinates of the extracted text data;

    automatically generating an extraction rule based on the generated plurality of coordinate sets and the structured data set, the extraction rule executable by the processor to extract text entries of the at least one type of text from a second document received after the first document;

    receiving the second document from a computer system;

    automatically matching the second document based on the step of automatically generating the extraction rule; and

    extracting, based on the step of matching, data from the second document by executing the generated extraction rule, including based at least on a portion of the generated plurality of coordinate sets.
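
The text-layer steps of the claim (generating coordinate sets, building a coordinate-ordered data set, deriving a reusable extraction rule, matching a second document, and executing the rule) can be illustrated with a minimal Python sketch. This is not the patented implementation. Every name here (`Word`, `Box`, `coordinate_sets`, `structured_data`, `make_rule`, `matches`, `apply_rule`) is hypothetical, as are the fixed padding around each entry and the use of label text outside the selection as matching "anchors"; the claim does not specify how matching is performed. The image layer and user preferences are omitted.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    """A text entry with the page coordinates of its reference point."""
    text: str
    x: float
    y: float


@dataclass(frozen=True)
class Box:
    """An axis-aligned rectangle; y grows downward, as on a page."""
    left: float
    top: float
    right: float
    bottom: float

    def contains(self, w: Word) -> bool:
        return self.left <= w.x <= self.right and self.top <= w.y <= self.bottom


def coordinate_sets(words, selection, pad=5.0):
    """One padded box per text entry inside the user's selection.

    The padding is an assumption: it tolerates small layout shifts
    between the first document and later ones.
    """
    return [Box(w.x - pad, w.y - pad, w.x + pad, w.y + pad)
            for w in words if selection.contains(w)]


def structured_data(words, boxes):
    """Extracted text ordered by coordinates: top to bottom, then left to right."""
    hits = [w for w in words if any(b.contains(w) for b in boxes)]
    return [w.text for w in sorted(hits, key=lambda w: (w.y, w.x))]


def make_rule(words, selection):
    """Build a reusable rule from the first document.

    Assumption: text outside the selection (field labels and the like)
    serves as anchors for recognising documents with the same layout.
    """
    boxes = coordinate_sets(words, selection)
    anchors = frozenset(w.text for w in words if not selection.contains(w))
    return {"boxes": boxes, "anchors": anchors}


def matches(rule, words):
    """A second document matches if it contains every anchor text."""
    return rule["anchors"] <= {w.text for w in words}


def apply_rule(rule, words):
    """Execute the rule: extract whatever text falls in the stored boxes."""
    return structured_data(words, rule["boxes"])


# Hypothetical example data: two invoices sharing one layout.
first = [Word("Invoice No:", 10, 10), Word("A-17", 100, 10),
         Word("Total:", 10, 30), Word("42.00", 100, 30)]
second = [Word("Invoice No:", 10, 10), Word("B-99", 101, 11),
          Word("Total:", 10, 30), Word("7.50", 99, 29)]

rule = make_rule(first, Box(90, 0, 200, 40))
if matches(rule, second):
    print(apply_rule(rule, second))
```

In this sketch the rule stores only geometry and anchor text, so the second document's values are read from the same page positions without any new user selection.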

  • 3 Assignments