PERFORMING OPTICAL CHARACTER RECOGNITION USING SPATIAL INFORMATION OF REGIONS WITHIN A STRUCTURED DOCUMENT

  • US 20180032842A1
  • Filed: 07/26/2016
  • Published: 02/01/2018
  • Est. Priority Date: 07/26/2016
  • Status: Active Grant
First Claim

1. A computer-implemented method for identifying information in an electronic document, comprising:

  • obtaining a set of training documents for each template of a plurality of templates for the electronic document;

  • extracting spatial attributes for at least a first label region and at least a first corresponding value region from the set, the spatial attributes representing a position of at least the first label region and at least the first value region within the electronic document; and

  • training a classifier model based on the extracted spatial attributes, wherein the classifier model is used to identify the information in the electronic document.
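
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the claim does not specify a classifier type or a feature encoding, so the nearest-centroid classifier, the `Region` type, the `label:total` / `value:total` names, and the page-normalized bounding-box coordinates are all assumptions made for illustration. The sketch also covers a single template, whereas the claim recites a set of training documents for each of a plurality of templates.

```python
from collections import defaultdict
from dataclasses import dataclass
from math import dist


@dataclass(frozen=True)
class Region:
    """A bounding box from a training document (hypothetical type).

    Coordinates are assumed to be normalized to the page size (0..1).
    """
    kind: str      # e.g. "label:total" or "value:total" (illustrative names)
    x: float
    y: float
    width: float
    height: float


def spatial_attributes(region):
    """Extract the position and size of a region as a feature vector."""
    return (region.x, region.y, region.width, region.height)


def train_centroids(training_regions):
    """Train a nearest-centroid model: one mean feature vector per kind."""
    sums = defaultdict(lambda: [0.0, 0.0, 0.0, 0.0])
    counts = defaultdict(int)
    for region in training_regions:
        for i, value in enumerate(spatial_attributes(region)):
            sums[region.kind][i] += value
        counts[region.kind] += 1
    return {
        kind: tuple(total / counts[kind] for total in totals)
        for kind, totals in sums.items()
    }


def classify(model, attributes):
    """Return the kind whose centroid is closest to the given attributes."""
    return min(model, key=lambda kind: dist(model[kind], attributes))


# Label and value regions from two training documents of one template.
training = [
    Region("label:total", 0.10, 0.80, 0.15, 0.03),
    Region("value:total", 0.30, 0.80, 0.20, 0.03),
    Region("label:total", 0.12, 0.82, 0.15, 0.03),
    Region("value:total", 0.32, 0.82, 0.20, 0.03),
]
model = train_centroids(training)

# Classify an unseen region from a new document by its position alone.
predicted = classify(model, (0.31, 0.81, 0.19, 0.03))
```

Under these assumptions, a region located where the training documents placed the value of a field is identified as that value region without first reading its text, which is the role the claim assigns to the spatial attributes.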
