Two-dimensional document processing

  • US 10,540,579 B2
  • Filed: 05/18/2018
  • Issued: 01/21/2020
  • Est. Priority Date: 05/18/2018
  • Status: Active Grant
First Claim

1. A computer implemented method, comprising:

    performing optical character recognition on a document;

    generating a character grid using character information obtained from the optical character recognition, wherein the character grid is a two-dimensional down-sampled version of the document;

    applying a machine learning algorithm to the character grid;

    in response to the applying, generating a segmentation mask depicting semantic data of the document; and

    wherein generating the character grid further comprises:

    identifying a character of the document;

    determining a pixel area for the character;

    assigning an index value to represent the pixel area in the character grid; and

    down-sampling the document by a factor equal to the pixel area covering a character of the document.
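
The character-grid steps recited in the claim can be illustrated with a minimal sketch. It is not the patented implementation: the `CharBox` tuple layout, the function name `build_char_grid`, the index assignment scheme (integers in order of first appearance, with 0 for background), and the use of one fixed cell size as the down-sampling factor are all assumptions made for illustration. The OCR step and the machine learning step that produces the segmentation mask are omitted.

```python
from typing import Dict, List, Tuple

# Hypothetical OCR output record: (character, x, y, width, height) in pixels.
CharBox = Tuple[str, int, int, int, int]


def ceil_div(a: int, b: int) -> int:
    """Integer division rounded up."""
    return -(-a // b)


def build_char_grid(
    boxes: List[CharBox],
    page_w: int,
    page_h: int,
    cell_w: int,
    cell_h: int,
) -> Tuple[List[List[int]], Dict[str, int]]:
    """Build a down-sampled 2D grid of character indices.

    Assumption: cell_w x cell_h approximates the pixel area covered by
    one character, so each character collapses to roughly one grid cell.
    """
    vocab: Dict[str, int] = {}          # character -> index value (0 = background)
    cols = ceil_div(page_w, cell_w)
    rows = ceil_div(page_h, cell_h)
    grid = [[0] * cols for _ in range(rows)]

    for ch, x, y, w, h in boxes:
        # Assign an index value to represent this character's pixel area.
        idx = vocab.setdefault(ch, len(vocab) + 1)
        # Fill every grid cell that the character's bounding box overlaps.
        for r in range(y // cell_h, min(rows, ceil_div(y + h, cell_h))):
            for c in range(x // cell_w, min(cols, ceil_div(x + w, cell_w))):
                grid[r][c] = idx

    return grid, vocab


# Usage: a 20x40 pixel page holding three characters of 10x20 pixels each.
boxes = [("A", 0, 0, 10, 20), ("B", 10, 0, 10, 20), ("A", 0, 20, 10, 20)]
grid, vocab = build_char_grid(boxes, page_w=20, page_h=40, cell_w=10, cell_h=20)
print(grid)   # [[1, 2], [1, 0]]
print(vocab)  # {'A': 1, 'B': 2}
```

In this sketch the grid keeps the two-dimensional layout of the page while replacing pixels with character indices, which is the kind of input a segmentation model could then consume.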
