PDF extraction with text-based key

  • US 10,643,022 B2
  • Filed: 07/19/2018
  • Issued: 05/05/2020
  • Est. Priority Date: 07/19/2018
  • Status: Active Grant
First Claim

1. A computing device comprising:

  • an electronic processor; and

  • a memory coupled to the electronic processor, the memory including program instructions that, when executed by the electronic processor, cause the electronic processor to

      receive a standardized PDF (portable document format) report that is in a non-paragraph format and a configuration file including one or more values that correspond to one or more text-based keys in the standardized PDF report,

      determine X coordinates and Y coordinates of bounding boxes associated with the one or more text-based keys, the X coordinates associated with an X-direction and the Y coordinates associated with a Y-direction,

      determine one or more words in the standardized PDF report that share the Y coordinates of the bounding boxes associated with a first text-based key of the one or more text-based keys,

      sort the one or more words in the standardized PDF report that share the Y coordinates of the bounding boxes associated with the first text-based key based on respective X coordinates in the X-direction,

      determine a single word from the one or more words that is directly adjacent to the first text-based key, and

      control a display to display the single word that is directly adjacent to the first text-based key,

      wherein, to control the display to display the single word that is directly adjacent to the first text-based key, the program instructions, when executed by the electronic processor, further cause the electronic processor to generate a graphical user interface to display the single word that is directly adjacent to the first text-based key.
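
The lookup steps recited in the claim (locate the key's bounding box, collect the words that share its Y coordinates, sort them in the X-direction, take the word directly adjacent to the key) can be illustrated with a minimal Python sketch. This is not the patented implementation. The `Word` record, the `word_adjacent_to_key` helper, the `y_tol` tolerance, the single-token keys, the reading of "directly adjacent" as "next word to the right", and the sample data are all assumptions made for illustration. Word boxes are taken as already extracted from the PDF, and the display step is omitted.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Word:
    """Hypothetical record: one word plus its bounding box, as a PDF text
    extractor might report it (x0/x1 along X, y0/y1 along Y)."""
    text: str
    x0: float
    y0: float
    x1: float
    y1: float


def word_adjacent_to_key(words: List[Word], key: str,
                         y_tol: float = 0.5) -> Optional[str]:
    """Return the word immediately to the right of `key` on the same row,
    or None if the key is absent or is the last word on its row."""
    # Locate the key's bounding box (assumes a single-token key).
    key_box = next((w for w in words if w.text == key), None)
    if key_box is None:
        return None

    # Collect the words that share the key's Y coordinates.
    # The tolerance is an assumption to absorb small extraction jitter.
    same_row = [
        w for w in words
        if abs(w.y0 - key_box.y0) <= y_tol and abs(w.y1 - key_box.y1) <= y_tol
    ]

    # Sort those words in the X-direction.
    same_row.sort(key=lambda w: w.x0)

    # Pick the word directly after the key in X order.
    idx = same_row.index(key_box)
    return same_row[idx + 1].text if idx + 1 < len(same_row) else None


# Illustrative word boxes, deliberately out of reading order.
words = [
    Word("Jane", 120, 700, 150, 712),
    Word("Name:", 50, 700, 90, 712),
    Word("Doe", 155, 700, 180, 712),
    Word("Status:", 50, 680, 95, 692),
    Word("Complete", 120, 680, 175, 692),
]

# Hypothetical configuration: output field name -> text-based key in the report.
config = {"name": "Name:", "status": "Status:"}

extracted = {field: word_adjacent_to_key(words, key)
             for field, key in config.items()}
print(extracted)
```

Matching on Y coordinates first and sorting by X afterwards means the result does not depend on the order in which the extractor emits words, which matters for form-like, non-paragraph reports.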
