
INGESTION PLANNING FOR COMPLEX TABLES

  • US 20170116190A1
  • Filed: 10/17/2016
  • Published: 04/27/2017
  • Est. Priority Date: 10/23/2015
  • Status: Active Grant
First Claim

1. A computer program product for generating a plan for document processing, the computer program product comprising:

  • one or more computer-readable storage media and program instructions stored on the one or more computer-readable storage media, the program instructions comprising:

    program instructions to receive a plurality of electronic documents from a data store, by a computer using a network;

    program instructions to analyze the plurality of electronic documents, using the computer to identify a plurality of tabular data by performing a search for one or more table markers, based on the analyzed plurality of electronic documents;

    program instructions to identify textual data within the identified tabular data, by performing a first natural language search of the analyzed plurality of electronic documents;

    program instructions to generate textual hints, based on the identified textual data within the identified tabular data by associating identified textual data into a set using a second natural language search;

    program instructions to map the generated textual hints to a lookup set;

    program instructions to identify references, wherein references are based on mapped textual hints with associated identified textual data in the received plurality of electronic documents;

    program instructions to determine a count of identified references;

    program instructions to calculate a priority score based on the count of identified references, wherein the program instructions to calculate further comprises program instructions to multiply the count of identified references by a predetermined scale value;

    in response to program instructions to receive a priority score modifying value, wherein the priority score modifying value is a numerical value, program instructions to calculate a modified priority score, wherein the program instructions to calculate further comprises program instructions to multiply the priority score by the received priority score modifying value;

    program instructions to generate one or more ingestion plans based on the modified priority score; and

    program instructions to communicate the one or more generated ingestion plans by the computer using the network.
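
The scoring steps recited in the claim (count of identified references, multiplication by a predetermined scale value, optional multiplication by a received modifying value) can be illustrated with a short sketch. This is not the patented implementation: the names `TableCandidate`, `priority_score`, `modified_priority_score`, and `build_ingestion_plan` are hypothetical, and two behaviors are assumptions that the claim does not state, namely that a table with no received modifying value keeps its unmodified score, and that a plan orders tables from highest to lowest score.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass
class TableCandidate:
    """Hypothetical record for one identified table and its reference count."""
    table_id: str
    reference_count: int


def priority_score(reference_count: int, scale: float) -> float:
    # Claimed step: multiply the count of identified references
    # by a predetermined scale value.
    return reference_count * scale


def modified_priority_score(score: float, modifier: Optional[float] = None) -> float:
    # Claimed step: multiply the priority score by the received modifying value.
    # Assumption: if no modifying value was received, the score is left unchanged.
    return score if modifier is None else score * modifier


def build_ingestion_plan(
    candidates: List[TableCandidate],
    scale: float,
    modifiers: Optional[Dict[str, float]] = None,
) -> List[Tuple[str, float]]:
    # Assumption: the plan is an ordering of tables by descending score.
    # The claim only says the plan is "based on" the modified priority score.
    modifiers = modifiers or {}
    scored = [
        (
            c.table_id,
            modified_priority_score(
                priority_score(c.reference_count, scale),
                modifiers.get(c.table_id),
            ),
        )
        for c in candidates
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


example_plan = build_ingestion_plan(
    [TableCandidate("t1", 3), TableCandidate("t2", 5)],
    scale=2.0,
    modifiers={"t1": 2.0},
)
```

In this example, table `t1` has fewer references than `t2`, but its modifying value of 2.0 raises its score from 6.0 to 12.0, so it is placed ahead of `t2` (score 10.0) in the resulting plan.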
