×

Segmenting and interpreting a document, and relocating document fragments to corresponding sections

  • US 10,176,889 B2
  • Filed: 02/09/2017
  • Issued: 01/08/2019
  • Est. Priority Date: 02/09/2017
  • Status: Active Grant
First Claim
Patent Images

1. A computer program product comprising a non-transitory computer-readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause the processor to:

  • receive a document having section headers;

    segment the document into at least first and second sections based on the section headers;

    segment items in the first section into fragments including a first fragment and a second fragment;

    identify a section type for each of the fragments using multiple section type-specific lexicons that include a first section type-specific lexicon that corresponds to a section type of the first section and a second section type-specific lexicon that corresponds to a section type of the second section, wherein the first fragment is identified as corresponding to a different section type than the second fragment;

    determine a first quantity of first fragments of the multiple fragments and a second quantity of second fragments of the multiple fragments, wherein the first fragments correspond to a first section type of the first section and the second fragments correspond to a second section type of a second section of the document;

    determine that the first quantity of the first fragments exceeds the second quantity of the second fragments by a predetermined quantity; and

    based on exceeding the predetermined quantity, re-locate the second fragments to the second section in the document or reclassify the second fragments to correspond to the first section type.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×