Information extraction and annotation systems and methods for documents

  • US 10,387,557 B2
  • Filed: 05/09/2018
  • Issued: 08/20/2019
  • Est. Priority Date: 07/22/2013
  • Status: Active Grant
First Claim

1. A method, comprising:

    receiving, by a context analysis module, annotated documents, the annotated documents comprising annotated fields;

    analyzing, by the context analysis module, the annotated documents to determine contextual information for each of the annotated fields;

    determining discriminative sequences using the contextual information by:

    determining, by a contiguity heuristics module, contiguous common subsequences between aligned pairs of strings of the annotated documents;

    determining, by the contiguity heuristics module, a frequency of occurrence of similar contiguous common subsequences; and

    wherein the contiguity heuristics module generates a proposed rule from contiguous common subsequences having a desired frequency of occurrence;

    providing, by the context analysis module, the proposed rule to a document annotator; and

    applying, by the document annotator, the proposed rule to a target document to annotate the target document.
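
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the whitespace tokenization, the use of difflib.SequenceMatcher to align each pair of strings, the length and frequency thresholds, and the "[FIELD: ...]" output format are all assumptions made for illustration. The "contextual information" is assumed here to be the text that precedes each annotated field.

```python
from collections import Counter
from difflib import SequenceMatcher
from itertools import combinations
import re


def contiguous_common_subsequences(a, b, min_len=2):
    """Return contiguous token runs shared by token lists a and b.

    Assumption: SequenceMatcher stands in for the claim's string alignment.
    """
    matcher = SequenceMatcher(None, a, b, autojunk=False)
    return [
        tuple(a[m.a:m.a + m.size])
        for m in matcher.get_matching_blocks()
        if m.size >= min_len  # also drops the zero-length sentinel block
    ]


def propose_rule(contexts, min_len=2, min_frequency=2):
    """Count shared runs over all pairs of contexts and pick a frequent one.

    Assumption: "desired frequency" is modeled as a simple minimum count.
    """
    counts = Counter()
    for left, right in combinations(contexts, 2):
        counts.update(
            contiguous_common_subsequences(left.split(), right.split(), min_len)
        )
    frequent = [(seq, n) for seq, n in counts.items() if n >= min_frequency]
    if not frequent:
        return None
    # Prefer the most frequent run; break ties by length.
    seq, _ = max(frequent, key=lambda item: (item[1], len(item[0])))
    return " ".join(seq)


def annotate(document, rule, label="FIELD"):
    """Tag the token that follows the rule text in a target document."""
    pattern = re.compile(re.escape(rule) + r"\s+(\S+)")
    return pattern.sub(lambda m: f"{rule} [{label}: {m.group(1)}]", document)


# Hypothetical contexts: the text preceding an annotated date field
# in three annotated documents.
contexts = [
    "This agreement has an effective date of",
    "The lease has an effective date of",
    "Per the order, an effective date of",
]
rule = propose_rule(contexts)
target = "The policy has an effective date of 2013-07-22 and renews yearly."
print(rule)
print(annotate(target, rule))
```

In this sketch the run "an effective date of" is shared by two of the three pairs, so it meets the assumed frequency threshold and becomes the proposed rule, while the longer run "has an effective date of" occurs in only one pair and is discarded. A real system would likely use richer context (text on both sides of the field, formatting cues) and a more careful alignment step.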
