×

AUTOMATIC EXTRACTION USING MACHINE LEARNING BASED ROBUST STRUCTURAL EXTRACTORS

  • US 20100223214A1
  • Filed: 02/27/2009
  • Published: 09/02/2010
  • Est. Priority Date: 02/27/2009
  • Status: Abandoned Application
First Claim
Patent Images

1. A computer-implemented method comprising:

  • producing a trained machine learning model based at least in part on a plurality of documents;

    applying the trained machine learning model to a set of documents;

    based at least in part on the applying the trained machine learning model to the set of documents, determining a plurality of locations of a particular attribute in the set of documents;

    associating a set of locations with the particular attribute, based at least in part on the plurality of locations; and

    based at least in part on the set of locations, extracting, from a particular document, an attribute value corresponding to the particular attribute;

    wherein the method is performed by one or more computing devices programmed to be special purpose machines pursuant to program instructions.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×