×

Automatic Crowd Sourcing for Machine Learning in Information Extraction

  • US 20130066818A1
  • Filed: 09/12/2012
  • Published: 03/14/2013
  • Est. Priority Date: 09/13/2011
  • Status: Abandoned Application
First Claim
Patent Images

1. A method for enabling machine learning from unstructured documents, the method comprisinganalyzing, at an electronic device, one or more structured databases, thereby providing a mapping between a plurality of referenced character strings and a corresponding plurality of type labels;

  • providing, at the electronic device, a first unstructured document comprising a plurality of unstructured character strings;

    analyzing the first unstructured document to identify a first character string of the plurality of unstructured character strings which is associated with a first referenced character string of the plurality of referenced character strings;

    annotating, within the first unstructured document, the first character string with a first type label which is mapped to the first referenced character string; and

    determining a training set for machine learning from the first unstructured document comprising the annotation with the first type label.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×