Creation of structured data from plain text

  • US 7,324,936 B2
  • Filed: 03/05/2004
  • Issued: 01/29/2008
  • Est. Priority Date: 01/08/2001
  • Status: Expired due to Term
First Claim

1. A computerized method comprising:

    tokenizing a plain text description;

    creating parse trees from the tokenized plain text description based on grammar from a grammar storage area;

    generating an instance tree from each parse tree based upon an application domain specific natural markup language provided by a natural markup language model module;

    discarding each invalid or incomplete instance tree;

    choosing an instance tree from remaining instance trees representing a best map based upon a cost function;

    processing the best map with a domain markup language generator to generate a structured data representation.
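The steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The toy grammar (`GRAMMAR`), the domain model (`MODEL`), the cost function (a count of words the model cannot place), and every function name are hypothetical choices made for illustration. XML stands in for the structured data representation.

```python
import re
from itertools import product
from xml.etree.ElementTree import Element, SubElement, tostring

# Hypothetical grammar store: each word has one or more (category, value) readings.
GRAMMAR = {
    "orange": [("color", "orange"), ("item", "orange"), ("flavor", "orange")],
    "box": [("item", "box")],
    "large": [("size", "large")],
}

# Hypothetical domain model: root element, allowed slots, required slots.
MODEL = {
    "root": "product",
    "allowed": {"item", "color", "size"},
    "required": {"item"},
}


def tokenize(text):
    """Split a plain text description into lowercase word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def parse_trees(tokens):
    """Return every combination of readings; unknown words get one 'unknown' reading."""
    options = [GRAMMAR.get(tok, [("unknown", tok)]) for tok in tokens]
    return [list(combo) for combo in product(*options)]


def instance_tree(parse):
    """Map one parse onto the model's slots; return None if invalid or incomplete."""
    slots, skipped = {}, 0
    for category, value in parse:
        if category not in MODEL["allowed"]:
            skipped += 1      # a word the model cannot place
        elif category in slots:
            return None       # invalid: the same slot is filled twice
        else:
            slots[category] = value
    if not MODEL["required"].issubset(slots):
        return None           # incomplete: a required slot is missing
    return {"slots": slots, "skipped": skipped}


def cost(tree):
    """Assumed cost function: fewer unplaced words means a better map."""
    return tree["skipped"]


def generate(tree):
    """Emit the chosen instance tree as an XML string."""
    root = Element(MODEL["root"])
    for name in sorted(tree["slots"]):
        SubElement(root, name).text = tree["slots"][name]
    return tostring(root, encoding="unicode")


def text_to_structured(text):
    """Run the whole pipeline: tokenize, parse, map, filter, choose, generate."""
    candidates = [instance_tree(p) for p in parse_trees(tokenize(text))]
    valid = [t for t in candidates if t is not None]
    if not valid:
        raise ValueError("no valid instance tree for: " + text)
    return generate(min(valid, key=cost))


print(text_to_structured("A large orange box"))
```

For "A large orange box" the word "orange" yields three parses. The reading of "orange" as an item is discarded because it fills the item slot twice, and the flavor reading costs more than the color reading because the model has no flavor slot, so the color reading is chosen as the best map.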
