Information extraction using a trainable grammar

  • US 20060253273A1
  • Filed: 11/07/2005
  • Published: 11/09/2006
  • Est. Priority Date: 11/08/2004
  • Status: Abandoned Application
First Claim

1. A computer-implemented method for information extraction, comprising:

  • defining a stochastic context free grammar (SCFG) comprising symbols and rules applicable to the symbols, the symbols comprising at least one output concept;

  • training the SCFG on a tagged training corpus so as to determine probabilities of the rules and of one or more of the symbols; and

  • parsing a document using the rules and symbols responsively to the probabilities so as to extract occurrences of the at least one output concept from the document.
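
The three steps of the claim can be illustrated with a minimal sketch. It is not the method disclosed in the application: it assumes a grammar in Chomsky normal form, a tagged corpus given as small parse trees, relative-frequency counting as the training step (rule probabilities only), and a Viterbi-style CYK parser. All names (`train`, `parse`, `extract`, the `PERSON` concept, the toy sentences) are hypothetical and chosen only for illustration.

```python
from collections import defaultdict

# Assumed tree format: (symbol, word) for a lexical rule,
# (symbol, left_subtree, right_subtree) for a binary rule.

def count_rules(tree, counts):
    """Count every rule used in one tagged training tree."""
    sym = tree[0]
    if isinstance(tree[1], str):              # lexical rule: sym -> word
        counts[sym][(tree[1],)] += 1
    else:                                     # binary rule: sym -> B C
        counts[sym][tuple(child[0] for child in tree[1:])] += 1
        for child in tree[1:]:
            count_rules(child, counts)

def train(trees):
    """Estimate rule probabilities by relative frequency (an assumption)."""
    counts = defaultdict(lambda: defaultdict(int))
    for tree in trees:
        count_rules(tree, counts)
    probs = {}
    for lhs, rhs_counts in counts.items():
        total = sum(rhs_counts.values())
        for rhs, c in rhs_counts.items():
            probs[(lhs, rhs)] = c / total
    return probs

def parse(words, probs, start="S"):
    """Viterbi CYK: best[(i, j)][sym] = (probability, backpointer)."""
    n = len(words)
    best = defaultdict(dict)
    for i, w in enumerate(words):
        for (lhs, rhs), p in probs.items():
            if rhs == (w,):
                best[(i, i + 1)][lhs] = (p, w)
    for span in range(2, n + 1):
        for i in range(n - span + 1):
            j = i + span
            for k in range(i + 1, j):
                for (lhs, rhs), p in probs.items():
                    if len(rhs) != 2:
                        continue
                    left, right = rhs
                    if left in best[(i, k)] and right in best[(k, j)]:
                        score = p * best[(i, k)][left][0] * best[(k, j)][right][0]
                        if score > best[(i, j)].get(lhs, (0.0,))[0]:
                            best[(i, j)][lhs] = (score, (k, left, right))
    return best if start in best[(0, n)] else None

def extract(best, words, sym, i, j, concept, found):
    """Walk the best parse and collect the text spanned by `concept`."""
    back = best[(i, j)][sym][1]
    if sym == concept:
        found.append(" ".join(words[i:j]))
    if isinstance(back, tuple):               # binary rule: recurse into children
        k, left, right = back
        extract(best, words, left, i, k, concept, found)
        extract(best, words, right, k, j, concept, found)
    return found

# Toy tagged corpus; PERSON plays the role of the output concept.
corpus = [
    ("S", ("VERB", "call"), ("PERSON", ("FIRST", "john"), ("LAST", "smith"))),
    ("S", ("VERB", "email"), ("PERSON", ("FIRST", "mary"), ("LAST", "jones"))),
    ("S", ("VERB", "call"), ("PERSON", ("FIRST", "mary"), ("LAST", "smith"))),
]
probs = train(corpus)
words = "email john jones".split()
chart = parse(words, probs)
people = extract(chart, words, "S", 0, len(words), "PERSON", [])
print(people)
```

In this sketch the sentence "email john jones" never appears in the training corpus, yet the parser still recovers the `PERSON` span because each rule it needs was seen separately. A document containing an unseen word yields no parse (`parse` returns `None`); a practical system would need smoothing or a fallback rule, which is outside the scope of this illustration.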
