
System and method for extracting information from text using text annotation and fact extraction

  • US 7,912,705 B2
  • Filed: 01/19/2010
  • Issued: 03/22/2011
  • Est. Priority Date: 11/19/2003
  • Status: Expired due to Term
First Claim

1. A fact extraction tool set for extracting information from a document, implemented using a client-server hardware architecture, wherein the document includes text, comprising:

  • means for breaking the text into tokens;

  • a plurality of independent means for annotating the text with token attributes, constituent attributes, links, and tree-based attributes, using XML as a basis for representing the annotated text, wherein each of the means for annotating has at least one specific annotating function;

  • means for resolving conflicting annotation boundaries in the annotated text, to produce a single XML-based representation of the document with well-formed XML, wherein the conflicting annotation boundaries result from annotating the text using a plurality of independent means for annotating; and

  • means for extracting facts from the single XML-based representation of the document using text pattern recognition rules, wherein each text pattern recognition rule comprises a pattern that describes text of interest, a label that names the pattern for testing and debugging purposes, and an action that indicates what should be done in response to a matching of the pattern, wherein the text pattern recognition rules independently identify constituents by use of regular expression-based functionality, tree traversal functionality based on a language that can navigate XML representations of text, and user-defined matching functionality, and wherein the regular expression-based functionality identifies sequential constituents, and the tree traversal functionality identifies non-contiguous constituents that are distinct from the sequential constituents identified by the regular expression-based functionality.
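
The elements recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The sample sentence, the `span` stand-in for an annotator, the priority-order policy for dropping crossing spans, and the `Rule` class are all assumptions made for illustration. The claim does not say how conflicting boundaries are resolved, and the tree traversal here uses the limited XPath subset of Python's `xml.etree.ElementTree` rather than any language named in the patent.

```python
import re
import xml.etree.ElementTree as ET
from dataclasses import dataclass
from typing import Callable, Optional
from xml.sax.saxutils import escape

TEXT = "Acme Corp hired Jane Doe in 2003."  # hypothetical sample document

def tokenize(text):
    """Break text into (start, end, token) triples: words and punctuation."""
    return [(m.start(), m.end(), m.group()) for m in re.finditer(r"\w+|[^\w\s]", text)]

def span(text, phrase, label):
    """Hypothetical stand-in for one annotator: tag the first occurrence of phrase."""
    start = text.index(phrase)
    return (start, start + len(phrase), label)

# Output of four independent (hypothetical) annotators, listed in priority order.
# "chunk" (hired Jane) crosses "person" (Jane Doe), so both cannot be nested.
SPANS = [
    span(TEXT, "Acme Corp", "org"),
    span(TEXT, "Jane Doe", "person"),
    span(TEXT, "2003", "year"),
    span(TEXT, "hired Jane", "chunk"),
]

def crosses(a, b):
    """True if spans a and b overlap without one containing the other."""
    return a[0] < b[0] < a[1] < b[1] or b[0] < a[0] < b[1] < a[1]

def resolve_conflicts(spans):
    """Assumed policy: keep a span only if it crosses no higher-priority span."""
    kept = []
    for s in spans:
        if not any(crosses(s, k) for k in kept):
            kept.append(s)
    return kept

def to_xml(text, spans):
    """Serialize properly nested spans as inline tags inside a <doc> root."""
    out, stack, pos = [], [], 0
    for start, end, label in sorted(spans, key=lambda s: (s[0], -s[1])):
        while stack and stack[-1][0] <= start:   # close spans that ended earlier
            close_at, open_label = stack.pop()
            out.append(escape(text[pos:close_at]) + f"</{open_label}>")
            pos = close_at
        out.append(escape(text[pos:start]) + f"<{label}>")
        pos = start
        stack.append((end, label))
    while stack:                                  # close whatever is still open
        close_at, open_label = stack.pop()
        out.append(escape(text[pos:close_at]) + f"</{open_label}>")
        pos = close_at
    return "<doc>" + "".join(out) + escape(text[pos:]) + "</doc>"

@dataclass
class Rule:
    label: str                                        # names the rule for debugging
    pattern: Callable[[ET.Element], Optional[tuple]]  # describes text of interest
    action: Callable[[tuple], dict]                   # what to do on a match

def hire_pattern(root):
    """Regex over the serialized XML: sequential constituents org, 'hired', person."""
    xml = ET.tostring(root, encoding="unicode")
    m = re.search(r"<org>(.+?)</org> hired <person>(.+?)</person>", xml)
    return m.groups() if m else None

def org_year_pattern(root):
    """Tree traversal: org and year are non-contiguous children of <doc>."""
    org, year = root.findtext("org"), root.findtext("year")
    return (org, year) if org and year else None

RULES = [
    Rule("hire-regex", hire_pattern, lambda m: {"employer": m[0], "employee": m[1]}),
    Rule("org-year-tree", org_year_pattern, lambda m: {"org": m[0], "year": m[1]}),
]

def run_rules(root, rules):
    """Apply every rule and collect (label, fact) pairs for the ones that match."""
    facts = []
    for rule in rules:
        match = rule.pattern(root)
        if match is not None:
            facts.append((rule.label, rule.action(match)))
    return facts

if __name__ == "__main__":
    xml = to_xml(TEXT, resolve_conflicts(SPANS))
    root = ET.fromstring(xml)  # parsing succeeds only if the XML is well-formed
    print([tok for _, _, tok in tokenize(TEXT)])
    print(xml)
    print(run_rules(root, RULES))
```

In this sketch the `chunk` span is discarded because it crosses the higher-priority `person` span, which leaves a set of spans that nest cleanly and serialize to well-formed XML. The first rule matches constituents that appear one after another in the serialized text. The second pairs two elements that are separated by other material, which corresponds to the sequential versus non-contiguous distinction drawn at the end of the claim.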
