×

Efficient method for information extraction

  • US 20020165717A1
  • Filed: 04/08/2002
  • Published: 11/07/2002
  • Est. Priority Date: 04/06/2001
  • Status: Abandoned Application
First Claim
Patent Images

1. A system for extracting information from text documents, comprising:

  • an input module for receiving a plurality of text documents for information extraction, wherein said plurality of documents may be formatted in accordance with any one of a plurality of formats;

    an input conversion module for converting said plurality of text documents into a single format for processing;

    a tokenizer module for generating and assigning tokens to symbols contained in said plurality of text documents;

    an extraction module for receiving said tokens from said tokenizer module and extracting desired information from each of said plurality of text documents;

    an output conversion module for converting said extracted information into a single output format; and

    an output module for outputting said converted extracted information, wherein each of the above modules operate simultaneous and independently of one another so as to process said plurality of text documents in a pipeline fashion.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×