×

Systems and methods for content extraction

  • US 20070050708A1
  • Filed: 03/30/2006
  • Published: 03/01/2007
  • Est. Priority Date: 03/30/2005
  • Status: Active Grant
First Claim
Patent Images

1. A method for extracting content from input markup language text comprising:

  • (a) parsing the input markup language text into a first hierarchical data model;

    (b) generating a second hierarchical data model based on the first hierarchical data model using one or more filters to remove content from the first hierarchical data model; and

    (c) generating output markup language text from the second hierarchical data model.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×