×

Method for content mining of semi-structured documents

  • US 20030140311A1
  • Filed: 01/18/2002
  • Published: 07/24/2003
  • Est. Priority Date: 01/18/2002
  • Status: Active Grant
First Claim
Patent Images

1. A method for content mining of semi-structured documents comprising:

  • receiving a semi-structured document;

    converting said semi-structured document to a document-type independent format;

    analyzing formatting information of said semi-structured document;

    adding information to said semi-structured document describing said semi-structured document'"'"'s structure, based upon said analyzing; and

    mining said semi-structured document for specified information, wherein said added information facilitates said content mining.

View all claims
  • 9 Assignments
Timeline View
Assignment View
    ×
    ×