ROBUST WRAPPERS FOR WEB EXTRACTION

  • US 20100162097A1
  • Filed: 12/24/2008
  • Published: 06/24/2010
  • Est. Priority Date: 12/24/2008
  • Status: Active Grant
First Claim

1. A computer-implemented method to determine a robust wrapper representing a data item of a plurality of data items in a document represented by a markup language, comprising:

    based on archival data representative of a temporal history of the document, developing a model indicative of the temporal history;

    based on the developed model, determining robustness characteristics for a plurality of different wrappers representing associated paths to the data item in a representation of the document;

    based on a result of the determining operation, providing, as a result wrapper, one of the plurality of wrappers that has a desired robustness characteristic.
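The three steps of the claim can be illustrated with a minimal sketch. It is not the patented method. It assumes that wrappers are XPath-style path expressions, that the archival data is a list of well-formed document snapshots, and that a simple empirical survival rate over those snapshots stands in for the claimed model of the temporal history. All names here (`wrapper_extracts`, `robustness`, `most_robust_wrapper`, the sample snapshots) are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical archival snapshots of one page over time (assumed well-formed).
SNAPSHOTS = [
    "<html><body><div><span class='price'>10</span></div></body></html>",
    "<html><body><div>ad</div><div><span class='price'>12</span></div></body></html>",
    "<html><body><p>note</p><div><span class='price'>11</span></div></body></html>",
]
# The data item's known value in each snapshot (assumed to be labeled).
EXPECTED = ["10", "12", "11"]

# Candidate wrappers: different paths that reach the data item in the first snapshot.
CANDIDATES = ["./body/div[1]/span", ".//span[@class='price']"]


def wrapper_extracts(snapshot: str, wrapper: str, expected: str) -> bool:
    """Return True if the wrapper selects exactly one node holding the expected value."""
    root = ET.fromstring(snapshot)
    matches = root.findall(wrapper)
    return len(matches) == 1 and (matches[0].text or "").strip() == expected


def robustness(snapshots, wrapper, expected_values) -> float:
    """Fraction of snapshots on which the wrapper still extracts the data item.

    This empirical rate is a stand-in for a model of the document's history.
    """
    hits = sum(
        wrapper_extracts(snap, wrapper, value)
        for snap, value in zip(snapshots, expected_values)
    )
    return hits / len(snapshots)


def most_robust_wrapper(snapshots, candidates, expected_values) -> str:
    """Return the candidate wrapper with the highest robustness score."""
    return max(candidates, key=lambda w: robustness(snapshots, w, expected_values))


if __name__ == "__main__":
    for w in CANDIDATES:
        print(w, robustness(SNAPSHOTS, w, EXPECTED))
    print("result wrapper:", most_robust_wrapper(SNAPSHOTS, CANDIDATES, EXPECTED))
```

In this toy archive the positional path breaks when a new `div` is inserted ahead of the target, while the attribute-based path survives every snapshot, so the latter is returned as the result wrapper.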
