×

Information block extraction apparatus and method for Web pages

  • US 20050066269A1
  • Filed: 09/17/2004
  • Published: 03/24/2005
  • Est. Priority Date: 09/18/2003
  • Status: Abandoned Application
First Claim
Patent Images

1. A method for segmenting a Web page into information blocks with coherent contents comprising:

  • generating a structural information block tree of the Web page;

    clustering and merging the structural information blocks; and

    labeling the semantic of the resulting blocks.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×