×

Segmenting and indexing web pages using function-based object models

  • US 7,065,707 B2
  • Filed: 06/24/2002
  • Issued: 06/20/2006
  • Est. Priority Date: 06/24/2002
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computer-implemented method comprising:

  • (A) generating a logical layered structure for an original web page by steps comprising;

    (1) detecting one or more objects that comprise a web page, said one or more objects including basic objects and composite objects;

    (2) ascertaining functional properties of any said basic object and generating them from a basic function-based object model (FOM);

    (3) ascertaining functional properties of any said composite object and generating them from a basic FOM;

    (4) generating a specific FOM (SFOM) for any said basic object using the ascertained functional properties thereof;

    (5) generating an SFOM for any said composite object using the ascertained functional properties thereof; and

    (6) applying one or more rules to assemble the logical layer structure for the original web page using one or more of the basic objects, the composite objects, the basic FOM for the basic objects, the basic FOM for the composite objects, the SFOM for the basic objects, and the SFOM for the composite objects;

    (B) generating one or more page files by steps comprising;

    (1) performing an object processing process comprising;

    (a) applying one or more rules to remove objects in the logical layered structure;

    (b) associating each remaining object in the logical layered structure with a mobile control;

    (2) performing a form extraction process comprising;

    (a) applying one or more rules to remove one or more layers in the logical layered structure;

    (b) applying one or more rules to segment the logical layered structure into forms;

    (3) performing a file generation process comprising;

    (a) generating one said page file for each said form segmented from the logical layered structure; and

    (b) generating an index for the page files.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×