×

Page information collection program, page information collection method, and page information collection apparatus

  • US 7,757,164 B2
  • Filed: 02/28/2005
  • Issued: 07/13/2010
  • Est. Priority Date: 08/17/2004
  • Status: Active Grant
First Claim
Patent Images

1. A computer-readable storage medium having recorded thereon a page information collection program for collecting a set of pages associated by link information from a server on a network, the page information collection program causing a computer to execute the processing of:

  • acquiring contents of a page through the network in response to a page acquisition request and creating page information including the contents of the page and a response status code used for page acquisition, the creating the page information being automatically performed without user interaction;

    taking the page information created as target page information, comparing an assignment determination condition defining the requirements of page information to be included in each group and the target page information, to find a group having the assignment determination condition satisfied by the target page information, and storing the target page information put into the group in a storage block, the comparing the assignment determination condition being automatically performed without user interaction;

    creating an assignment determination condition satisfied by the target page information if the target page information does not satisfy the assignment determination condition of any group, creating a group corresponding to the created assignment determination condition, and storing the target page information put into the created group in the storage block the creating the assignment determination condition and the creating the group being automatically performed without user interaction; and

    extracting the link information only from the target page information put first into the group created and outputting the page acquisition request for acquiring the page based on the extracted link information, the extracting the link information and the outputting the page acquisition request being automatically performed without user interaction, wherein the link information is not extracted from the target page information put into already existing groups;

    wherein the assignment determination condition includes a URL and an option to select a query conformity field;

    wherein the assignment determination condition defines a requirement concerning the conformity of the page acquisition request and the response status code given when the contents of the page are acquired, the assignment determination condition is specific to each group, and contents of the assignment determination condition are individually changeable; and

    wherein in the comparing the assignment determination condition and the target page information, page information acquired by different page acquisition requests and having different response status codes is determined to belong to different groups.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×