Enabling a web-crawling robot to collect information from web sites that tailor information content to the capabilities of accessing devices

  • US 20040205114A1
  • Filed: 01/05/2004
  • Published: 10/14/2004
  • Est. Priority Date: 02/25/2003
  • Status: Active Grant
First Claim

1. A server, comprising:

  • a proxy function unit for relaying data exchanged between a web server site on a network and a web-crawling robot collecting contents by accessing the site;

  • a link deriving unit for expanding a link and acquiring information on a user agent corresponding to content of a link destination when said proxy function unit receives a response from the site to a content retrieval request issued from said web-crawling robot to said site and if a link destination of the link included in the response has dynamic content that differs according to a type of user agent which is an access source; and

  • a user agent information editing unit for converting user agent information included in the content retrieval request to user agent information corresponding to said content of the link destination when said proxy function unit receives the content retrieval request from said web-crawling robot issued based on derived links.
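
The three units recited in the claim can be pictured with a small sketch. It is an illustration under stated assumptions, not the patented implementation: the names `UserAgentProxy`, `on_response`, `on_request`, and the `agent_for` lookup are hypothetical, "expanding a link" is read here simply as resolving a relative link to an absolute URL, and how the server learns which user agent a link destination expects is left to the caller-supplied lookup.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class _LinkCollector(HTMLParser):
    """Collects the href values of <a> tags found in a response body."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


class UserAgentProxy:
    """Hypothetical stand-in for the claimed server (names are assumptions)."""

    def __init__(self, agent_for):
        # agent_for(url) returns the user agent string that the link
        # destination expects, or None when its content does not depend
        # on the user agent. This lookup is an assumption of the sketch.
        self.agent_for = agent_for
        self.derived = {}  # absolute URL -> user agent string

    def on_response(self, base_url, html):
        """Link deriving step: inspect links in a relayed response."""
        collector = _LinkCollector()
        collector.feed(html)
        for href in collector.hrefs:
            url = urljoin(base_url, href)  # resolve relative links
            agent = self.agent_for(url)
            if agent is not None:
                self.derived[url] = agent

    def on_request(self, url, headers):
        """User agent editing step: rewrite the header for derived links."""
        rewritten = dict(headers)  # leave the robot's own headers untouched
        agent = self.derived.get(url)
        if agent is not None:
            rewritten["User-Agent"] = agent
        return rewritten


if __name__ == "__main__":
    # Made-up rule: only URLs under /mobile/ serve agent-specific content.
    proxy = UserAgentProxy(
        lambda url: "ExamplePhone/1.0" if "/mobile/" in url else None
    )
    proxy.on_response(
        "http://example.com/index.html",
        '<a href="/mobile/news">news</a><a href="/about">about</a>',
    )
    print(proxy.on_request("http://example.com/mobile/news",
                           {"User-Agent": "Crawler/0.1"}))
```

In this sketch the robot keeps sending its own user agent string; only requests for links previously derived from a relayed response have the header replaced before being forwarded.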
