×

Automated access to web content based on log analysis

  • US 7,483,910 B2
  • Filed: 01/11/2002
  • Issued: 01/27/2009
  • Est. Priority Date: 01/11/2002
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method of determining parameter combinations for automated web crawler access to World Wide Web content that is accessible based on parameters resulting from real user interactions with a World Wide Web site, said method comprising:

  • maintaining at least one log file containing user queries resulting from previous real user HTML interactions with said World Wide Web, said user queries comprising entries;

    analyzing said log file to determine parameter combinations and to generate synthetic queries for input to said web crawler, said web crawler using said input for automated access to said World Wide Web content, said analyzing step further comprising;

    ranking entries according to their frequency of occurence;

    for a set of entries resulting from unlimited text entries, excluding entries ranked below a predetermined number; and

    wherein said synthetic queries are determined by producing combinations of entries from each set of entries.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×