×

Method for estimating coverage of web search engines

  • US 6,711,568 B1
  • Filed: 11/08/2000
  • Issued: 03/23/2004
  • Est. Priority Date: 11/25/1997
  • Status: Expired due to Term
First Claim
Patent Images

1. A computerized method for comparing search engine indices and estimating coverage of at least one search engine, each search engine maintaining an index of words of pages located at specific addresses in a network, wherein the estimate of coverage indicates the relative sizes of the indices of the first and second search engine, and the relative amount of overlap between the first and second search engine, comprising:

  • generating a random query, the random query being a logical combination of words found in a lexicon of words;

    submitting the random query to the first search engine;

    receiving a set of URLs in response to the random query;

    randomly selecting a particular URL identifying a sample page;

    generating a strong query for the sample page;

    submitting the strong query to a second search engine;

    comparing result information received in response to the strong query to determine if the second search engine has indexed the sample page; and

    estimating the relative sizes of the indices of the first and second search engines by dividing a fraction of a first set of pages sampled from the second search engine that are contained in the first search engine by a fraction of a second set of pages sampled from the first search engine that are contained in the second search engine.

View all claims
  • 9 Assignments
Timeline View
Assignment View
    ×
    ×