
Method for learning character patterns to interactively control the scope of a web crawler

  • US 6,411,952 B1
  • Filed: 06/24/1998
  • Issued: 06/25/2002
  • Est. Priority Date: 06/24/1998
  • Status: Expired due to Fees
First Claim

1. A computer implemented method for searching a network for resources, where each resource has an associated address specified as a character string and resources are connected by links in the form of the addresses, comprising:

  • searching the network to locate an initial set of resources in accordance with a defined scope;

  • receiving data identifying positive and negative examples from the initial set of resources;

  • inferring a rule from the positive and negative examples to limit the scope wherein the rule comprises patterns of character strings representing addresses; and

  • performing a subsequent search of the network according to the scope as limited by the inferred rule to locate a subsequent set of resources.
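
The four steps of the claim can be illustrated with a minimal sketch. The claim text does not say which pattern language or inference procedure is used, so the sketch assumes the simplest case: each pattern is an address prefix, chosen as the shortest prefix of a positive example that matches no negative example. The names `infer_prefix_rule`, `in_scope`, `crawl`, and the `site.example` addresses are hypothetical, and an in-memory link table stands in for the network.

```python
from collections import deque


def infer_prefix_rule(positives, negatives):
    """Infer a set of address prefixes covering every positive example
    and no negative example (assumption: patterns are plain prefixes)."""
    prefixes = set()
    for url in positives:
        # Grow the prefix one character at a time until no negative matches it.
        for end in range(1, len(url) + 1):
            candidate = url[:end]
            if not any(neg.startswith(candidate) for neg in negatives):
                prefixes.add(candidate)
                break
        else:
            # Every prefix of this positive also matches some negative.
            raise ValueError(f"{url!r} cannot be separated from the negatives")
    return prefixes


def in_scope(url, prefixes):
    """True if the address matches at least one inferred pattern."""
    return any(url.startswith(p) for p in prefixes)


def crawl(seeds, links, allowed):
    """Breadth-first search over `links` (address -> linked addresses),
    following only links whose target satisfies allowed(address).
    Seeds are always visited."""
    seen, queue, found = set(seeds), deque(seeds), []
    while queue:
        url = queue.popleft()
        found.append(url)
        for target in links.get(url, []):
            if target not in seen and allowed(target):
                seen.add(target)
                queue.append(target)
    return found


# Hypothetical link table standing in for the network.
LINKS = {
    "http://site.example/": [
        "http://site.example/docs/a",
        "http://site.example/blog/1",
    ],
    "http://site.example/docs/a": ["http://site.example/docs/b"],
    "http://site.example/blog/1": ["http://site.example/blog/2"],
}
seed = ["http://site.example/"]

# Step 1: initial search within the defined scope (here: everything reachable).
initial = crawl(seed, LINKS, lambda u: True)

# Step 2: the user marks examples from the initial set.
positives = ["http://site.example/docs/a"]
negatives = ["http://site.example/blog/1"]

# Step 3: infer a rule made of character-string patterns over addresses.
rule = infer_prefix_rule(positives, negatives)

# Step 4: subsequent search, with the scope limited by the inferred rule.
subsequent = crawl(seed, LINKS, lambda u: in_scope(u, rule))
```

With these inputs the inferred rule is the single prefix `http://site.example/d`, so the second search follows the `docs` pages and skips the `blog` pages. A character-level shortest prefix generalizes aggressively; a sketch that cut prefixes only at `/` boundaries would produce more conservative patterns.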
