×

Method and system for trawling the World-wide Web to identify implicitly-defined communities of web pages

  • US 6,886,129 B1
  • Filed: 11/24/1999
  • Issued: 04/26/2005
  • Est. Priority Date: 11/24/1999
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for pre-identifying implicitly defined communities including groups of pages of common interest from a collection of hyper-linked pages, wherein the communities have not been previously identified, comprising the steps of:

  • identifying a collection of hyperlinked pages from a plurality of sites, wherein each of the sites includes one or more hyper-linked pages;

    identifying hyper-links between any two pages on a same site, wherein the same site is included within the plurality of sites;

    removing the identified hyper-links between the two pages on a same site;

    identifying a plurality of (i,j)-cores within the identified collection, the (i,j)-cores including a first set of hyperlinked pages and a second set of hyper-linked pages, wherein each page in the first set of hyperlinked pages points to every page in the second set of hyperlinked pages, and where i and i are the numbers of hyper-linked pages in the first set and hyper-linked pages in the second set, respectively, that appear in each of the identified (i,j)-cores; and

    expanding each of the identified (i,j)-cores into a fully community, the full community being a subset of the pages reading a particular topic.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×