×

Method to efficiently partition large hyperlinked databases by hyperlink structure

  • US 6,675,170 B1
  • Filed: 06/29/2000
  • Issued: 01/06/2004
  • Est. Priority Date: 08/11/1999
  • Status: Active Grant
First Claim
Patent Images

1. A method for partitioning a database containing a plurality of documents into desired and undesired type documents, the plurality of documents containing text and/or links to and from other documents in the database, the method comprising the steps of:

  • providing a source document of the desired type, the source document including a collection of seed documents linked to more similar documents of the desired type than to dissimilar documents of the undesired type;

    providing a sink document for providing access to the database, the sink document is generic and representative of the database;

    identifying a cut-set of links which is the smallest set of links such that removing them from the database completely disconnects the source document and its linked documents from the sink document and its linked documents thereby defining first and second subsets of documents, respectively; and

    defining the first subset of documents as desired type documents and the remaining documents as undesired type documents.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×