
XML: finding authoritative pages for mining communities based on page structure criteria

  • US 6,778,997 B2
  • Filed: 01/05/2001
  • Issued: 08/17/2004
  • Est. Priority Date: 01/05/2001
  • Status: Expired due to Fees
First Claim

1. A method of determining a set of well-formed hyperlinked documents based upon an analysis of the links between and structure of documents within a larger set of hyperlinked documents, said well-formed hyperlinked documents being authorities on a specified topic, said method comprising:

    obtaining a base set of hyperlinked documents containing documents relevant to said specified topic and documents which are authorities on said specified topic;

    determining a structure score for each document within said base set;

    setting an authority weight and a hub weight of each document equal to said document's corresponding structure score;

    for each document, updating said authority weight of the document to equal a sum of hub weights of all documents within said base set pointing to the document;

    for each document, updating said hub weight of the document to equal a sum of authority weights of all documents within said base set the document is pointing to;

    identifying a predetermined number of documents having the highest valued authority weights as said authorities on said specified topic.
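
The steps recited in the claim can be sketched roughly as follows. This is an illustrative reading only, not the patented implementation: the function name `rank_authorities`, its parameters, the fixed iteration count, and the per-round normalization are all assumptions that the claim does not state. The structure scores are taken as given inputs, since the claim does not say how they are computed, and the hub update here uses the authority weights just computed in the same round, which is also an assumption.

```python
from math import sqrt
from typing import Dict, List, Set


def rank_authorities(
    links: Dict[str, Set[str]],        # hypothetical input: document -> documents it points to
    structure_score: Dict[str, float],  # hypothetical input: one precomputed score per base-set document
    top_k: int,                         # the "predetermined number" of authorities to return
    iterations: int = 20,               # assumption: the claim does not fix an iteration count
) -> List[str]:
    docs = list(structure_score)

    # Seed both weights with each document's structure score.
    auth = dict(structure_score)
    hub = dict(structure_score)

    # Precompute inbound links, keeping only links inside the base set.
    inbound: Dict[str, Set[str]] = {d: set() for d in docs}
    for src, targets in links.items():
        for dst in targets:
            if src in inbound and dst in inbound:
                inbound[dst].add(src)

    for _ in range(iterations):
        # Authority weight = sum of hub weights of documents pointing to this one.
        new_auth = {d: sum(hub[s] for s in inbound[d]) for d in docs}
        # Hub weight = sum of authority weights of documents this one points to.
        new_hub = {
            d: sum(new_auth[t] for t in links.get(d, ()) if t in inbound)
            for d in docs
        }
        # Assumption: rescale each round so the weights stay bounded.
        a_norm = sqrt(sum(v * v for v in new_auth.values())) or 1.0
        h_norm = sqrt(sum(v * v for v in new_hub.values())) or 1.0
        auth = {d: v / a_norm for d, v in new_auth.items()}
        hub = {d: v / h_norm for d, v in new_hub.items()}

    # Return the documents with the highest authority weights.
    return sorted(docs, key=lambda d: auth[d], reverse=True)[:top_k]


# Toy base set: "a" and "b" act as hubs, "c" and "d" as candidate authorities.
example_links = {"a": {"c", "d"}, "b": {"c"}}
example_scores = {"a": 1.0, "b": 1.0, "c": 1.0, "d": 1.0}
top_two = rank_authorities(example_links, example_scores, top_k=2)
```

In this toy graph, "c" is pointed to by both hubs and "d" by only one, so "c" ranks first. With non-uniform structure scores, the seed values change how much weight each document contributes in the early rounds.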
