System and method for the automatic recognition of relevant terms by mining link annotations

  • US 6,651,059 B1
  • Filed: 11/15/1999
  • Issued: 11/18/2003
  • Est. Priority Date: 11/15/1999
  • Status: Expired due to Fees
First Claim

1. A system for automatically and iteratively mining relevant terms comprising:

  • a metadata extractor for extracting hypertext links from a document, the hypertext links containing metadata terms c_{n,m};

    a document vector module for creating a vector for the document, using the hypertext links;

    an association module for measuring the number of documents that contain the metadata terms c_{n,m} in the hypertext links to perform a statistical analysis;

    wherein the association module discovers association rules from the document vector based primarily on the hypertext links;

    wherein the association rules comprise a support metric for an association rule (X|Y), where X and Y are sets of terms, and where a support p(X, Y) is defined as a joint probability of the frequency of co-occurrence of the sets of terms X and Y; and

    wherein the association rules further comprise a hybrid metric H(s,c) that normalizes a support function n(s) and a confidence function n(c), and is expressed as follows;
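
The expression for the hybrid metric H(s,c) is not reproduced in this excerpt, so the sketch below does not attempt it. It illustrates only the support measure p(X, Y) described in the claim, computed as the fraction of documents whose link annotations contain every term in both X and Y. The `confidence` function uses the conventional association-rule definition p(X, Y) / p(X), which is an assumption and is not taken from the claim text. The function names, the sample terms, and the representation of each document as a set of annotation terms are likewise hypothetical.

```python
from typing import FrozenSet, Sequence

# Each document is modelled (as an assumption) by the set of metadata terms
# found in its hypertext link annotations.
Doc = FrozenSet[str]


def support(docs: Sequence[Doc], x: FrozenSet[str], y: FrozenSet[str]) -> float:
    """Joint probability that a document's link terms contain all of X and Y."""
    if not docs:
        return 0.0
    both = x | y
    hits = sum(1 for terms in docs if both <= terms)
    return hits / len(docs)


def confidence(docs: Sequence[Doc], x: FrozenSet[str], y: FrozenSet[str]) -> float:
    """Conventional confidence p(X, Y) / p(X); assumed, not taken from the claim."""
    x_hits = sum(1 for terms in docs if x <= terms)
    if x_hits == 0:
        return 0.0
    both = x | y
    xy_hits = sum(1 for terms in docs if both <= terms)
    return xy_hits / x_hits


# Hypothetical sample data: four documents and their link-annotation terms.
docs = [
    frozenset({"java", "applet"}),
    frozenset({"java", "applet", "browser"}),
    frozenset({"java", "coffee"}),
    frozenset({"browser"}),
]

s = support(docs, frozenset({"java"}), frozenset({"applet"}))
c = confidence(docs, frozenset({"java"}), frozenset({"applet"}))
print(f"support={s:.3f} confidence={c:.3f}")
```

In this sample, two of the four documents contain both "java" and "applet", and three contain "java", so the support is 0.5 and the assumed confidence is 2/3.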
