×

System and method for incorporating anchor text into ranking search results

  • US 7,739,277 B2
  • Filed: 09/30/2004
  • Issued: 06/15/2010
  • Est. Priority Date: 09/30/2004
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computer-implemented method for presenting a ranking of search results, comprising:

  • providing an index to a plurality of documents including;

    a main index associating with each of the documents a frequency of one or more terms being included in each of the documents;

    an anchor text index associating with each of the documents an anchor text frequency of the one or more terms being included in anchor text in a source document referencing each of the documents;

    receiving a query including at least one query term;

    applying the query to the index to yield results of the query identifying one or more of the documents that include the at least one query term;

    applying a scoring function to generate a score for each of the one or more documents included in the results of the query, wherein the scoring function (score) includes one of;

    score =

    ( wtf B + wtf Anchor B Anchor )

    ( k 1 + 1 )
    k 1

    ( ( 1 - b ) + b

    w



    l
    avw



    l
    )
    + ( wtf + wtf Anchor )
    ×

    log

    ( N n )
    ;



    and
    score =

    ( wtf B + wtf Anchor B Anchor )

    ( k 1 + 1 )
    k 1 + ( wtf B + wtf Anchor B Anchor )
    ×

    log

    ( N n )
    ,
    where;

    wtf is a weighted term frequency applying a weight to a frequency with which a given query term is included in the document;

    wtfAnchor is a weighted term frequency applying a weight to a frequency with which the given query term is included in anchor text referencing the document;

    k1 is a constant;

    b is a constant;

    wdl is a weighted document length applying a weight to a length of the document being scored;

    avwdl is an average weighted document length of all documents being scored;

    N is the number of documents on the network; and

    n is the number of documents including at least one appearance of a given query term; and

    generating an output of the ranked results of the query to be displayed to a user.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×