×

Pseudo-anchor text extraction for vertical search

  • US 7,657,507 B2
  • Filed: 03/02/2007
  • Issued: 02/02/2010
  • Est. Priority Date: 03/02/2007
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computer-implemented method comprising:

  • via a processor executing computer-readable instructions;

    extracting from a digital corpus an object;

    extracting from the digital corpus a pseudo-anchor text associated with the object, wherein extracting the pseudo-anchor text comprises;

    extracting from the digital corpus a parallel object identified with a second object identifier;

    identifying an occurrence of the object in the digital corpus;

    selecting a first candidate anchor block based on the occurrence of the object;

    identifying an occurrence of the parallel object in the digital corpus;

    selecting a second candidate anchor block based on the occurrence of the parallel object;

    comparing similarity between the first identifier and the second identifier;

    adding the first candidate anchor block and the second candidate anchor block to a common candidate anchor block set if the similarity between the first identifier and the second identifier satisfies a specified threshold; and

    extracting the pseudo-anchor text from the common candidate anchor block set; and

    making the pseudo-anchor text available for searching.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×