×

Determining corresponding terms written in different formats

  • US 9,734,197 B2
  • Filed: 03/06/2014
  • Issued: 08/15/2017
  • Est. Priority Date: 07/06/2000
  • Status: Expired due to Term
First Claim
Patent Images

1. A computer-implemented method comprising:

  • receiving an input term in a first language format;

    identifying two different groups of hyperlinks that each link to a same plurality of intermediary documents, wherein the different groups of hyperlinks have anchor texts in different respective language formats, including;

    identifying a first group of hyperlinks each having a respective first anchor text that includes the input term in the first language format, andidentifying a second group of hyperlinks each having a respective second anchor text in a second language format;

    determining, from all of the second anchor texts of the second group of hyperlinks, a second term in the second language format that corresponds to the input term in the first language format, including;

    computing, by one or more computers, a total count of terms, including duplicates, occurring in all of the second anchor texts of the second group of hyperlinks,computing, by one or more computers, a respective individual count of occurrences, in all of the second anchor texts of the second group of hyperlinks, of each of a plurality of terms in the second language format that occur in all of the second anchor texts of the second group of hyperlinks,computing, by one or more computers, a respective score for each of the plurality of terms in the second language format that occur in all of the second anchor texts of the second group of hyperlinks, including comparing, for each term of the plurality of terms in the second language format that occur in all of the second anchor texts of the second group of hyperlinks, the respective individual count for the term to the total count of terms occurring in the second anchor texts of the second group of hyperlinks, anddesignating a highest-scoring term among the plurality of terms in the second language format that occur in all of the second anchor texts of the second group of hyperlinks as the second term in the second language format that corresponds to the input term in the first language format;

    receiving a first query having the input term in the first language format;

    generating a revised query that includes the second term in the second language format;

    obtaining search results using the revised query; and

    providing the search results obtained using the revised query in response to receiving the first query.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×