DETERMINING CORRESPONDING TERMS WRITTEN IN DIFFERENT FORMATS
First Claim
Patent Images
1. A method comprising:
- identifying a first set of anchor text written in a first format and containing a given term;
identifying a set of documents to which the first set of anchor text points;
identifying a second set of anchor text written in a second format and pointing to the identified set of documents;
analyzing the second set of anchor text to determine that a representation of the given term in the first format corresponds to a representation of the given term in the second format.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods and apparatus consistent with the invention allow a user to submit an ambiguous search query and to receive relevant search results. Queries can be expressed using character sets and/or languages that are different from the character set and/or language of at least some of the data that is to be searched. A translation between these character sets and/or languages can be performed by examining the use of terms in aligned text. Probabilities can be associated with each possible translation. Refinements can be made to these probabilities by examining user interactions with the search results.
22 Citations
17 Claims
-
1. A method comprising:
-
identifying a first set of anchor text written in a first format and containing a given term; identifying a set of documents to which the first set of anchor text points; identifying a second set of anchor text written in a second format and pointing to the identified set of documents; analyzing the second set of anchor text to determine that a representation of the given term in the first format corresponds to a representation of the given term in the second format. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A computer program product embodied on a non-transitory computer-readable medium, the computer program product including instructions, which when executed by a computer system, are operable to cause the computer system to perform operations comprising:
-
identifying a first set of anchor text written in a first format and containing a given term; identifying a set of web pages to which the first set of anchor text points; identifying a second set of anchor text written in a second format and pointing to the identified set of web pages; determining a probability that a representation of the given term in the first format corresponds to a representation of the given term in the second format. - View Dependent Claims (15, 16, 17)
-
Specification