TRANSLITERATION FOR QUERY EXPANSION
First Claim
1. A computer-implemented method comprising:
- identifying, using one or more computers, a plurality of transliterated terms in a target language;
for each transliterated term of the plurality of transliterated terms in the target language, mapping the transliterated term to one or more terms in a source language; and
for a first transliterated term of the plurality of transliterated terms in the target language, identifying one or more second transliterated terms of the plurality of transliterated terms in the target language as candidate synonyms of the first transliterated term, where each of the one or more second transliterated terms is mapped to at least one term in the source language that is also mapped from the first transliterated term.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer program products, for identifying candidate synonyms of transliterated terms for query expansion. In one aspect, a method includes identifying multiple transliterated terms in a target language. For each transliterated term of the multiple transliterated terms in the target language, the transliterated term is mapped to one or more terms in a source language. For a first transliterated term of the multiple transliterated terms in the target language, one or more second transliterated terms of the multiple transliterated terms in the target language are identified as candidate synonyms of the first transliterated term, where each of the one or more second transliterated terms is mapped to at least one term in the source language that is also mapped from the first transliterated term.
209 Citations
20 Claims
-
1. A computer-implemented method comprising:
-
identifying, using one or more computers, a plurality of transliterated terms in a target language; for each transliterated term of the plurality of transliterated terms in the target language, mapping the transliterated term to one or more terms in a source language; and for a first transliterated term of the plurality of transliterated terms in the target language, identifying one or more second transliterated terms of the plurality of transliterated terms in the target language as candidate synonyms of the first transliterated term, where each of the one or more second transliterated terms is mapped to at least one term in the source language that is also mapped from the first transliterated term. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer-implemented method comprising:
-
generating, using one or more computers, a training group of possible transliterated synonyms in a target language; training a probabilistic model using the training group to learn probabilities of spelling variations in transliterated synonyms in the target language; and applying the probabilistic model to a particular transliterated term in the target language to identify one or more candidate synonyms of the particular transliterated term.
-
-
18. A system comprising:
one or more computers configured to perform operations including; identifying a plurality of transliterated terms in a target language; for each transliterated term of the plurality of transliterated terms in the target language, mapping the transliterated term to one or more terms in a source language; and for a first transliterated term of the plurality of transliterated terms in the target language, identifying one or more second transliterated terms of the plurality of transliterated terms in the target language as candidate synonyms of the first transliterated term, where each of the one or more second transliterated terms is mapped to at least one term in the source language that is also mapped from the first transliterated term.
-
19. A system comprising:
one or more computers configured to perform operations including; generating a training group of possible transliterated synonyms in a target language; training a probabilistic model using the training group to learn probabilities of spelling variations in transliterated synonyms in the target language; and applying the probabilistic model to a particular transliterated term in the target language to identify one or more candidate synonyms of the particular transliterated term.
-
20. A computer-implemented method comprising:
-
identifying, using one or more computers, a plurality of transliterated terms in a target language; for a first transliterated term of the plurality of transliterated terms in the target language, identifying one or more second transliterated terms of the plurality of transliterated terms in the target language as candidate synonyms of the first transliterated term; and using the candidate synonyms of the first transliterated term to expand queries including the first transliterated term.
-
Specification