Conversion of input text strings
First Claim
1. A method performed by data processing apparatus, the method comprising:
- receiving, with one or more processors at a server, an input string having a plurality of terms, the input string being in a first form, wherein a given sequence of the plurality of terms refers to a geographic feature, wherein the given sequence is annotated with a geographic-feature type selected from a plurality of geographic-feature types each indicating a characteristic of an entity in the physical world corresponding to the geographic feature, and wherein the given sequence is stored in an annotated format with the geographic-feature type in a database of geographic labels prior to receiving the input string;
transforming, with one or more processors, the input string from the first form to a second form, the transforming including;
applying one or more rules to the input string to identify one or more terms for translation, the one or more identified terms being fewer than the plurality of terms, wherein at least some of the rules are applied in response to a match between a feature type of the respective rule and the geographic-feature type with which the given sequence is annotated to indicate the characteristic of the corresponding entity in the physical world,translating the identified one or more terms to one or more translated terms in the second form, andtransliterating at least some of the remaining terms of the plurality of terms into transliterated terms in the second form, including selecting one or more transliteration rules for application in accordance with the indicated characteristic of the entity, wherein when the given sequence of the plurality of terms is annotated with a first geographic-feature type of the plurality of geographic-feature types, a first rule of the one or more rules identifies a specific term in the input string for translation in response to a match between a first feature type of the first rule and the first geographic-feature type, and when the given sequence of the plurality of terms is annotated with a second geographic-feature type of the plurality of geographic-feature types, a second rule of the one or more rules identifies the same specific term in the input string for transliteration in response to a match between a second feature type of the second rule and the second geographic-feature type, wherein the specific term is translated or transliterated to a term having the same grammatical form as the specific term;
the method further comprising;
concatenating, with one or more processors, at least the translated and transliterated terms to form a hybrid output string in the second form; and
storing the hybrid output string in the database of geographic labels; and
when a map of a geographic region including the entity is requested for display;
(i) retrieving the hybrid output string from the database and (ii) providing, via a network interface, the hybrid output string along with map data for presenting the map with the hybrid output string at a client device.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for transforming text strings. In general, one aspect of the subject matter described in this specification can be embodied in methods that include the actions of receiving input string having a plurality of terms, the input string being in a first form; transforming the input string from the first form to a second form including: applying one or more rules to the input string to identify one or more terms for translation, the one or more identified terms being fewer than the plurality of terms, translating the identified one or more terms to one or more translated terms in the second form, and transliterating the remaining terms of the plurality of terms into transliterated terms in the second form; and concatenating the translated and transliterated terms to form a hybrid output string in the second form.
-
Citations
22 Claims
-
1. A method performed by data processing apparatus, the method comprising:
-
receiving, with one or more processors at a server, an input string having a plurality of terms, the input string being in a first form, wherein a given sequence of the plurality of terms refers to a geographic feature, wherein the given sequence is annotated with a geographic-feature type selected from a plurality of geographic-feature types each indicating a characteristic of an entity in the physical world corresponding to the geographic feature, and wherein the given sequence is stored in an annotated format with the geographic-feature type in a database of geographic labels prior to receiving the input string; transforming, with one or more processors, the input string from the first form to a second form, the transforming including; applying one or more rules to the input string to identify one or more terms for translation, the one or more identified terms being fewer than the plurality of terms, wherein at least some of the rules are applied in response to a match between a feature type of the respective rule and the geographic-feature type with which the given sequence is annotated to indicate the characteristic of the corresponding entity in the physical world, translating the identified one or more terms to one or more translated terms in the second form, and transliterating at least some of the remaining terms of the plurality of terms into transliterated terms in the second form, including selecting one or more transliteration rules for application in accordance with the indicated characteristic of the entity, wherein when the given sequence of the plurality of terms is annotated with a first geographic-feature type of the plurality of geographic-feature types, a first rule of the one or more rules identifies a specific term in the input string for translation in response to a match between a first feature type of the first rule and the first geographic-feature type, and when the given sequence of the plurality of terms is annotated with a second geographic-feature type of the plurality of geographic-feature types, a second rule of the one or more rules identifies the same specific term in the input string for transliteration in response to a match between a second feature type of the second rule and the second geographic-feature type, wherein the specific term is translated or transliterated to a term having the same grammatical form as the specific term;
the method further comprising;concatenating, with one or more processors, at least the translated and transliterated terms to form a hybrid output string in the second form; and storing the hybrid output string in the database of geographic labels; and when a map of a geographic region including the entity is requested for display;
(i) retrieving the hybrid output string from the database and (ii) providing, via a network interface, the hybrid output string along with map data for presenting the map with the hybrid output string at a client device. - View Dependent Claims (2, 3, 4, 5, 6, 7, 15, 16, 17, 20, 21, 22)
-
-
8. A system comprising:
one or more computers operable to interact to perform operations comprising; receiving an input string having a plurality of terms, the input string being in a first form, wherein a given sequence of the plurality of terms refers to a geographic feature, wherein the given sequence is annotated with a geographic-feature type selected from a plurality of geographic-feature types each indicating a characteristic of an entity in the physical world corresponding to the geographic feature, and wherein the given sequence is stored in an annotated format with the geographic-feature type in a database of geographic labels prior to receiving the input string; transforming the input string from the first form to a second form, the transforming including; applying one or more rules to the input string to identify one or more terms for translation, the one or more identified terms being fewer than the plurality of terms, wherein at least some of the rules are applied in response to a match between a feature type of the respective rule and the geographic-feature type with which the given sequence is annotated to indicate the characteristic of the corresponding entity in the physical world, translating the identified one or more terms to one or more translated terms in the second form, and transliterating at least some of the remaining terms of the plurality of terms into transliterated terms in the second form, including selecting one or more transliteration rules for application in accordance with the indicated characteristic of the entity, wherein when the given sequence of the plurality of terms is annotated with a first geographic-feature type of the plurality of geographic-feature, a first rule of the one or more rules identifies a specific term in the input string for translation in response to a match between a first feature type of the first rule and the first geographic-feature type, and when the given sequence of the plurality of terms is annotated with a second geographic-feature type of the plurality of geographic-feature types, a second rule of the one or more rules identifies the same specific term in the input string for transliteration in response to a match between a second feature type of the second rule and the second geographic-feature type, wherein the specific term is translated or transliterated to a term having the same grammatical form as the specific term; concatenating at least the translated and transliterated terms to form a hybrid output string in the second form; and storing the hybrid output string in the database of geographic labels; and when map data for a geographic region including the entity is requested for display;
(i) retrieving the hybrid output string from the database and (ii) providing, via a network interface, the hybrid output string along with the requested map data for display at a client device.- View Dependent Claims (9, 10, 11, 12, 13, 14, 18, 19)
Specification