LANGUAGE PROCESSING RESOURCES FOR AUTOMATED MOBILE LANGUAGE TRANSLATION
First Claim
1. A method of representing a language comprising at least two words and at least one translation of a word sequence on a device having a processor and a memory, the method comprising:
- executing on the processor instructions configured to:
store in the memory a word index comprising, for respective words of the language:
the word stored at an index location in the word index, and
a word mapping that identifies the index location of the word in the word index; and
store in the memory a translation mapping identifying, for a word index sequence comprising at least one index location, the translation of the words respectively located at the index locations of the word index.
Abstract
Automated language translation often involves language translation resources of significant size (e.g., 50-gigabyte phrase tables) and significant computational power exceeding the capabilities of many mobile devices. Remotely accessible servers capable of near-realtime, automated translation may be inaccessible or prohibitively costly while traveling abroad. Presented herein are adaptations of language translation techniques for offline mobile devices involving reducing the size and raising the efficiency of the language modeling resources. A word index may be provided that stores respective string representations of the words of a language, and maps respective words to a location (e.g., address or offset) of respective word representations within the word index. Language translation resources (e.g., phrase tables) may then specify logical relationships using the word index addresses of the involved words, rather than the string equivalents. This technique significantly condenses the language resources and provides faster, bidirectional access to the word representations of the language.
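The word index described in the abstract can be illustrated with a minimal sketch. It assumes, for illustration only, that each word is stored once as a null-terminated UTF-8 string in a single byte buffer and that a word's index location is its byte offset in that buffer. The class name `WordIndex` and its methods are hypothetical and are not taken from the patent.

```python
class WordIndex:
    """Illustrative word index: words packed into one buffer, located by byte offset."""

    def __init__(self, words):
        self._buffer = bytearray()   # string representations of all words
        self._offset_of = {}         # word mapping: word -> index location (offset)
        for word in words:
            if word in self._offset_of:
                continue             # store each distinct word only once
            self._offset_of[word] = len(self._buffer)
            self._buffer += word.encode("utf-8") + b"\x00"

    def location(self, word):
        """Word -> index location."""
        return self._offset_of[word]

    def word_at(self, offset):
        """Index location -> word (read up to the null terminator)."""
        end = self._buffer.index(b"\x00", offset)
        return self._buffer[offset:end].decode("utf-8")


index = WordIndex(["the", "cat", "sat", "the"])
print(index.location("cat"))                   # 4
print(index.word_at(index.location("sat")))    # sat
```

Because lookups work in both directions, other resources can refer to a word by a small integer instead of repeating its string, which is the source of the size reduction the abstract describes.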
21 Citations
20 Claims
1. A method of representing a language comprising at least two words and at least one translation of a word sequence on a device having a processor and a memory, the method comprising:
executing on the processor instructions configured to:
store in the memory a word index comprising, for respective words of the language:
the word stored at an index location in the word index, and
a word mapping that identifies the index location of the word in the word index; and
store in the memory a translation mapping identifying, for a word index sequence comprising at least one index location, the translation of the words respectively located at the index locations of the word index.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
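As an illustration of the translation mapping recited in claim 1, the following sketch keys a phrase table by tuples of index locations rather than by strings. It assumes, for simplicity, that an index location is a position in a Python list; the sample vocabulary, the `encode` helper, and all variable names are hypothetical.

```python
# Word indices: the word is stored at an index location (its list position).
source_words = ["good", "morning", "thank", "you"]
target_words = ["bonjour", "merci"]

# Word mappings: identify the index location of each word.
source_location = {word: i for i, word in enumerate(source_words)}
target_location = {word: i for i, word in enumerate(target_words)}


def encode(phrase, location):
    """Turn a phrase into its word index sequence (a tuple of index locations)."""
    return tuple(location[word] for word in phrase.split())


# Translation mapping: word index sequence -> index locations of the translation.
translation_mapping = {
    encode("good morning", source_location): encode("bonjour", target_location),
    encode("thank you", source_location): encode("merci", target_location),
}

print(translation_mapping)  # {(0, 1): (0,), (2, 3): (1,)}
```

Each phrase pair is stored as a handful of small integers, and the strings themselves appear only once, in the word indices.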
10. A method of translating a word sequence from a source language to a target language on a device having a processor and a memory comprising a source word index for the source language, a target word index for the target language, and a translation mapping, the method comprising:
executing on the processor instructions configured to:
for respective words of the word sequence, identify the source index location of the word in the source word index;
using the translation mapping, identify a translation of the source index locations of the words of the word sequence, the translation comprising at least one target index location in the target word index;
for respective target index locations, retrieve a translated word in the target language at the target index location in the target word index; and
present the translated words in the target language.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18, 19
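The steps of claim 10 can be sketched as follows. The claim does not say how a word sequence is segmented into phrases, so this sketch assumes a greedy longest-match strategy; that strategy, the sample data, and the `translate` function with its `max_len` parameter are all illustrative assumptions rather than the patented method.

```python
def translate(sentence, source_location, mapping, target_words, max_len=3):
    """Translate by index lookup; greedy longest-match segmentation is an assumption."""
    # Step 1: identify the source index location of each word.
    locations = [source_location[word] for word in sentence.lower().split()]

    # Step 2: use the translation mapping on sequences of source index locations.
    target_locations = []
    i = 0
    while i < len(locations):
        for n in range(min(max_len, len(locations) - i), 0, -1):
            key = tuple(locations[i:i + n])
            if key in mapping:
                target_locations.extend(mapping[key])
                i += n
                break
        else:
            raise KeyError(f"no translation for index sequence at position {i}")

    # Step 3: retrieve the translated word at each target index location.
    # Step 4: return the words so the caller can present them.
    return " ".join(target_words[t] for t in target_locations)


# Hypothetical sample resources.
source_words = ["good", "morning", "thank", "you"]
target_words = ["bonjour", "merci"]
source_location = {word: i for i, word in enumerate(source_words)}
mapping = {(0, 1): (0,), (2, 3): (1,)}

print(translate("Good morning thank you", source_location, mapping, target_words))
# bonjour merci
```

Only the first and last steps touch strings; the phrase-table lookup in between operates entirely on integer tuples.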
20. A nonvolatile computer-readable storage device comprising instructions that, when executed on a processor of a device having a memory, cause the device to translate word sequences in a source language to translations in a target language by:
storing in the memory a word index comprising, for respective words of the language:
the word stored at an index location in the word index, and
a word mapping that identifies the index location of the word in the word index;
storing in the memory a translation mapping identifying, for a word index sequence comprising at least two word indices, the translation of the words of the word sequence mapping to the respective index locations of the words of the word index sequence; and
upon receiving a word sequence in the source language to be translated into the target language:
for respective words of the word sequence, identifying the source index location of the word in the source word index;
using the translation mapping, identifying a translation of the source index locations of the words of the word sequence, the translation comprising at least one target index location in the target word index;
for respective target index locations, retrieving a translated word in the target language at the target index location in the target word index; and
presenting the translated words in the target language.
Specification