Encoding and adaptive, scalable accessing of distributed models
First Claim
Patent Images
1. A system comprising:
- a translation server operable to perform machine translation obtaining translation model data from a translation model for translation between a source language and a target language and language model data from a language model for the target language, the translation server further operable to translate text in the source language into the target language using the obtained translation model data and language model data,the translation server comprising;
a request queue operable to store requests for language model data to be obtained for translating a segment in the source language, anda segment translation server cache operable to store language model data obtained by the requests by the translation server,wherein the translation server is further operable to;
process the translation of the segment using language model data from a second language model for the target language to produce an initial translation of the segment before the requests for the language data in the language model in the request queue are sent out,update the requests for the language model data of the language model in the request queue based on the initial translation,send out the updated requests in the request queue to obtain language model data from the language model for processing the initial translation, andafter the updated requests are served and the data for the updated requests are stored in the segment translation server cache, process the initial translation with the data for the updated requests to produce a final translation.
3 Assignments
0 Petitions
Accused Products
Abstract
Systems, methods, and apparatus for accessing distributed models in automated machine processing, including using large language models in machine translation, speech recognition and other applications.
89 Citations
18 Claims
-
1. A system comprising:
-
a translation server operable to perform machine translation obtaining translation model data from a translation model for translation between a source language and a target language and language model data from a language model for the target language, the translation server further operable to translate text in the source language into the target language using the obtained translation model data and language model data, the translation server comprising; a request queue operable to store requests for language model data to be obtained for translating a segment in the source language, and a segment translation server cache operable to store language model data obtained by the requests by the translation server, wherein the translation server is further operable to; process the translation of the segment using language model data from a second language model for the target language to produce an initial translation of the segment before the requests for the language data in the language model in the request queue are sent out, update the requests for the language model data of the language model in the request queue based on the initial translation, send out the updated requests in the request queue to obtain language model data from the language model for processing the initial translation, and after the updated requests are served and the data for the updated requests are stored in the segment translation server cache, process the initial translation with the data for the updated requests to produce a final translation. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
-
a translation server operable to perform machine translation obtaining translation model data from a translation model for translation between a source language and a target language and language model data from a language model for the target language, the translation server further operable to translate text in the source language into the target language using the obtained translation model data and language model data, the translation server comprising; a request queue operable to store requests for language model data to be obtained for translating a segment in the source language, a segment translation server cache operable to store language model data obtained by the requests by the translation server; and a second segment translation server cache storing a selected portion of the language model, wherein the translation server is operable to; process the translation of the segment using language model data from a second language model for the target language to produce an initial translation of the segment before the requests for the language data in the language model in the request queue are sent out, update the requests for the language model data of the language model in the request queue based on the initial translation, send out the updated requests in the request queue to obtain language model data from the language model for processing the initial translation, and after the updated requests are served and the data for the updated requests are stored in the segment translation server cache, process the initial translation with the data for the updated requests to produce a final translation, after completing translation of the segment, delete data in the segment translation server cache and retain the selected portion of the language model in the second segment translation server cache. - View Dependent Claims (9, 10, 11, 12)
-
-
13. A system comprising:
-
a translation server operable to perform machine translation obtaining translation model data from a translation model for translation between a source language and a target language and language model data from a language model for the target language, the translation server further operable to translate text in the source language into the target language using the obtained translation model data and language model data, the translation server comprising; a request queue operable to store requests for language model data to be obtained for translating a segment in the source language, a segment translation server cache operable to store language model data obtained by the requests by the translation server; and a second segment translation server cache storing a selected portion of the language model, wherein the translation server is operable to; periodically delete data in the segment translation server cache, process the translation of the segment using language model data from a second language model for the target language to produce an initial translation of the segment before the requests for the language data in the language model in the request queue are sent out, update the requests for the language model data of the language model in the request queue based on the initial translation, send out the updated requests in the request queue to obtain language model data from the language model for processing the initial translation, and after the updated requests are served and the data for the updated requests are stored in the segment translation server cache, process the initial translation with the data for the updated requests to produce a final translation. - View Dependent Claims (14, 15, 16, 17, 18)
-
Specification