Task parallelization in a text-to-text system
First Claim
1. A method comprising:
dividing a corpus of information among multiple work units and carrying out a text-to-text operation in each of said work units; and
maintaining a single parameter table for all the work carried out in all the work units, wherein said parameter table is a probability table with probabilities of word to word translation.
Abstract
Parallelization of word alignment for a text-to-text operation. The training data is divided into multiple groups, and training is carried out on each group using separate processors. Several techniques can be used to increase processing speed. The hookups can be done only once for all of the multiple iterations. Moreover, the parallel operations can be applied only to the counts, since collecting them may be the most time-consuming part.
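As an illustration of the idea in the abstract, the following is a minimal sketch, not the patented implementation. It assumes an IBM Model 1 style expectation-maximization loop, which the abstract does not name, and every identifier in it (`collect_counts`, `train`, `t_table`, `num_units`) is hypothetical. Threads stand in for the separate processors; a process pool or a cluster of machines could take their place. Only the count collection runs in parallel, and a single probability table is rebuilt from the merged counts after each iteration.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def collect_counts(shard, t_table):
    """Work-unit step: expected word-pair counts for one shard of the corpus."""
    counts = defaultdict(float)
    for src_words, tgt_words in shard:
        for t in tgt_words:
            # Normalizer over all source words that could explain target word t.
            z = sum(t_table[(s, t)] for s in src_words)
            for s in src_words:
                counts[(s, t)] += t_table[(s, t)] / z
    return counts


def train(corpus, num_units=2, iterations=5):
    """Sharded counting with one shared word-to-word probability table."""
    # Divide the corpus among the work units (round-robin split, an assumption).
    shards = [corpus[i::num_units] for i in range(num_units)]

    # Single parameter table: t_table[(s, t)] approximates P(t | s),
    # initialized uniformly over the target vocabulary.
    tgt_vocab = {t for _, tgt in corpus for t in tgt}
    t_table = {}
    for src, tgt in corpus:
        for s in src:
            for t in tgt:
                t_table[(s, t)] = 1.0 / len(tgt_vocab)

    with ThreadPoolExecutor(max_workers=num_units) as pool:
        for _ in range(iterations):
            # Only the count collection is parallel; each unit reads the same table.
            partials = list(pool.map(collect_counts, shards, [t_table] * num_units))

            # Master step: merge the partial counts, then renormalize per source word.
            merged = defaultdict(float)
            totals = defaultdict(float)
            for partial in partials:
                for (s, t), c in partial.items():
                    merged[(s, t)] += c
                    totals[s] += c
            t_table = {(s, t): c / totals[s] for (s, t), c in merged.items()}
    return t_table
```

Because the merged counts are sums over sentence pairs, the resulting table should not depend on how many work units the corpus is split across, apart from floating-point rounding.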
20 Claims
1. A method comprising:
dividing a corpus of information among multiple work units and carrying out a text-to-text operation in each of said work units; and
maintaining a single parameter table for all the work carried out in all the work units, wherein said parameter table is a probability table with probabilities of word to word translation.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
13. A computer system, comprising:
a master computer, connected to a corpus of training information about text-to-text operations, having a plurality of work unit computers, having separate processors from said master computer, and said master computer running a routine that maintains a table of information related to training based on said corpus, a routine that provides separated portions of said corpus and said work unit computers, and accumulates information indicative of training each of said work unit computers and maintains said table of information, wherein said table of information includes a probability of word to word translation.
Dependent claims: 14, 15, 16, 17
18. A method, comprising:
dividing a training corpus into at least a plurality of groups;
carrying out a training operation for a text to text application substantially simultaneously on each of said plurality of groups, using separate processors for each of said groups and using a single table of information indicative of word probabilities, for each of said groups, and using said training operation to update said single probability table based on training information obtained from each of said groups, wherein said single probability table comprises probabilities of word to word translations.
Dependent claims: 19, 20
Specification