System and method of prioritizing automated translation of communications from a first human language to a second human language
First Claim
1. A method of prioritizing for automated translation from a first human language to a second human language communications relating to at least one predetermined topic, the method comprising:
- capturing and inputting into a data processing system a translation-candidate communication rendered in the first human language and storing in computer memory associated with the data processing system, in a predetermined machine-readable format, a first data set representative of the contents of the translation-candidate communication in the first human language;
maintaining in computer memory a first-language prioritization protocol including data indicative of first-language extraction rules according to which a selected first-data-set sub-portion representative of a communication sub-portion of the translation-candidate communication is algorithmically one of (i) extracted and (ii) rejected for translation depending on whether the selected communication sub-portion exceeds a first relevancy threshold indicative of the relatedness of the communication sub-portion to the at least one predetermined topic;
consulting the first-language prioritization protocol and algorithmically analyzing, in accordance with the first-language extraction rules, the first data set in order to determine whether at least one communication sub-portion associated with the first data set exceeds the first relevancy threshold; and
one of(i) selecting for translation to the second human language each communication sub-portion algorithmically determined to exceed the first relevancy threshold and (ii) rejecting for translation to the second human language each communication sub-portion algorithmically determined not to exceed the first relevancy threshold;
wherein(a) as to a communication sub-portion selected for translation, the method further comprisescausing that sub-portion of the machine-readable first data set representative of the relevant communication sub-portion in the first human language to be translated to a translated-data-set sub-portion representative, in a machine-readable format, of the relevant communication sub-portion in the second human language;
converting at least a portion of the translated-data-set sub-portion into a converted-data-set sub-portion representative of at least a portion of the translated-data-set sub-portion in a human-intelligible format; and
outputting through a machine-to-human interface the converted-data-set sub-portion; and
(b) as to a communication sub-portion rejected for translation, the method further comprises one of (i) deleting from and (ii) archiving in computer memory the first-data-set sub-portion representative of that communication sub-portion.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of prioritizing the automated translation of communications relating to a predetermined topic includes capturing and inputting into a data processing system a translation-candidate communication rendered in a first human language. A first data set representative of the translation-candidate communication is stored in computer memory and parsed into communication sub-portions. Communication sub-portions are algorithmically selected for translation depending on their relatedness to the predetermined topic as determined by first-language extraction rules. Each selected communication sub-portion is translated to a translated-data-set sub-portion representative of that selected communication sub-portion in the second human language. Translated-data-set sub-portions are subjected to a secondary filtration process in accordance with which their relatedness to the predetermined topic is determined by second-language extraction rules. Those translated-data-set sub-portions determined to contain information sufficiently related to the predetermined topic, or a related sub-topic, are converted to converted-data-set sub-portions representative of the translated-data-set sub-portions in a human-intelligible format and outputted through a machine-to-human interface.
17 Citations
15 Claims
-
1. A method of prioritizing for automated translation from a first human language to a second human language communications relating to at least one predetermined topic, the method comprising:
-
capturing and inputting into a data processing system a translation-candidate communication rendered in the first human language and storing in computer memory associated with the data processing system, in a predetermined machine-readable format, a first data set representative of the contents of the translation-candidate communication in the first human language; maintaining in computer memory a first-language prioritization protocol including data indicative of first-language extraction rules according to which a selected first-data-set sub-portion representative of a communication sub-portion of the translation-candidate communication is algorithmically one of (i) extracted and (ii) rejected for translation depending on whether the selected communication sub-portion exceeds a first relevancy threshold indicative of the relatedness of the communication sub-portion to the at least one predetermined topic; consulting the first-language prioritization protocol and algorithmically analyzing, in accordance with the first-language extraction rules, the first data set in order to determine whether at least one communication sub-portion associated with the first data set exceeds the first relevancy threshold; and
one of(i) selecting for translation to the second human language each communication sub-portion algorithmically determined to exceed the first relevancy threshold and (ii) rejecting for translation to the second human language each communication sub-portion algorithmically determined not to exceed the first relevancy threshold;
wherein(a) as to a communication sub-portion selected for translation, the method further comprises causing that sub-portion of the machine-readable first data set representative of the relevant communication sub-portion in the first human language to be translated to a translated-data-set sub-portion representative, in a machine-readable format, of the relevant communication sub-portion in the second human language; converting at least a portion of the translated-data-set sub-portion into a converted-data-set sub-portion representative of at least a portion of the translated-data-set sub-portion in a human-intelligible format; and outputting through a machine-to-human interface the converted-data-set sub-portion; and (b) as to a communication sub-portion rejected for translation, the method further comprises one of (i) deleting from and (ii) archiving in computer memory the first-data-set sub-portion representative of that communication sub-portion. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method of prioritizing for automated translation from a first human language to a second human language communications relating to at least one predetermined topic, the method comprising:
-
capturing and inputting into a data processing system a translation-candidate communication rendered in the first human language and storing in computer memory associated with the data processing system, in a predetermined machine-readable format, a first data set representative of the contents of the translation-candidate communication in the first human language; parsing the first data set into first-data-set sub-portions correspondingly representative of communication sub-portions of the translation-candidate communication; maintaining in computer memory a set of relevancy thresholds including at least first and second relevancy thresholds indicative of the relatedness of a communication sub-portion to the at least one predetermined topic, wherein the first relevancy threshold indicates a greater degree of relatedness to the at least one predetermined topic than does the second relevancy threshold; maintaining in computer memory a first-language prioritization protocol including data indicative of first-language extraction rules according to which a selected first-data-set sub-portion is algorithmically one of (i) extracted and prioritized for translation;
(ii) extracted and de-prioritized for translation and (iii) rejected for translation depending on whether the selected communication sub-portion, respectively, (a) exceeds the first relevancy threshold, (b) exceeds the second relevancy threshold, but not the first relevancy threshold, and (c) exceeds neither of the first and second relevancy thresholds;consulting the first-language prioritization protocol and algorithmically analyzing, in accordance with the first-language extraction rules, the first data set in order to determine whether at least one communication sub-portion of the translation-candidate communication associated with the first data set exceeds either of the first and second relevancy thresholds; and
one of(i) selecting for translation to the second human language each communication sub-portion algorithmically determined to exceed either of the first and second relevancy thresholds and (ii) rejecting for translation to the second human language each communication sub-portion algorithmically determined not to exceed either of the first and second relevancy thresholds;
wherein(a) as to each communication sub-portion determined to exceed at least one of the first and second relevancy thresholds, the method further comprises causing the first-data-set sub-portion representative of that communication sub-portion in the first human language to be translated to a translated-data-set sub-portion representative, in a machine-readable format, of that communication sub-portion in the second human language; (b) a first-data-set sub-portion representative of a communication sub-portion determined to exceed the first relevancy threshold is translated to a translated-data-set sub-portion prior to a first-data-set sub-portion representative of a communication sub-portion determined to exceed the second relevancy threshold and not the first relevancy threshold; and (c) as to a communication sub-portion rejected for translation, the method further comprises one of (i) deleting and (ii) archiving in computer memory the first-data-set sub-portion representative of that communication sub-portion. - View Dependent Claims (8, 9, 10, 11, 12, 13, 14, 15)
-
Specification