Synonymous collocation extraction using translation information
Abstract
A method of automatically extracting synonymous collocations from monolingual corpora and a small bilingual corpus is proposed. The methodology includes generating candidate synonymous collocations and selecting synonymous collocations as a function of translation information, including collocation translations and probabilities. Candidate synonymous collocations with similarity scores that exceed a threshold are extracted as synonymous collocations. The extracted collocations can later be used in language generation, for example by substituting synonymous collocations in applications such as writing assistance programs.
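The selection step described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the toy collocations, the synonym table, the translation probabilities (with placeholder translation labels), the use of cosine similarity as the similarity score, and the helper names `candidates`, `cosine`, and `select`. The abstract states only that selection depends on collocation translations and probabilities and on a similarity threshold; it does not specify this particular measure.

```python
from itertools import product
from math import sqrt

# Hypothetical collocations, as if extracted from a monolingual corpus.
COLLOCATIONS = {("turn on", "light"), ("switch on", "light"), ("turn on", "charm")}

# Hypothetical synonym table used to propose candidates.
SYNONYMS = {"turn on": {"switch on"}, "light": {"lamp"}}

# Invented translation probabilities p(translation | collocation).
# "t1", "t2", "t3" are placeholder labels, not real translations.
TRANSLATIONS = {
    ("turn on", "light"): {"t1": 0.9, "t2": 0.1},
    ("switch on", "light"): {"t1": 0.8, "t2": 0.2},
    ("turn on", "charm"): {"t3": 1.0},
}


def candidates(collocations, synonyms):
    """Pair each collocation with other extracted collocations
    reachable by replacing one or both words with a synonym."""
    pairs = set()
    for head, dep in collocations:
        heads = {head} | synonyms.get(head, set())
        deps = {dep} | synonyms.get(dep, set())
        for other in product(heads, deps):
            if other != (head, dep) and other in collocations:
                pairs.add(frozenset([(head, dep), other]))
    return pairs


def cosine(p, q):
    """Cosine similarity between two sparse translation distributions."""
    dot = sum(value * q.get(key, 0.0) for key, value in p.items())
    norm = sqrt(sum(v * v for v in p.values())) * sqrt(sum(v * v for v in q.values()))
    return dot / norm if norm else 0.0


def select(pairs, translations, threshold):
    """Keep candidate pairs whose translation similarity exceeds the threshold."""
    selected = []
    for pair in pairs:
        a, b = sorted(pair)
        score = cosine(translations.get(a, {}), translations.get(b, {}))
        if score > threshold:
            selected.append((a, b, score))
    return selected


result = select(candidates(COLLOCATIONS, SYNONYMS), TRANSLATIONS, threshold=0.5)
```

With this toy data, only the pair ("switch on", "light") and ("turn on", "light") is proposed as a candidate, and it passes the threshold because the two collocations share nearly the same translation distribution.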
34 Claims
1. A computer readable medium including instructions readable by a computer which, when implemented, cause the computer to generate synonymous collocations comprising the steps of:
extracting collocations from a monolingual corpus;
generating candidate synonymous collocations from the extracted collocations; and
selecting synonymous collocations from the candidate synonymous collocations as a function of translation information.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18
19. A computer readable medium including instructions readable by a computer which, when implemented, cause the computer to generate a sentence comprising the steps of:
parsing input text into at least one collocation;
obtaining synonymous collocations selected as a function of translation information; and
selecting at least one synonymous collocation for said at least one collocation.
Dependent claims: 20, 21, 22
23. A method of constructing synonymous collocation information comprising the steps of:
extracting collocations from an unprocessed language corpus;
generating candidate synonymous collocations from the extracted collocations; and
selecting synonymous collocations from the candidate synonymous collocations based on translation information.
Dependent claims: 24, 25, 26, 27, 28, 29, 30
31. A method of generating language comprising the steps of:
parsing an input sentence into collocations;
accessing a database of synonymous collocations generated using translation information; and
substituting parsed collocations in the input sentence with synonymous collocations from the database.
Dependent claims: 32, 33, 34
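The language generation steps recited in claim 31 (parse, look up, substitute) can be sketched as follows. This is a hedged illustration only: the `SYNONYM_DB` contents, the function names, and the regular-expression "parser" that recognizes only the pattern "<head> the <word>" are all hypothetical stand-ins. The claims do not say how parsing is performed or how the database is structured.

```python
import re

# Hypothetical database: collocation -> list of synonymous collocations.
SYNONYM_DB = {("turn on", "light"): [("switch on", "light")]}


def parse_collocations(sentence, known_heads):
    """Toy stand-in for a real parser: finds '<head> the <word>' patterns."""
    found = []
    for head in known_heads:
        pattern = rf"\b{re.escape(head)} the (\w+)"
        for match in re.finditer(pattern, sentence):
            found.append((head, match.group(1)))
    return found


def substitute(sentence, db):
    """Replace each parsed collocation that has an entry in the database
    with its first listed synonymous collocation."""
    heads = {head for head, _ in db}
    for head, dep in parse_collocations(sentence, heads):
        alternatives = db.get((head, dep))
        if alternatives:
            new_head, new_dep = alternatives[0]
            sentence = sentence.replace(
                f"{head} the {dep}", f"{new_head} the {new_dep}"
            )
    return sentence


rewritten = substitute("Please turn on the light.", SYNONYM_DB)
unchanged = substitute("Please turn on the radio.", SYNONYM_DB)
```

Here `rewritten` becomes "Please switch on the light.", while the second sentence is left as it was because ("turn on", "radio") has no entry in the hypothetical database.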
Specification