Method for producing summaries of text document
First Claim
1. A computer method for preparing a summary string from a source document of encoded text, the method comprising the steps of:
a) comparing a training set of encoded text documents with manually generated summary strings associated therewith to learn probabilities that a given summary word or phrase will appear in summary strings given a source word or phrase appears in an encoded text document;
b) analyzing the manually generated summary strings to learn probabilities that each word or phrase in the manually generated summary strings follows another word or phrase; and
c) constructing from the source document a summary string containing summary words or phrases based on the probabilities of appearing in a summary string established in step a) and the probabilities of following another word or phrase established in step b).
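The three steps of the claim can be illustrated with a small sketch. It is not the patented implementation, only one simple reading of the claim under stated assumptions: whitespace tokenization, single words rather than phrases, step a) reduced to the case where the summary word is the same as the source word, an unsmoothed bigram model for step b), and a greedy left-to-right search for step c). All function names and the toy training data are hypothetical.

```python
from collections import Counter, defaultdict
import math

START = "<s>"  # hypothetical start-of-summary marker


def learn_content_probs(pairs):
    """Step a): estimate P(word appears in summary | word appears in document)."""
    in_doc, in_both = Counter(), Counter()
    for doc, summary in pairs:
        doc_words = set(doc.lower().split())
        summary_words = set(summary.lower().split())
        for w in doc_words:
            in_doc[w] += 1
            if w in summary_words:
                in_both[w] += 1
    return {w: in_both[w] / in_doc[w] for w in in_doc}


def learn_bigram_probs(summaries):
    """Step b): estimate P(next word | previous word) from the manual summaries."""
    follows = defaultdict(Counter)
    for summary in summaries:
        words = [START] + summary.lower().split()
        for prev, nxt in zip(words, words[1:]):
            follows[prev][nxt] += 1
    return {
        prev: {w: c / sum(counts.values()) for w, c in counts.items()}
        for prev, counts in follows.items()
    }


def build_summary(doc, content, bigram, length=3, floor=1e-6):
    """Step c): greedily add the word with the best combined log-probability."""
    candidates = set(doc.lower().split())
    prev, out = START, []
    for _ in range(length):
        remaining = sorted(candidates - set(out))  # sorted for determinism
        if not remaining:
            break
        best = max(
            remaining,
            key=lambda w: math.log(max(content.get(w, 0.0), floor))
            + math.log(max(bigram.get(prev, {}).get(w, 0.0), floor)),
        )
        out.append(best)
        prev = best
    return " ".join(out)


# Toy training set (invented for illustration only).
training = [
    ("the stock market fell sharply on monday", "stock market fell"),
    ("the stock market rose on friday after earnings", "stock market rose"),
]
content = learn_content_probs(training)
bigram = learn_bigram_probs(s for _, s in training)
print(build_summary("on tuesday the stock market fell again", content, bigram))
```

The greedy search is chosen only for brevity; the claim does not say how the two sets of probabilities are combined or searched, so a beam or dynamic-programming search would fit the same wording.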
Abstract
A computer method for preparing a summary string from a source document of encoded text. The method comprises comparing a training set of encoded text documents with manually generated summary strings associated therewith to learn probabilities that a given summary word or phrase will appear in summary strings given a source word or phrase appears in encoded text documents and constructing from the source document a summary string containing summary words or phrases having the highest probabilities of appearing in a summary string based on the learned probabilities established in the previous step.
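The selection step described in the abstract, keeping the source words most likely to appear in a summary, might look like the sketch below. It assumes the learned probabilities are already available as a plain dictionary (the values shown are invented), treats words rather than phrases, and keeps the chosen words in document order; the function name is hypothetical.

```python
def top_k_summary_words(doc, content_probs, k=3):
    """Return the k source words with the highest learned summary probability."""
    # Unique words in first-occurrence order.
    words = []
    for w in doc.lower().split():
        if w not in words:
            words.append(w)
    # Rank by learned probability; unseen words get probability 0.
    ranked = sorted(words, key=lambda w: content_probs.get(w, 0.0), reverse=True)
    chosen = set(ranked[:k])
    # Emit the chosen words in the order they appear in the document.
    return [w for w in words if w in chosen]


# Invented probabilities standing in for the output of the training step.
probs = {"stock": 0.9, "market": 0.8, "fell": 0.7, "the": 0.05}
selected = top_k_summary_words("the stock market fell on monday", probs)
```

Selecting on content probability alone ignores word order, which is the gap the word-sequence probabilities of claim step b) address.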
14 Claims
Specification