Multi-document summarization system and method
First Claim
1. A method for generating a summary of a plurality of related documents available in computer readable media in a collection comprising:
extracting phrases having focus elements from the plurality of documents;
performing phrase intersection analysis on the extracted phrases to generate a phrase intersection table;
performing temporal processing by applying a timestamp to the phrases in the phrase intersection table and ordering the phrases based on the timestamps; and
generating a summary of the plurality of related documents available in computer readable media by performing sentence generation using the phrases in the phrase intersection table which have been subject to said temporal processing.
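The four claimed steps can be pictured as a small pipeline. The following is a minimal sketch only, not the patented implementation. The `Phrase` record, the `normalize` helper, and the rule that a table entry must be supported by at least two documents are all assumptions made for illustration. The claim does not say how equivalent phrases are detected or how sentences are generated, so a lower-cased bag of words and simple concatenation stand in for those steps.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Phrase:
    text: str        # phrase text as extracted from a document
    focus: str       # hypothetical focus element, e.g. a shared entity
    doc_id: str      # identifier of the source document
    timestamp: date  # timestamp applied during temporal processing


def normalize(text: str) -> frozenset:
    # Assumption: equivalence is approximated by a lower-cased bag of words.
    return frozenset(text.lower().split())


def build_intersection_table(phrases):
    """Group phrases by (focus, normalized text).

    Assumption: only groups backed by two or more documents are kept.
    """
    groups = defaultdict(list)
    for p in phrases:
        groups[(p.focus, normalize(p.text))].append(p)
    return {
        key: group
        for key, group in groups.items()
        if len({p.doc_id for p in group}) >= 2
    }


def order_by_timestamp(table):
    """Pick the earliest phrase of each table entry, then sort by timestamp."""
    representatives = [
        min(group, key=lambda p: p.timestamp) for group in table.values()
    ]
    return sorted(representatives, key=lambda p: p.timestamp)


def generate_summary(ordered):
    # Placeholder for sentence generation: one sentence per ordered phrase.
    return " ".join(p.text.rstrip(".") + "." for p in ordered)
```

Calling `generate_summary(order_by_timestamp(build_intersection_table(phrases)))` on a list of `Phrase` records chains the last three steps. Phrase extraction itself is left out because it depends on a parser and lexical resources that the claim does not specify.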
Abstract
A summary for a collection of related documents can be generated by extracting, from the documents, phrases that include common focus elements. Phrase intersection analysis is then performed on the extracted phrases to generate a phrase intersection table, in which identical or equivalent phrases are identified. Temporal processing is performed on the phrases in the phrase intersection table to remove ambiguous time references and to sort the phrases into a temporal sequence. Sentence generation is then used to combine the phrases in the phrase intersection table into a coherent summary.
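The temporal processing step described above can be illustrated by resolving relative time expressions against each document's publication date and then sorting on the resolved dates. This is a hedged sketch under stated assumptions. The function names, the small set of supported expressions, and the rule that a bare weekday refers to the most recent such day on or before publication are hypothetical and are not taken from the patent.

```python
from datetime import date, timedelta

WEEKDAYS = ["monday", "tuesday", "wednesday", "thursday",
            "friday", "saturday", "sunday"]


def resolve_time_reference(reference: str, published: date) -> date:
    """Map a relative time expression to an absolute date (hypothetical rules)."""
    ref = reference.strip().lower()
    if ref == "today":
        return published
    if ref == "yesterday":
        return published - timedelta(days=1)
    if ref in WEEKDAYS:
        # Assumption: a bare weekday names the most recent such day
        # on or before the publication date.
        days_back = (published.weekday() - WEEKDAYS.index(ref)) % 7
        return published - timedelta(days=days_back)
    raise ValueError(f"unsupported time reference: {reference!r}")


def sort_phrases(phrases):
    """Sort (text, time_reference, publication_date) tuples chronologically."""
    stamped = [
        (resolve_time_reference(ref, pub), text) for text, ref, pub in phrases
    ]
    # Sort on the resolved date only, so ties keep their input order.
    return [text for _, text in sorted(stamped, key=lambda item: item[0])]
```

Resolving every phrase to an absolute date before sorting is what lets phrases from documents published on different days be interleaved correctly. Two documents that both say "yesterday" may be referring to different calendar days.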
22 Claims
1. A method for generating a summary of a plurality of related documents available in computer readable media in a collection comprising:
extracting phrases having focus elements from the plurality of documents;
performing phrase intersection analysis on the extracted phrases to generate a phrase intersection table;
performing temporal processing by applying a timestamp to the phrases in the phrase intersection table and ordering the phrases based on the timestamps; and
generating a summary of the plurality of related documents available in computer readable media by performing sentence generation using the phrases in the phrase intersection table which have been subject to said temporal processing.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A system for generating a summary of a plurality of related documents in a collection comprising:
a storage device for storing the documents in the collection;
a lexical database; and
a processing subsystem, the processing subsystem being operatively coupled to the storage device and the lexical database, the processing subsystem being programmed to access the plurality of related documents in the storage device and generate a summary thereof by;
using the lexical database to extract phrases having focus elements from the plurality of documents;
performing phrase intersection analysis on the extracted phrases to generate a phrase intersection table;
performing temporal processing by applying a timestamp to the phrases in the phrase intersection table and ordering the phrases based on the timestamps; and
performing sentence generation using the phrases in the phrase intersection table which have been subject to said temporal processing.
Dependent claims: 9, 10, 11, 12, 13, 14, 15
16. A computer readable media for programming a computer system to perform a method of generating a summary of a plurality of related documents in a collection comprising:
extracting phrases having focus elements from the plurality of documents;
performing phrase intersection analysis on the extracted phrases to generate a phrase intersection table;
performing temporal processing by applying a timestamp to the phrases in the phrase intersection table and ordering the phrases based on the timestamps; and
generating a summary of the plurality of related documents by performing sentence generation using the phrases in the phrase intersection table which have been subject to said temporal processing.
Dependent claims: 17, 18, 19, 20, 21, 22
Specification