System and method for summarization combining natural language generation with structural analysis
Abstract
A system and method of summarizing text comprising performing structural summarization of a source text and compressing at least one relationship in the structural summarization.
78 Citations
36 Claims
1. A method of summarizing source text, comprising:
- ranking two or more nodes in a discourse tree, wherein the ranking is based on the depth of the node, wherein the depth of a node (N) is calculated from the number of subordinated edges of subordination subtree nodes plus one between N and the root, and including removing all nodes having a depth greater than a specified depth;
- performing structural summarization based on the ranking of nodes of the source text; and
- compressing at least one relationship in the structural summarization.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11

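The depth-based ranking and pruning recited in claim 1 can be illustrated with a small sketch. It assumes one reading of the depth definition: the root has depth 1, and depth increases by one each time the path from the root crosses a subordinated edge (taken here to be any child of a subordination node other than its first, dominant child). The `Node` class, the `kind` labels, and all function names are hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Node:
    label: str                      # assumed unique within the tree
    kind: str = "leaf"              # "leaf", "C" (coordination) or "S" (subordination)
    children: List["Node"] = field(default_factory=list)


def _child_depth(parent: Node, index: int, depth: int) -> int:
    # Assumption: in an "S" node the first child is dominant and every
    # later child hangs from a subordinated edge, which adds one to the depth.
    subordinated = parent.kind == "S" and index > 0
    return depth + 1 if subordinated else depth


def assign_depths(root: Node) -> Dict[str, int]:
    """Map each node label to its depth, with the root at depth 1."""
    depths: Dict[str, int] = {}

    def visit(node: Node, depth: int) -> None:
        depths[node.label] = depth
        for i, child in enumerate(node.children):
            visit(child, _child_depth(node, i, depth))

    visit(root, 1)
    return depths


def prune(root: Node, max_depth: int) -> Optional[Node]:
    """Return a copy of the tree without nodes deeper than max_depth."""

    def visit(node: Node, depth: int) -> Optional[Node]:
        if depth > max_depth:
            return None
        kept = []
        for i, child in enumerate(node.children):
            copy = visit(child, _child_depth(node, i, depth))
            if copy is not None:
                kept.append(copy)
        return Node(node.label, node.kind, kept)

    return visit(root, 1)


def leaf_labels(node: Optional[Node]) -> List[str]:
    """Collect the labels of the leaf units that survive, left to right."""
    if node is None:
        return []
    if node.kind == "leaf":
        return [node.label]
    labels: List[str] = []
    for child in node.children:
        labels.extend(leaf_labels(child))
    return labels


def build_example_tree() -> Node:
    # S1 dominates unit a; a coordination of b and (c with subordinated d)
    # is subordinated to a.
    s2 = Node("S2", "S", [Node("c"), Node("d")])
    c1 = Node("C1", "C", [Node("b"), s2])
    return Node("S1", "S", [Node("a"), c1])
```

With this example tree, a cutoff of 2 keeps units a, b, and c and drops d, the only unit reached through two subordinated edges; a cutoff of 1 keeps only a.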
12. A method of summarizing source text, comprising:
- applying default unification to at least one coordination-type relationship in a structural representation of the source text to create at least one compressed representation, wherein default unification involves ordering of expressions of the text from most specific to most general; and
- performing structural summarization of the source text, the structural summarization to be accomplished by ranking two or more nodes in a discourse tree, wherein the ranking is based on the depth of the node, wherein the depth of a node (N) is calculated from the number of subordinated edges of subordination subtree nodes plus one between N and the root, and including removing all nodes having a depth greater than a specified depth.
Dependent claims: 13, 14, 15, 16, 17, 18, 19, 20, 21

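Claim 12 characterizes default unification only as involving an ordering of expressions from most specific to most general. The sketch below shows a simplified, priority-based form of that idea under several assumptions that are not from the patent: each expression is a flat feature dictionary, specificity is approximated by the number of features an expression specifies, and a more general expression may only fill in features that more specific expressions left open. The name `default_unify` and the feature names are hypothetical.

```python
from typing import Dict, List

Features = Dict[str, str]


def default_unify(expressions: List[Features]) -> Features:
    """Merge feature dictionaries, letting more specific ones take priority.

    Assumption: an expression with more features counts as more specific.
    """
    # Most specific first; sorted() is stable, so ties keep their input order.
    ordered = sorted(expressions, key=len, reverse=True)
    result: Features = {}
    for expression in ordered:
        for feature, value in expression.items():
            # A value set by a more specific expression is never overridden;
            # more general expressions act only as defaults.
            result.setdefault(feature, value)
    return result


merged = default_unify([
    {"agent": "person", "time": "yesterday"},
    {"agent": "John", "action": "buy", "object": "car"},
])
```

Here `merged` keeps `"agent": "John"` from the more specific expression and takes `"time": "yesterday"` from the more general one as a default.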
22. A method for selecting coordination relationships for compression in a discourse, the method comprising:
- automatically segmenting the discourse into units, wherein the segmenting is based on a technique selected from the group consisting of statistical methods, shallow parsing and deep parsing, wherein the discourse can be represented by two or more nodes in a discourse tree;
- ranking two or more discourse units, wherein the ranking is based on the depth of the node, wherein the depth of a node (N) is calculated from the number of subordinated edges of subordination subtree nodes plus one between N and the root;
- pruning to remove all nodes having a depth greater than a specified depth;
- structurally summarizing based on the ranking of the discourse units, wherein the discourse units segments are small enough for the structural summarization to extract semantic meaning from the discourse; and
- selecting at least one coordination relationship in the representation, wherein the at least one coordination relationship is susceptible to compression.
Dependent claims: 23, 24

25. A system for summarizing source text comprising:
- a means for ranking two or more nodes in a discourse tree, wherein the ranking is based on the depth of the node, wherein the depth of a node (N) is calculated from the number of subordinated edges of subordination subtree nodes plus one between N and the root;
- a means for performing structural summarization on the ranking of nodes of the source text, including means for pruning to remove all nodes having a depth greater than a specified depth;
- a means for compressing at least one coordination relationship in the structural summarization; and
- a means for generating a text summary based on the structural summarization and the at least one compressed coordination relationship, wherein by selecting the text summary the source text underlying the at least one compressed relationship can be accessed.
Dependent claims: 26, 27, 28

29. A machine readable medium having instructions stored thereon to generate a text summary that when executed by a processor cause a system to:
- represent a source text as discourse units;
- generate a discourse tree based on the discourse units;
- rank nodes in the discourse tree, wherein the ranking is based on the depth of the node, wherein the depth of a node (N) is calculated from the number of subordinated edges of subordination subtree nodes plus one between N and the root;
- prune to remove all nodes having a depth greater than a specified depth;
- perform structural summarization based on the pruned ranking of the discourse units;
- compress at least one relationship in the structural summarization; and
- generate a text summary based on the structural summarization and the at least one compressed relationship, wherein by selecting the text summary the source text underlying the at least one compressed relationship can be accessed.
Dependent claims: 30, 31, 32, 33, 34, 35, 36

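The last element of claim 29, a text summary from which the source text underlying a compressed relationship can be accessed, can be sketched as a summary item that carries its source sentences along with it. The compression rule used here (factoring out a shared leading word sequence and joining the remainders with "and") is an assumed stand-in for whatever compression the claimed system applies; `SummaryItem`, `compress_coordination`, and `expand` are hypothetical names.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class SummaryItem:
    text: str                  # compressed text shown in the summary
    sources: Tuple[str, ...]   # original sentences behind the compression


def compress_coordination(sentences: List[str]) -> SummaryItem:
    """Compress coordinated sentences that share a leading word sequence."""
    token_lists = [s.rstrip(".").split() for s in sentences]

    # Count how many leading tokens all sentences have in common.
    prefix_len = 0
    for tokens in zip(*token_lists):
        if len(set(tokens)) != 1:
            break
        prefix_len += 1

    prefix = token_lists[0][:prefix_len] if token_lists else []
    tails = [" ".join(tokens[prefix_len:]) for tokens in token_lists]
    tails = [tail for tail in tails if tail]

    parts = prefix + ([" and ".join(tails)] if tails else [])
    return SummaryItem(" ".join(parts) + ".", tuple(sentences))


def expand(item: SummaryItem) -> List[str]:
    """Simulate selecting a summary item: return its underlying source text."""
    return list(item.sources)


item = compress_coordination(["John bought apples.", "John bought pears."])
```

In this sketch `item.text` is the compressed sentence, and `expand(item)` returns the two original sentences, standing in for the access to source text that the claim describes.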
Specification