Discourse parsing and summarization
First Claim
1. A computer-implemented method of determining discourse structures, the method comprising:
- generating a set of one or more discourse parsing decision rules based on a training set; and
determining a discourse structure for an input text segment by applying the generated set of discourse parsing decision rules to the input text segment.
1 Assignment
0 Petitions
Accused Products
Abstract
A discourse structure for an input text segment is determined by generating a set of one or more discourse parsing decision rules based on a training set, and determining a discourse structure for the input text segment by applying the generated set of discourse parsing decision rules to the input text segment. A tree structure is summarized by generating a set of one or more summarization decision rules based on a training set, and compressing the tree structure by applying the generated set of summarization decision rules to the tree structure. Alternatively, summarization is accomplished by parsing an input text segment to generate a parse tree for the input segment, generating a plurality of potential solutions, applying a statistical model to determine a probability of correctness for each of potential solution, and extracting one or more high- probability solutions based on the solutions'"'"' respective determined probabilities of correctness.
-
Citations
67 Claims
-
1. A computer-implemented method of determining discourse structures, the method comprising:
-
generating a set of one or more discourse parsing decision rules based on a training set; and
determining a discourse structure for an input text segment by applying the generated set of discourse parsing decision rules to the input text segment. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20)
-
-
16. A computer-implemented text parsing method comprising:
-
generating a set of one or more discourse segmenting decision rules based on a training set; and
determining boundaries in an input text segment by applying the generated set of discourse segmenting decision rules to the input text segment.
-
-
21. A computer-implemented method of generating discourse trees, the method comprising:
-
segmenting an input text segment into elementary discourse units (EDUs); and
incrementally building a discourse tree for the input text segment by performing operations on the EDUs to selectively combine the EDUs into larger discourse tree units. - View Dependent Claims (22, 23, 24, 25, 26, 28)
-
-
27. A discourse parsing system comprising:
-
a plurality of automatically learned decision rules;
an input list comprising a plurality of elementary discourse trees (EDTs), each EDT corresponding to an elementary discourse unit (EDU) of an input text segment;
a stack for holding discourse tree segments while a discourse tree for the input text segment is being built; and
a plurality of operators for incrementally building the discourse tree for the input text segment by selectively combining the EDTs into a discourse tree segment according to the plurality of decision rules and moving the discourse tree segment onto the stack.
-
-
29. A computer-implemented method comprising determining a discourse structure for an input text segment by applying a set of automatically learned discourse parsing decision rules to an input text segment.
-
30. A computer-implemented summarization method comprising:
-
generating a set of one or more summarization decision rules based on a training set; and
compressing a tree structure by applying the generated set of summarization decision rules to the tree structure. - View Dependent Claims (31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 48, 49, 50, 51, 52, 53, 54)
-
-
47. A computer-implemented summarization method comprising:
-
generating a parse tree for an input text segment; and
iteratively reducing the generated parse tree by selectively eliminating portions of the parse tree.
-
-
55. A computer-implemented summarization method comprising:
-
parsing an input text segment to generate a parse tree for the input segment;
generating a plurality of potential solutions;
applying a statistical model to determine a probability of correctness for each of potential solution;
extracting one or more high-probability solutions based on the solutions'"'"' respective determined probabilities of correctness. - View Dependent Claims (56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67)
-
Specification