System and method for generating analytic summaries
Abstract
A technique is provided for compressing texts such that referential integrity, sentence coherency, punctuation and readability are preserved. The technique compresses sentence constituents based on the type of content, the informativity of each constituent and the grammatical readability of the resulting sentence or phrase. Information content portions are parsed to generate part-of-speech tags. The informativity of the constituents in a phrase or sentence is determined, and the parts of speech that have lower information content and little effect on the grammatical readability of the phrase or sentence are selectively compressed. Parts of speech having successively higher informativity and little effect on grammatical readability are then selected for compression until the desired level of compression is reached. Compressed portions are indicated in the summary by a selectable placeholder that expands to display the compressed text.
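The selection loop described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the informativity scores, the set of tags treated as safe to remove, the `[...]` placeholder and the token-count definition of the degree of compression are all assumptions made for illustration, and the part-of-speech tags are supplied by hand rather than produced by a parser.

```python
# Hypothetical informativity score per part-of-speech tag (lower = less informative).
INFORMATIVITY = {"DET": 1, "ADV": 2, "ADJ": 3, "NOUN": 5, "VERB": 5}
# Hypothetical set of tags whose removal is assumed to leave the phrase readable.
REMOVABLE = {"DET", "ADV", "ADJ"}
PLACEHOLDER = "[...]"


def compress(tagged, degree):
    """Compress (word, tag) pairs, lowest-informativity tier first.

    `degree` is the assumed fraction of tokens to hide (0.0 to 1.0).
    Returns the summary string and the hidden text behind each placeholder.
    """
    budget = int(len(tagged) * degree)  # number of tokens allowed to be hidden
    removed = set()
    # Visit removable tiers in order of increasing informativity.
    tiers = sorted({INFORMATIVITY[tag] for _, tag in tagged if tag in REMOVABLE})
    for tier in tiers:
        for i, (_, tag) in enumerate(tagged):
            if len(removed) >= budget:
                break
            if tag in REMOVABLE and INFORMATIVITY[tag] == tier:
                removed.add(i)

    summary, hidden = [], []
    for i, (word, _) in enumerate(tagged):
        if i not in removed:
            summary.append(word)
        elif i - 1 in removed:
            hidden[-1] += " " + word  # extend the current hidden run
        else:
            summary.append(PLACEHOLDER)  # one placeholder per hidden run
            hidden.append(word)
    return " ".join(summary), hidden


sentence = [("The", "DET"), ("very", "ADV"), ("old", "ADJ"),
            ("printer", "NOUN"), ("quickly", "ADV"), ("jammed", "VERB")]
summary, hidden = compress(sentence, 0.5)
print(summary)  # [...] old printer [...] jammed
print(hidden)   # ['The very', 'quickly']
```

In this sketch the list returned as `hidden` stands in for the text that a selectable placeholder would expand to display. A real system would also need a readability check that depends on sentence structure rather than on a fixed set of tags.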
14 Claims
1. A system for generating text summaries of a content portion comprising at least one phrase, the system comprising:
a parts of speech determining circuit that determines the part of speech of constituents of the at least one phrase;
an informativity determining circuit that determines the informativity of the constituents of the at least one phrase based on the determined part of speech;
an informativity compressing circuit that compresses the constituents of the at least one phrase based on the determined informativity, grammatical readability and a desired degree of compression.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A method for generating text summaries of a content portion comprising at least one phrase, the method comprising the steps of:
determining a desired degree of compression;
determining parts of speech of the constituents of the at least one phrase;
determining the informativity of the constituents of the at least one phrase based on the determined part of speech;
compressing the constituent of the at least one phrase based on the determined informativity, grammatical readability and the desired degree of compression.
Dependent claims: 9, 10, 11, 12, 13, 14
Specification