Computer based summarization of natural language documents
Abstract
A system and method for summarizing the contents of a natural language document provided in electronic or digital form includes preformatting the document, performing linguistic analysis, weighting each sentence in the document as a function of quantitative importance, and generating one or more document summaries, from a plurality of selectable document summary types, as a function of the sentence weights.
36 Claims
1. A method for summarizing the contents of a natural language document including a plurality of sentences and provided in electronic or digital form, said method comprising:
A. extracting from said document eSAOs, including extracting subjects, objects, and actions and extracting one or more of adjectives, prepositions, indirect objects and adverbials;
B. determining a weight for each eSAO;
C. for each sentence in said document, using the weights of all eSAOs for said sentence to obtain a sentence weight; and
D. generating one or more document summaries as a function of said sentence weights.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
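The weighting steps of claim 1 (eSAO weights, then sentence weights, then a summary) can be illustrated with a minimal sketch. The claim does not say how an eSAO weight is computed or how eSAO weights are combined, so everything specific here is an assumption made for illustration: the `ESAO` class, the frequency-based weight, the plain sum per sentence, and the top-k extractive summary are hypothetical and are not taken from the patent. The eSAOs are supplied by hand; extraction itself is not shown.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class ESAO:
    """Hypothetical container for one extracted eSAO."""
    subject: str
    action: str
    obj: str
    adverbial: str = ""  # one of the optional extended fields named in the claim


def core_terms(esao):
    """Return the non-empty subject, action, and object terms of an eSAO."""
    return [t for t in (esao.subject, esao.action, esao.obj) if t]


def esao_weights(sentences):
    """Step B (assumed scheme): weight each eSAO by how often its
    subject, action, and object terms occur across all eSAOs in the document."""
    counts = Counter(t for esaos in sentences for e in esaos for t in core_terms(e))
    return [[sum(counts[t] for t in core_terms(e)) for e in esaos]
            for esaos in sentences]


def sentence_weights(sentences):
    """Step C (assumed scheme): a sentence weight is the sum of its eSAO weights."""
    return [sum(ws) for ws in esao_weights(sentences)]


def summarize(texts, sentences, k):
    """Step D (assumed scheme): keep the k heaviest sentences, in document order."""
    weights = sentence_weights(sentences)
    top = sorted(range(len(texts)), key=lambda i: weights[i], reverse=True)[:k]
    return [texts[i] for i in sorted(top)]


if __name__ == "__main__":
    texts = [
        "The pump moves water and needs power.",
        "Water cools the engine.",
        "The weather was nice.",
    ]
    sentences = [
        [ESAO("pump", "move", "water"), ESAO("pump", "need", "power")],
        [ESAO("water", "cool", "engine")],
        [],  # no eSAO extracted from this sentence
    ]
    print(sentence_weights(sentences))  # [9, 4, 0]
    print(summarize(texts, sentences, 2))
```

A sentence with no extracted eSAOs receives weight 0 under this scheme, so it is the first to drop out of a summary.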
13. A method for summarizing the contents of a natural language document provided in electronic or digital form, said method comprising:
A. performing linguistic analysis, including:
i) tagging substantially each word as a function of a part of speech of said word;
ii) parsing verbal sequences and noun phrases from said tagged words; and
iii) building a syntactical parsed tree from said verbal sequences and noun phrases, according to a set of rules, wherein words grouped by a rule become inaccessible to other rules;
B. weighting each sentence in the document as a function of quantitative importance and said syntactical parsed tree; and
C. generating one or more document summaries, from a plurality of selectable document summary types, as a function of the sentence weights.
Dependent claims: 14, 15, 16, 17, 18
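The tree-building limitation in claim 13, "words grouped by a rule become inaccessible to other rules", can be sketched as follows. This is an illustrative assumption, not the patent's rule set: the Penn Treebank-style tags, the four toy rules, and the `apply_rules` function are all hypothetical. Each rule replaces a matching run of nodes with a single parent node, so later rules only see the parent's label and can no longer match the words inside it.

```python
def apply_rules(tagged, rules):
    """Build a parse bottom-up from (word, tag) pairs.

    rules is an ordered list of (label, pattern) pairs, where pattern is a
    list of node labels. Nodes are (label, payload) tuples: payload is the
    word for a leaf, or the list of grouped child nodes for a parent.
    """
    nodes = [(tag, word) for word, tag in tagged]
    for label, pattern in rules:
        if not pattern:
            continue  # an empty pattern would never consume a node
        n = len(pattern)
        out, i = [], 0
        while i < len(nodes):
            window = nodes[i:i + n]
            if [lab for lab, _ in window] == pattern:
                # Group the matched nodes; their labels are now hidden
                # behind the parent label for every later rule.
                out.append((label, window))
                i += n
            else:
                out.append(nodes[i])
                i += 1
        nodes = out
    return nodes


if __name__ == "__main__":
    tagged = [("the", "DT"), ("pump", "NN"), ("moves", "VBZ"), ("water", "NN")]
    rules = [
        ("NP", ["DT", "NN"]),   # determiner + noun
        ("NP", ["NN"]),         # bare noun; cannot re-match "pump", already grouped
        ("VP", ["VBZ", "NP"]),  # verb + noun phrase
        ("S", ["NP", "VP"]),    # full clause
    ]
    print(apply_rules(tagged, rules))
```

In the example, the second rule matches only "water": "pump" was already absorbed into the first noun phrase, so its `NN` label is no longer visible at the top level.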
19. A system for summarizing the contents of a natural language document provided in electronic or digital form, said system comprising:
A. at least one memory having a set of linguistic rules stored therein;
B. a linguistic analyzer coupled to said at least one memory and configured for:
i) tagging substantially each word as a function of a part of speech of said word;
ii) parsing verbal sequences and noun phrases from said tagged words; and
iii) building a syntactical parsed tree from said verbal sequences and noun phrases, according to said set of rules, wherein words grouped by a rule become inaccessible to other rules;
C. a sentence weighting module configured to access said syntactical parsed tree and to weight each sentence in the document as a function of quantitative importance and said syntactical parsed tree; and
D. a summary generator for generating one or more document summaries, from a plurality of selectable document summary types, as a function of the sentence weights.
Dependent claims: 20, 21, 22, 23, 24
25. A system for summarizing the contents of a natural language document including a plurality of sentences and provided in electronic or digital form, said system comprising:
A. at least one memory having a set of linguistic rules stored therein;
B. a linguistic analyzer coupled to said at least one memory and configured for extracting from said document eSAOs, including extracting subjects, objects, and actions and extracting one or more of adjectives, prepositions, indirect objects and adverbials;
C. a weighting module for determining a weight for each eSAO and, for each sentence in said document, using the weights of all eSAOs for said sentence to obtain a sentence weight; and
D. a summary generator for generating one or more document summaries as a function of said sentence weights.
Dependent claims: 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36
Specification