Document abstraction system and method thereof
Abstract
A document abstract system and methodology produces an abstract from a text by identifying sentences within the text that contain numerical information, such as dates and numbers. The sentences with numerical information along with the first and last sentences of the document are copied for producing the abstract. The computer generated abstract preferably includes a list of proper nouns and adjectives and most common words in the document.
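The selection rule described above (first sentence, last sentence, and a capped number of sentences carrying numeric information) can be illustrated with a minimal sketch. It is not the patented implementation. The sentence splitter, the "contains a digit" test for numeric information, and the names `split_sentences` and `produce_abstract` are assumptions made for illustration; the cap of ten follows the claim language.

```python
import re

# Assumption: a sentence ends at '.', '!' or '?' followed by whitespace.
# Abbreviations such as "Dr. Smith" would be split incorrectly by this rule.
_SENTENCE_END = re.compile(r"(?<=[.!?])\s+")

# Assumption: "numeric information" is approximated as "contains a digit".
_HAS_DIGIT = re.compile(r"\d")


def split_sentences(text):
    """Split text into a list of non-empty, stripped sentences."""
    return [s.strip() for s in _SENTENCE_END.split(text.strip()) if s.strip()]


def produce_abstract(text, max_numeric=10):
    """Return the first sentence, up to max_numeric middle sentences
    that contain a digit, and the last sentence, in document order."""
    sentences = split_sentences(text)
    if not sentences:
        return []
    if len(sentences) == 1:
        return [sentences[0]]
    middle = sentences[1:-1]
    numeric = [s for s in middle if _HAS_DIGIT.search(s)][:max_numeric]
    return [sentences[0]] + numeric + [sentences[-1]]


if __name__ == "__main__":
    sample = (
        "The company reported results. Revenue rose 12 percent. "
        "Staff were pleased. The report was filed on May 3, 1995. "
        "More news will follow."
    )
    for line in produce_abstract(sample):
        print(line)
```

When more than ten middle sentences contain digits, this sketch keeps the first ten in document order. The source states only the cap, not which sentences are preferred, so that choice is also an assumption.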
24 Claims
1. A method of producing an abstract for a text, said text comprising one or more sentences, each sentence comprising one or more words, said method comprising the computer-implemented steps of:
determining the first sentence of said text;
determining the last sentence of said text;
determining, among the remaining sentences of said text, at most ten abstract sentences containing numeric information; and
producing said abstract based on said first sentence, said at most ten abstract sentences, and said last sentence.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11.

12. A method of producing an abstract for a text, said text comprising one or more sentences, each sentence comprising one or more words, said method comprising the computer-implemented steps of:
determining the first sentence of said text;
determining the last sentence of said text;
determining, among the remaining sentences of said text, one or more abstract sentences containing numeric information; and
counting a number of occurrences for each word within said text;
generating a list of words from said text and the number of occurrences thereof;
pruning said list of words to produce a pruned list of words including the step of removing proper names from said list of words, wherein a proper name has an initial letter that is capitalized when not at the beginning of a sentence of said one or more sentences;
determining a plurality of common words from said pruned list of words, said plurality of common words comprising words from said pruned list of words having at least as many occurrences as any words from said pruned list of words not part of said plurality of common words; and
producing said abstract based on said first sentence, said at most ten abstract sentences, said last sentence, and said plurality of common words.

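The word-counting and pruning steps of claim 12 can be sketched as follows. This is a hedged illustration, not the patentee's code. The function name `common_words`, the `how_many` parameter, case-insensitive counting, the letters-only word pattern, and the alphabetical tie-break are all assumptions. The proper-name test follows the claim: a word is treated as a proper name if it appears with an initial capital anywhere other than the start of a sentence.

```python
import re
from collections import Counter

# Assumption: sentences end at '.', '!' or '?' followed by whitespace.
_SENTENCE_END = re.compile(r"(?<=[.!?])\s+")
# Assumption: a word is a run of ASCII letters.
_WORD = re.compile(r"[A-Za-z]+")


def common_words(text, how_many=5):
    """Return the how_many most frequent words as (word, count) pairs,
    after removing words identified as proper names."""
    counts = Counter()
    proper_names = set()
    for sentence in _SENTENCE_END.split(text.strip()):
        for position, word in enumerate(_WORD.findall(sentence)):
            counts[word.lower()] += 1
            # Capitalized away from the sentence start: treat as a proper name.
            if position > 0 and word[0].isupper():
                proper_names.add(word.lower())
    pruned = {w: n for w, n in counts.items() if w not in proper_names}
    # Highest count first; ties broken alphabetically (an assumption).
    ranked = sorted(pruned.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:how_many]


if __name__ == "__main__":
    sample = (
        "Acme sold widgets. The widgets were sold by Acme in Ohio. "
        "Buyers liked the widgets."
    )
    print(common_words(sample, 3))
```

Taking the top entries of a list sorted by count satisfies the claim's condition that every selected word occurs at least as often as any word left out. A word seen only at sentence starts is never flagged as a proper name, which is a limitation of the capitalization rule itself.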
13. A method of searching a plurality of texts, each comprising one or more sentences, said method comprising the computer-implemented steps of:
producing a plurality of abstracts, each abstract based on a corresponding text of said plurality of texts;
receiving a search key from a user;
searching said plurality of abstracts for an abstract that matches said search key; and
outputting said abstract that matches said search key;
wherein said abstract contains one or more sentences of the corresponding text, consisting essentially of the first sentence of the corresponding text, at most ten abstract sentences, each containing numeric information, and the last sentence of the corresponding text.

14. A method of searching a plurality of texts, each comprising one or more sentences, said method comprising the computer-implemented steps of:
receiving a search key from a user;
searching said plurality of texts for a text that matches said search key;
producing an abstract, based on a corresponding document of said plurality of texts that matches said search key; and
outputting said abstract;
wherein said abstract contains one or more sentences of the corresponding texts, consisting essentially of the first sentence of the corresponding text, at most ten abstract sentences, each containing numeric information, and the last sentence of the corresponding text.

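Claims 13 and 14 differ in the order of operations: claim 13 abstracts every text and then searches the abstracts, while claim 14 searches the full texts and abstracts only the matches. The sketch below contrasts the two orderings. The function names, the compact `make_abstract` helper, and the case-insensitive substring match are assumptions for illustration; the claims do not say how a search key is matched.

```python
import re


def make_abstract(text, max_numeric=10):
    """First sentence, up to max_numeric middle sentences containing a
    digit, and last sentence, joined into one string (illustrative only)."""
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    if len(sentences) <= 1:
        return " ".join(sentences)
    numeric = [s for s in sentences[1:-1] if re.search(r"\d", s)][:max_numeric]
    return " ".join([sentences[0], *numeric, sentences[-1]])


def search_abstracts(texts, search_key):
    """Claim 13 ordering: abstract every text, then search the abstracts."""
    key = search_key.lower()
    abstracts = [make_abstract(t) for t in texts]
    return [a for a in abstracts if key in a.lower()]


def search_texts(texts, search_key):
    """Claim 14 ordering: search the full texts, then abstract the matches."""
    key = search_key.lower()
    return [make_abstract(t) for t in texts if key in t.lower()]


if __name__ == "__main__":
    texts = [
        "Sales opened strong. Revenue hit 40 million. "
        "Managers praised the staff. Outlook is stable.",
        "Rain fell all day. Rivers rose 2 meters. "
        "Crews watched the levees. Roads reopened.",
    ]
    print(search_abstracts(texts, "staff"))
    print(search_texts(texts, "staff"))
```

In this sketch the two orderings can return different results: a key that appears only in a sentence omitted from the abstract is found by `search_texts` but not by `search_abstracts`.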
15. A document abstraction system, configured to produce an abstract from a text, comprising one or more sentences each sentence comprising one or more words, said document abstraction system comprising:
means for determining the first sentence of said text;
means for determining the last sentence of said text;
means for determining, among the remaining sentences of said text, at most ten abstract sentences containing numeric information; and
means for producing said abstract based on said first sentence, said one or more abstract sentences, and said last sentence.
Dependent claims: 16, 17, 18, 19.

20. A computer-readable medium having stored thereon sequences of instructions for producing an abstract from a document, comprising one or more sentences each comprising one or more words, said sequences of instructions includes sequences of instructions which, when executed by a processor, cause said processor to perform the steps of:
determining the first sentence of said text;
determining the last sentence of said text;
determining, among the remaining sentences of said text, at most ten abstract sentences containing numeric information; and
producing said abstract based on said first sentence, said one or more abstract sentences, and said last sentence.
Dependent claims: 21, 22, 23, 24.

Specification