Text summarization method & apparatus using a multidimensional subspace
First Claim
1. A computer-implemented method of summarizing a first unit of text data with relation to an existing document collection, comprising:
computing a term weight that is representative of the relevance of a term to a second unit of text data with relation to the document collection;
comparing the computed term weight to a predetermined threshold; and
returning a relevant term based at least in part on a result of the comparison.
Abstract
A text summarizer identifies relevant terms in a document, weights the terms, and extracts one or more segments to produce a summary or abstract. The terms in a particular document are weighted in relation to an existing document collection. A term weight computer computes term weights for terms in the document, and a threshold comparator compares the term weights to a predetermined threshold to determine whether the corresponding terms are relevant to the document collection. Next, a term weight summer adds the term weights for each occurrence of each relevant term in the various segments of the document, and a summation comparator compares the summations to identify a text summarization segment representative of the document. Optionally, relevant terms can be highlighted in the text summarization segment.
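The flow described in the abstract (weight terms, compare against a threshold, sum the weights of relevant terms per segment, pick the highest-scoring segment) can be sketched as follows. This is only an illustrative sketch under stated assumptions: the abstract does not give the weighting formula, so a simple term-frequency times smoothed inverse-document-frequency weight is swapped in as a stand-in, sentences are treated as segments, and the names `tokenize`, `term_weights`, and `summarize` are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def term_weights(document, collection):
    """Stand-in term weight (an assumption, not the patent's formula):
    term frequency in the document times a smoothed IDF over the collection."""
    tf = Counter(tokenize(document))
    n_docs = len(collection)
    doc_term_sets = [set(tokenize(d)) for d in collection]
    weights = {}
    for term, count in tf.items():
        df = sum(1 for terms in doc_term_sets if term in terms)
        weights[term] = count * math.log((1 + n_docs) / (1 + df))
    return weights


def summarize(document, collection, threshold):
    """Return (best_segment, relevant_terms) for the document."""
    weights = term_weights(document, collection)
    # Threshold comparator: keep only terms whose weight exceeds the threshold.
    relevant = {t: w for t, w in weights.items() if w > threshold}
    # Segments are assumed to be sentences for this sketch.
    segments = [s.strip() for s in re.split(r"(?<=[.!?])\s+", document) if s.strip()]

    # Term weight summer: add the weight for each occurrence of a relevant term.
    def score(segment):
        return sum(relevant.get(tok, 0.0) for tok in tokenize(segment))

    # Summation comparator: the segment with the largest sum is the summary.
    best = max(segments, key=score)
    return best, sorted(relevant)


if __name__ == "__main__":
    collection = [
        "the cat sat on the mat",
        "the dog sat on the log",
        "the bird flew over the house",
    ]
    document = "The cat sat. The cat chased the laser pointer. The dog sat."
    print(summarize(document, collection, threshold=1.0))
```

The returned list of relevant terms is what an optional highlighting step would mark inside the chosen segment.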
45 Claims
1. A computer-implemented method of summarizing a first unit of text data with relation to an existing document collection, comprising:
computing a term weight that is representative of the relevance of a term to a second unit of text data with relation to the document collection;
comparing the computed term weight to a predetermined threshold; and
returning a relevant term based at least in part on a result of the comparison.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15
16. A computer program product for summarizing a first unit of text data with relation to an existing document collection, including a computer-readable medium encoded with instructions configured to be executed by a processor in order to perform predetermined operations comprising:
computing a term weight that is representative of the relevance of a term to a second unit of text data with relation to the document collection;
comparing the computed term weight to a predetermined threshold; and
returning a relevant term based at least in part on a result of the comparison.
Dependent claims: 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30
31. A text summarizer, comprising:
a term weight computer configured to compute a term weight that is representative of the relevance of a term to the document collection;
a threshold comparator configured to compare the computed term weight to a predetermined threshold, wherein the text summarizer is configured to return a relevant term based at least in part on a result of the comparison.
Dependent claims: 32
33. A computer-implemented method for creating a summary with relation to an existing document collection having one or more documents based on a query, comprising:
receiving query information from a user;
identifying a first document segment of a first document of the document collection, wherein the first document segment is substantially optimized to represent a summary of the first document in relation to the query information based on a weighting process of terms within the document, the weighting process being based on a subspace transformation of the query information, the subspace being based on a number of occurrences of terms in the documents of the document collection; and
returning the first document segment of the first document to the user.
Dependent claims: 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45
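The query-driven method of claim 33 can be illustrated with a small sketch. Everything specific here is an assumption made for illustration: the claim text above does not define the subspace transformation, so the subspace is taken to be the span of the documents' term-count vectors (built with Gram-Schmidt orthonormalization), the query's count vector is projected onto that span to obtain term weights, and sentences serve as document segments. The function names are hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def count_vector(text, vocab):
    """Term-occurrence counts of the text, ordered by the vocabulary."""
    counts = Counter(tokenize(text))
    return [float(counts[t]) for t in vocab]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def orthonormal_basis(vectors, eps=1e-10):
    """Gram-Schmidt: an orthonormal basis for the span of the given vectors."""
    basis = []
    for v in vectors:
        w = list(v)
        for b in basis:
            c = dot(w, b)
            w = [wi - c * bi for wi, bi in zip(w, b)]
        norm = dot(w, w) ** 0.5
        if norm > eps:  # skip vectors already inside the span
            basis.append([wi / norm for wi in w])
    return basis


def project(v, basis):
    """Orthogonal projection of v onto the subspace spanned by the basis."""
    out = [0.0] * len(v)
    for b in basis:
        c = dot(v, b)
        out = [o + c * bi for o, bi in zip(out, b)]
    return out


def query_summary(query, doc_index, collection):
    """Return the segment of collection[doc_index] that best matches the query."""
    vocab = sorted({t for d in collection for t in tokenize(d)})
    # Assumed subspace: the span of the documents' term-count vectors.
    basis = orthonormal_basis([count_vector(d, vocab) for d in collection])
    # Term weights come from the query after projection into the subspace.
    weights = dict(zip(vocab, project(count_vector(query, vocab), basis)))
    text = collection[doc_index]
    segments = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    return max(segments, key=lambda s: sum(weights.get(t, 0.0) for t in tokenize(s)))


if __name__ == "__main__":
    docs = [
        "The kitten purrs. The market fell.",
        "kitten purrs kitten",
        "market fell market",
    ]
    print(query_summary("kitten", 0, docs))
    print(query_summary("market", 0, docs))
```

Because the projection spreads weight onto terms that co-occur with the query term in the collection, a segment can score well even when it shares related terms rather than only the literal query word.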
Specification