Ontological concept-based, user-centric text summarization
Abstract
A method and system for constructing a text summarization. At least one domain ontology that includes a set of concepts is selected. A user profile indicative of a user's interests is defined in terms of the ontology concepts. A document's relevance to the user is determined based upon the user profile. If the document is relevant, at least a portion of the ontology is used to extract concepts from the document. The degree of match between the extracted concepts and the user profile concepts is determined and the document text summary is generated if the degree of match exceeds a predetermined threshold. Generating the summary may include selecting sentences based on the concepts in the user profile, ranking the selected sentences by relevance to the user profile, selecting sentences for inclusion in the document text summary based upon the ranking, and merging the selected sentences into the document text summary.
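As an illustration only, the summary-generation steps named in the abstract (selecting sentences, ranking them, choosing the top-ranked ones, and merging them) might be sketched in Python as follows. The function name `summarize`, the `max_sentences` parameter, the representation of the profile as a dictionary of concept terms and weights, and the keyword-overlap scoring are all assumptions made for this sketch; the text above does not specify how sentences are scored.

```python
import re


def summarize(text, profile, max_sentences=2):
    """Build a summary from the sentences that best match a user profile.

    `profile` is a hypothetical mapping of lowercase concept terms to
    interest weights, e.g. {"vaccine": 2.0}. This is an assumed
    representation, not one taken from the source.
    """
    # Naive sentence split on terminal punctuation followed by whitespace.
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

    # Step 1: select sentences that mention at least one profile concept,
    # scoring each by the summed weight of the concepts it mentions.
    scored = []
    for position, sentence in enumerate(sentences):
        words = set(re.findall(r"[a-z]+", sentence.lower()))
        score = sum(weight for term, weight in profile.items() if term in words)
        if score > 0:
            scored.append((score, position, sentence))

    # Step 2: rank the selected sentences by relevance (highest score first,
    # earlier sentences winning ties).
    ranked = sorted(scored, key=lambda item: (-item[0], item[1]))

    # Step 3: keep the top-ranked sentences.
    chosen = ranked[:max_sentences]

    # Step 4: merge them back in original document order.
    chosen.sort(key=lambda item: item[1])
    return " ".join(sentence for _, _, sentence in chosen)
```

Restoring document order in the final step is a design choice of this sketch: it keeps the merged summary readable even though selection was driven by rank.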
30 Claims
1. A method of constructing a text summarization, comprising:
selecting at least one domain ontology comprising a set of concepts;
defining a user profile indicative of the user's interests in terms of the concepts in the selected ontology;
determining if a document is relevant to the user based upon the user profile;
responsive to determining that the document is relevant, using at least a portion of the selected ontology to extract concepts from the document;
determining the degree of match between the extracted concepts and the concepts defined in the user profile; and
generating a document text summary if the degree of match exceeds a predetermined threshold.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10.
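The gating steps of claim 1 (relevance check, ontology-based concept extraction, degree of match, and threshold test) can be sketched in Python as below. This is a minimal sketch under stated assumptions, not the claimed implementation: the toy `ONTOLOGY`, the function names, the term-overlap relevance test, and the weighted-coverage match score are all hypothetical choices, since the claim does not define how relevance or the degree of match is computed.

```python
import re

# Hypothetical toy domain ontology: concept name -> surface terms.
ONTOLOGY = {
    "vaccine": {"vaccine", "vaccines", "immunization"},
    "virus": {"virus", "viruses", "pathogen"},
    "clinical_trial": {"trial", "trials"},
}


def tokenize(text):
    """Return the set of lowercase alphabetic words in the text."""
    return set(re.findall(r"[a-z]+", text.lower()))


def is_relevant(text, profile, ontology):
    """Assumed relevance test: any term of any profile concept appears."""
    profile_terms = set().union(*(ontology.get(c, set()) for c in profile))
    return bool(tokenize(text) & profile_terms)


def extract_concepts(text, ontology):
    """Extract every ontology concept with at least one term in the text."""
    words = tokenize(text)
    return {concept for concept, terms in ontology.items() if words & terms}


def degree_of_match(extracted, profile):
    """Assumed score: share of profile interest weight covered (0.0 to 1.0)."""
    total = sum(profile.values())
    if total == 0:
        return 0.0
    return sum(w for concept, w in profile.items() if concept in extracted) / total


def should_summarize(text, profile, ontology, threshold):
    """Apply the claim's ordering: relevance, extraction, match, threshold."""
    if not is_relevant(text, profile, ontology):
        return False
    extracted = extract_concepts(text, ontology)
    # "Exceeds" is read strictly, so a score equal to the threshold fails.
    return degree_of_match(extracted, profile) > threshold
```

For example, with the hypothetical profile `{"vaccine": 2.0, "virus": 1.0, "clinical_trial": 1.0}`, a document mentioning only a vaccine and a virus covers 3.0 of 4.0 units of interest weight, so it would pass a threshold of 0.5 but not one of 0.75.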
11. A computer program product comprising a computer readable medium containing a set of computer executable instructions for constructing a text summarization, the instructions comprising:
computer code means for selecting at least one domain ontology comprising a set of concepts;
computer code means for defining a user profile indicative of the user's interests in terms of the concepts in the selected ontology;
computer code means for determining if a document is relevant to the user based upon the user profile;
computer code means for using at least a portion of the selected ontology to extract concepts from the document responsive to determining that the document is relevant;
computer code means for determining the degree of match between the extracted concepts and the concepts defined in the user profile; and
computer code means for generating a document text summary if the degree of match exceeds a predetermined threshold.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20.
21. A data processing system including processor, memory, and input means, the system further including computer program product code for constructing a text summarization, the code comprising:
computer code means for selecting at least one domain ontology comprising a set of concepts;
computer code means for defining a user profile indicative of the user's interests in terms of the concepts in the selected ontology;
computer code means for determining if a document is relevant to the user based upon the user profile;
computer code means for using at least a portion of the selected ontology to extract concepts from the document responsive to determining that the document is relevant;
computer code means for determining the degree of match between the extracted concepts and the concepts defined in the user profile; and
computer code means for generating a document text summary if the degree of match exceeds a predetermined threshold.
Dependent claims: 22, 23, 24, 25, 26, 27, 28, 29, 30.
Specification