System and method for automatic anthology creation using document aspects
First Claim
Patent Images
1. A computer implemented method for searching, browsing, presenting, and interacting with data assembled from contents of at least one document, comprising:
- for a document aspect comprising a portion of said document'"'"'s contents in user viewable form, where said portion is all and only those parts of said at least one document that have something semantically useful in common, and for one or more predetermined document aspect types within said at least one document, performing the following steps;
extracting from said at least one document one or more user viewable objects related to said predetermined type in accordance with said document aspect to yield a document aspect instance;
storing said document aspect instance extracted from said at least one document in connection with a user viewable assembled collection in a repository for a subsequent use; and
when said document aspect instance is accessed, providing said assembled collection to a user;
wherein a document aspect provides finer granularity in search results for a search that finds terms within multiple articles in a document contained within a collection, in which case such search results are displayed hierarchically as collections at a top level, with lists of documents within each collection, and with lists of articles within the documents, and wherein granularity is extendable to a page level, displaying a list of pages under each article, and for each page showing two or more words of context surrounding an occurrence of a search term.
5 Assignments
0 Petitions
Accused Products
Abstract
A generic and expandable document aspect system and method for searching, browsing, presenting, and interacting with data assembled from document contents and related external data is provided. New varieties of document aspects are added to existing installations and can be accessed by users without requiring upgrades to server or clients, for example by using plug-in technology.
123 Citations
30 Claims
-
1. A computer implemented method for searching, browsing, presenting, and interacting with data assembled from contents of at least one document, comprising:
-
for a document aspect comprising a portion of said document'"'"'s contents in user viewable form, where said portion is all and only those parts of said at least one document that have something semantically useful in common, and for one or more predetermined document aspect types within said at least one document, performing the following steps; extracting from said at least one document one or more user viewable objects related to said predetermined type in accordance with said document aspect to yield a document aspect instance; storing said document aspect instance extracted from said at least one document in connection with a user viewable assembled collection in a repository for a subsequent use; and when said document aspect instance is accessed, providing said assembled collection to a user; wherein a document aspect provides finer granularity in search results for a search that finds terms within multiple articles in a document contained within a collection, in which case such search results are displayed hierarchically as collections at a top level, with lists of documents within each collection, and with lists of articles within the documents, and wherein granularity is extendable to a page level, displaying a list of pages under each article, and for each page showing two or more words of context surrounding an occurrence of a search term. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 29)
-
-
15. A system for searching, browsing, presenting, and interacting with data assembled from contents of at least one document, comprising:
-
for a document aspect comprising a portion of said document'"'"'s contents in user viewable form, where said portion is all and only those parts of said at least one document that have something semantically useful in common, and for one or more predetermined types within said at least one document, means for performing the following; extracting from said at least one document one or more user viewable objects related to said predetermined type in accordance with said document aspect to yield a document aspect instance; storing said document aspect instance extracted from said at least one document in connection with a user viewable assembled collection in a repository for a subsequent use; when said document aspect instance is accessed, means for providing said assembled collection to a user; wherein a document aspect provides finer granularity in search results for a search that finds terms within multiple articles in a document contained within a collection, in which case such search results are displayed hierarchically as collections at a top level, with lists of documents within each collection, and with lists of articles within the documents, and wherein granularity is extendable to a page level, displaying a list of pages under each article, and for each page showing two or more words of context surrounding an occurrence of a search term; and wherein the means for performing includes a computer processor. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 30)
-
Specification