Region based information retrieval system
Abstract
A region based information retrieval system improves on conventional information retrieval systems by breaking down documents into one or more region(s) and processing the additional information available at a region level of analysis. When looking at regions, it becomes possible to quickly distinguish between groups of related documents, quickly ignore or focus on certain information, track recent evolutions of documents, as well as understand the historical relationships, heritage, and versions of these documents. This is all possible whether or not the document publishers specify where the content originally came from.
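The region-level idea in the abstract can be illustrated with a minimal sketch. It is not the patented method: it assumes that blank-line-separated paragraphs stand in for regions, that "nearly identical" means equal after lowercasing and whitespace collapsing, and that a region must recur in a different document. The names `normalize` and `find_region_sets` are hypothetical.

```python
import re
from collections import defaultdict


def normalize(text):
    """Lowercase and collapse whitespace so trivially different copies match."""
    return re.sub(r"\s+", " ", text.strip().lower())


def find_region_sets(documents):
    """Group paragraphs that occur in more than one document.

    `documents` maps a document id to its text. Treating paragraphs as
    regions is an assumption of this sketch, not something the source states.
    """
    occurrences = defaultdict(set)
    for doc_id, text in documents.items():
        for paragraph in re.split(r"\n\s*\n", text):
            key = normalize(paragraph)
            if key:
                occurrences[key].add(doc_id)
    # A paragraph counts as a region only if another document also contains it.
    return {key: docs for key, docs in occurrences.items() if len(docs) > 1}


docs = {
    "a": "Shared licence text.\n\nOnly in A.",
    "b": "Intro for B.\n\nshared  licence text.",
    "c": "Nothing in common.",
}
region_sets = find_region_sets(docs)
```

Here the only region set is the licence paragraph shared by documents `a` and `b`; the paragraphs unique to one document are not regions under this definition.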
19 Claims
1. A system for retrieving information from document collections comprising:
- a document collection subsystem for managing documents in a document database;
- a region finding, splitting and graphing subsystem for analyzing documents in the document database, establishing regions of these documents, where the region is less than the containing document and the bounds of each region are defined by the existence of at least one other identical or nearly identical region elsewhere in the document database, identifying region sets of such identical or nearly identical regions across documents, and storing these region sets in a region set database;
- an indexing subsystem for making the region sets searchable and storing the index information in a searchable index of region set database; and
- a searching and ranking subsystem for finding region sets in the region set database using the searchable index of region sets based on an information request, creating a list of region set clusters of closely related region sets from the region sets found, where the relations are based on the relationships in the region set graphs obtained from the region set graphs database, and communicating the search results.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10.
11. A method of organizing and retrieving information in document collections comprising:
- a document collection element for managing documents in a document database;
- a region finding, splitting and graphing element for analyzing documents in the document database, establishing regions of these documents, where the region is less than the containing document and the bounds of each region are defined by the existence of at least one other identical or nearly identical region elsewhere in the document database, identifying region sets of such identical or nearly identical regions across documents, and storing these region sets in a region set database;
- an indexing element for making the region sets searchable and storing the index information in a searchable index of region set database; and
- a searching and ranking element for finding region sets in the region set database using the searchable index of region sets based on an information request, creating a list of region set clusters of closely related region sets from the region sets found, where the relations are based on the relationships in the region set graphs obtained from the region set graphs database, and communicating the search results.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19.
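The indexing and searching elements recited in the claims can be sketched roughly as follows. This is an illustration under stated assumptions, not the claimed implementation: the index is a plain word-to-region-set inverted index, two region sets are treated as related when at least one document contains both, clusters are connected components among the hits, and larger clusters are ranked first. The names `build_index`, `build_graph`, and `search`, and the sample data, are hypothetical.

```python
from collections import defaultdict


def build_index(region_sets):
    """Map each word to the ids of region sets whose text contains it."""
    index = defaultdict(set)
    for rs_id, (text, _docs) in region_sets.items():
        for word in text.lower().split():
            index[word].add(rs_id)
    return index


def build_graph(region_sets):
    """Link two region sets when at least one document contains both (assumed relation)."""
    graph = defaultdict(set)
    ids = list(region_sets)
    for i, a in enumerate(ids):
        for b in ids[i + 1:]:
            if region_sets[a][1] & region_sets[b][1]:
                graph[a].add(b)
                graph[b].add(a)
    return graph


def search(query, index, graph):
    """Return clusters of matching region sets, largest cluster first."""
    words = query.lower().split()
    hits = set.intersection(*(index.get(w, set()) for w in words)) if words else set()
    clusters, seen = [], set()
    for start in sorted(hits):
        if start in seen:
            continue
        # Depth-first walk over graph edges, restricted to region sets that matched.
        cluster, stack = set(), [start]
        while stack:
            node = stack.pop()
            if node in cluster:
                continue
            cluster.add(node)
            stack.extend((graph.get(node, set()) & hits) - cluster)
        seen |= cluster
        clusters.append(sorted(cluster))
    return sorted(clusters, key=lambda c: (-len(c), c))


# Each region set: id -> (representative text, ids of documents containing it).
region_sets = {
    "r1": ("open source licence text", {"a", "b"}),
    "r2": ("licence header for source files", {"b", "c"}),
    "r3": ("licence notice", {"x", "y"}),
}
index = build_index(region_sets)
graph = build_graph(region_sets)
results = search("licence", index, graph)
```

With this data, `r1` and `r2` share document `b` and so fall into one cluster, while `r3` forms a cluster of its own.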
Specification