Region Based Information Retrieval System
First Claim
1. A system for retrieving information from document collections comprising:
- a document collection subsystem for managing document collection in a document database;
a region finding, splitting and graphing subsystem for analyzing documents in the document database, establishing regions of these documents, identifying region sets of identical or nearly identical regions across documents, and storing these regions sets in a region set database;
an indexing subsystem for making the region sets searchable and storing the index information in a searchable index of region set database; and
a searching and ranking subsystem for finding region sets in the region set database using the searchable index of region set database based on an information request, and communicating the search results to end-users or another system.
0 Assignments
0 Petitions
Accused Products
Abstract
A region based information retrieval system improves on conventional information retrieval systems by breaking down documents into one or more region(s) and processing the additional information available at a region level of analysis. When looking at regions, it becomes possible to quickly distinguish between groups of related documents, quickly ignore or focus on certain information, track recent evolutions of documents, as well as understand the historical relationships, heritage, and versions of these documents. This is all possible whether or not the document publishers specify where the content originally came from.
-
Citations
19 Claims
-
1. A system for retrieving information from document collections comprising:
-
a document collection subsystem for managing document collection in a document database; a region finding, splitting and graphing subsystem for analyzing documents in the document database, establishing regions of these documents, identifying region sets of identical or nearly identical regions across documents, and storing these regions sets in a region set database; an indexing subsystem for making the region sets searchable and storing the index information in a searchable index of region set database; and a searching and ranking subsystem for finding region sets in the region set database using the searchable index of region set database based on an information request, and communicating the search results to end-users or another system. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method for retrieved information from document collections, comprising:
-
identifying documents related to an information request; establishing any relationships between these documents; clustering the documents based on such relationships; ranking each cluster of documents; and communicating the search results to end-users or another system. - View Dependent Claims (11, 12)
-
-
13. A method of organizing and retrieving information in documents collections comprising:
-
breaking down each document into one or more regions; establishing region sets which are duplicates or near duplicates of each other; searching and finding region sets related to an information request; and communicating the search results to end-users or another system. - View Dependent Claims (14, 15, 16, 17, 18, 19)
-
Specification