Object oriented information retrieval framework mechanism
First Claim
1. A computer system comprising:
- a central processing unit;
a user interface; and
a main memory having an operating system that supports an object oriented programming environment containing a framework that provides an extensible information retrieval system that operates on documents stored in the computer system, the framework including;
index class objects having word index objects that map words contained in the stored documents to the documents that contain the words;
a build index object that responds to a user build index request by processing a stored document and creating the word index objects such that they contain the document word mapping and document-related information produced by a preprocessing operation;
a posting list class of objects that are pointed to by the word index objects and contain a frequency count that indicates the number of times a word appears in a word index object of a document;
a document table class of objects that map a word index object to the indexed document from which it was preprocessed; and
a query index object that processes a user query so as to produce a query result from comparison of the user query and the word index objects in response to a user query;
wherein the query result identifies stored documents relevant to the user query.
2 Assignments
0 Petitions
Accused Products
Abstract
A framework for use with object-oriented programming systems provides a reusable object oriented (OO) framework for use with object oriented programming systems that provides an information retrieval (IR) shell that permits a framework user to define an index class that includes word index objects and provides an extensible information retrieval system that evaluates a user query by comparing information contained in the user query with information contained in the word index objects that relates to stored documents. The information in word index objects is produced by preprocessing operations on documents such that the documents relevant to the user query will be identified, thereby providing a query result. The information retrieval system user can load documents into the computer system storage, index documents so their information can be subject to a query search, and request query evaluation to identify and retrieve documents most closely related to the subject matter of a user query.
-
Citations
34 Claims
-
1. A computer system comprising:
-
a central processing unit; a user interface; and a main memory having an operating system that supports an object oriented programming environment containing a framework that provides an extensible information retrieval system that operates on documents stored in the computer system, the framework including; index class objects having word index objects that map words contained in the stored documents to the documents that contain the words; a build index object that responds to a user build index request by processing a stored document and creating the word index objects such that they contain the document word mapping and document-related information produced by a preprocessing operation; a posting list class of objects that are pointed to by the word index objects and contain a frequency count that indicates the number of times a word appears in a word index object of a document; a document table class of objects that map a word index object to the indexed document from which it was preprocessed; and a query index object that processes a user query so as to produce a query result from comparison of the user query and the word index objects in response to a user query; wherein the query result identifies stored documents relevant to the user query. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. An object oriented framework for use in a computer system having an operating system that supports an object oriented programming environment, wherein the framework provides an extensible information retrieval system that operates on documents stored in the computer system and includes:
-
index class objects having word index objects that map words contained in the stored documents to the documents that contain the words; a build index object that responds to a user build index request by processing a stored document and creating the word index objects such that they contain the document word mapping and document-related information produced by a preprocessing operation; a posting list class of objects that are pointed to by the word index objects and contain a frequency count that indicates the number of times a word appears in a word index object of a document; a document table class of objects that map a word index object to the indexed document from which it was preprocessed; and a query index object that processes a user query so as to produce a query result from comparison of the user query and the word index objects in response to a user query; wherein the query result identifies stored documents relevant to the user query. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A program product for use in a computer system having an operating system that supports an object-oriented programming environment, the program product comprising:
-
a signal bearing media; and a framework recorded on the signal bearing media, the framework providing an extensible information retrieval system that operates on documents stored in the computer system and includes; index class objects having word index objects that map words contained in the stored documents to the documents that contain the words; a build index object that responds to a user build index request by processing a stored document and creating the word index objects such that they contain the document word mapping and document-related information produced by a preprocessing operation; a posting list class of objects that are pointed to by the word index objects and contain a frequency count that indicates the number of times a word appears in a word index object of a document; a document table class of objects that map a word index object to the indexed document from which it was preprocessed; and a query index object that processes a user query so as to produce a query result from comparison of the user query and the word index objects in response to a user query; wherein the query result identifies stored documents relevant to the user query. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24, 25, 26)
-
-
27. A method of executing an application program in a computer system having a central processing unit that controls processing in the computer system, a user interface, and a main memory having an operating system that supports an object oriented programming environment, the method comprising the steps of:
-
providing an object oriented framework that provides an extensible information retrieval system; and evaluating a user query by using the framework to compare information contained in the user query with information contained in object oriented programming, extensible index class objects of the framework having word index objects that map words contained in the stored documents to the documents that contain the words, wherein the framework further includes; a build index object that responds to a user build index request by processing a stored document and creating the word index objects such that they contain the document word mapping and document-related information produced by a preprocessing operation; a posting list class of objects that are pointed to by the word index objects and contain a frequency count that indicates the number of times a word appears in a word index object of a document; a document table class of objects that map a word index object to the indexed document from which it was preprocessed; and a query index object that processes a user query so as to produce a query result from comparison of the user query and the word index objects in response to a user query; a build index object that responds to a user build index request by processing a stored document and creating the word index objects such that they contain the document word mapping and document-related information produced by a preprocessing operation; a posting list class of objects that are pointed to by the word index objects and contain a frequency count that indicates the number of times a word appears in a word index object of a document; a document table class of objects that map a word index object to the indexed document from which it was preprocessed; and a query index object that processes a user query so as to produce a query result from comparison of the user query and the word index objects in response to a user query; wherein the query result identifies stored documents relevant to the user query. - View Dependent Claims (28, 29, 30, 31, 32, 33, 34)
-
Specification