Method and apparatus for integrating a dynamic lexicon into a full-text information retrieval system
First Claim
1. A method of reindexing a plurality of indices representative of information stored in an information retrieval system, the system including memory having a database stored therein, the database being logically divided to include the plurality of indices, an information database having information objects stored therein and a lexicon, the lexicon including a plurality of data items and groups of data items that appear in the information database, the method comprising the steps of:
- (a) maintaining a predetermined time variable representative of the last time the plurality of indices were reindexed;
(b) making a plurality of changes to the lexicon;
(c) attaching a time stamp to each one of the plurality of changes to the lexicon to indicate when the change was made to the lexicon;
(d) locating information objects in the information database that are affected by the plurality of changes to the lexicon;
(e) reindexing portions of the plurality of indices representative of the information objects located in step (d) to reflect the changes in the lexicon;
(f) updating the predetermined time variable to equal the value of the time stamp of a most recent one of the plurality of changes to indicate the changes to the lexicon that have been processed; and
(g) repeating steps (c)-(f) until all changes to the lexicon after the predetermined time have been applied to the plurality of indices.
3 Assignments
0 Petitions
Accused Products
Abstract
An information retrieval system including a plurality of indices representative of information stored in the information retrieval system and a dynamic lexicon is disclosed. The system includes memory having a database stored therein, the database being logically divided to include the plurality of indices, an information database having information objects stored therein and a dynamic lexicon which includes a plurality of data items and groups of data items that appear in the information database. A predetermined time variable represents the last time the plurality of indices were reindexed. After changes are made to the lexicon, a time stamp is attached to each one of the plurality of changes to the lexicon to indicate when the change was made to the lexicon. At some specified time interval later, the reindexing process is invoked. This process involves selecting a subset of the plurality of changes made to the lexicon after the predetermined time variable, locating all information objects in the information database that are affected by the plurality of changes to the lexicon, reindexing the portions of the plurality of indices representative of the information objects affected by the changes to the lexicon to reflect the changes in the lexicon, and then updating the predetermined time variable to indicate changes to the lexicon have been processed. The foregoing process is repeated until all changes to the lexicon after the predetermined time have been applied to the plurality of indices.
-
Citations
14 Claims
-
1. A method of reindexing a plurality of indices representative of information stored in an information retrieval system, the system including memory having a database stored therein, the database being logically divided to include the plurality of indices, an information database having information objects stored therein and a lexicon, the lexicon including a plurality of data items and groups of data items that appear in the information database, the method comprising the steps of:
-
(a) maintaining a predetermined time variable representative of the last time the plurality of indices were reindexed; (b) making a plurality of changes to the lexicon; (c) attaching a time stamp to each one of the plurality of changes to the lexicon to indicate when the change was made to the lexicon; (d) locating information objects in the information database that are affected by the plurality of changes to the lexicon; (e) reindexing portions of the plurality of indices representative of the information objects located in step (d) to reflect the changes in the lexicon; (f) updating the predetermined time variable to equal the value of the time stamp of a most recent one of the plurality of changes to indicate the changes to the lexicon that have been processed; and (g) repeating steps (c)-(f) until all changes to the lexicon after the predetermined time have been applied to the plurality of indices. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method of reindexing a plurality of indices representative of information stored in an information retrieval system, the system including memory having a database stored therein, the database being logically divided to include the plurality of indices, an information database having information objects stored therein and a lexicon, the lexicon including a plurality of data items and groups of data items that appear in the information database, the method comprising the steps of:
-
(a) maintaining a predetermined time variable representative of the last time the plurality of indices were reindexed; (b) making a plurality of changes to the lexicon; (c) attaching a time stamp to each one of the plurality of changes to the lexicon to indicate when the change was made to the lexicon; (d) selecting a subset of the plurality of changes made to the lexicon after the predetermined time variable; (e) locating information objects in the information database that are affected by the subset of the plurality of changes to the lexicon; (f) reindexing portions of the plurality of indices representative of the information objects located in step (e) to reflect the changes in the lexicon; (g) updating the predetermined time to equal the value of the time stamp of a most recent one of the plurality of changes in the selected subset to indicate the changes to the lexicon that have been processed; and (h) repeating steps (c)-(g) until all changes to the lexicon after the predetermined time have been applied to the plurality of indices. - View Dependent Claims (7, 8)
-
-
9. A method of maintaining a consistent view of a database while reindexing a plurality of indices representative of information stored in an information retrieval system, the system including memory having the database stored therein, the database being logically divided to include the plurality of indices, an information database having information objects stored therein and a lexicon, the lexicon including a plurality of data items and groups of data items that appear in the information database, the method comprising the steps of:
-
(a) maintaining a predetermined time variable representative of the last time the plurality of indices were reindexed; (b) attaching a plurality of first time stamps to each one of a plurality of entries in the lexicon and the plurality of indices, the plurality of first time stamps being representative of the time each entry was added to the lexicon and the plurality of indices; (c) making a plurality of changes to the lexicon in a non-destructive manner to maintain prior versions of each one of the plurality of entries in the lexicon; (d) attaching a plurality of second time stamps to each one of the plurality of changes to the lexicon and to the plurality of indices to indicate when the changes were made; (e) locating information objects in the information database that are affected by the plurality of changes to the lexicon; (f) reindexing portions of the plurality of indices representative of the information objects located in step (e) to reflect the changes in the lexicon in a non-destructive manner to maintain prior versions of each one of the plurality of entries in the plurality of indices; (g) processing a user query to retrieve information objects from the information database by accessing only those entries of the lexicon and the plurality of indices whose respective one of the plurality of the first and second time stamps is less than or equal to the predetermined time variable to provide the user with a consistent view of the lexicon and the plurality of indices; (h) updating the predetermined time variable to equal to the value of the time stamp of a most recent one of the plurality of changes to indicate the changes to the lexicon that have been processed; and (i) repeating steps (e), (f) &
(h) until all changes to the lexicon after the predetermined time have been applied to the plurality of indices.
-
-
10. An information retrieval system comprising:
-
a memory system; a database stored in said memory system, said database being logically divided to include a plurality of indices representative of information stored in an information database having information objects stored therein and a lexicon, the lexicon including a plurality of words and phrases that appear in the information database; and a computer having memory associated therewith, the computer being coupled to the memory system, the computer including; means for maintaining a predetermined time variable representative of the last time the plurality of indices were reindexed in said memory associated with said computer; means for making a plurality of changes to the lexicon and for attaching a time stamp to each one of the plurality of changes to indicate when the change was made to the lexicon; means for locating articles in the articles database that are affected by the plurality of changes to the lexicon; means for reindexing portions of the plurality of indices representative of the articles affected by the changes to the lexicon to reflect the changes in the lexicon; and means for updating the predetermined time variable to equal the value of the time stamp of a most recent one of the plurality of changes to indicate changes to the lexicon have been processed.
-
-
11. An information retrieval system comprising:
-
a memory system; a database stored in said memory system, said database being logically divided to include a plurality of indices representative of information stored in an information database having information objects stored therein and a lexicon, the lexicon including a plurality of words and phrases that appear in the information database; and a computer having memory associated therewith, the computer being coupled to the memory system, the computer including; means for maintaining a predetermined time variable representative of the last time the plurality of indices were reindexed in said memory associated with said computer; means for making a plurality of changes to the lexicon and for attaching a time stamp to each one of the plurality of changes to indicate when the change was made to the lexicon; means for selecting a subset of the plurality of changes made to the lexicon after the predetermined time variable; means for locating articles in the articles database that are affected by the plurality of changes to the lexicon; means for reindexing portions of the plurality of indices representative of the articles affected by the changes to the lexicon to reflect the changes in the lexicon; and means for updating the predetermined time variable to equal the value of the time stamp of a most recent one of the plurality of changes in the selected subset to indicate changes to the lexicon have been processed. - View Dependent Claims (12, 13)
-
-
14. An information retrieval system comprising:
-
a memory system; a database stored in said memory system, said database being logically divided to include a plurality of indices representative of information stored in an information database having information objects stored therein and a lexicon, the lexicon including a plurality of words and phrases that appear in the information database; and a computer having memory associated therewith, the computer being coupled to the memory system, the computer including; means for maintaining a predetermined time variable representative of the last time the plurality of indices were reindexed; means for attaching a plurality of first time stamps to each one of a plurality of entries in the lexicon and the plurality of indices, the plurality of first time stamps being representative of the time each entry was added to the lexicon and the plurality of indices; means for making a plurality of changes to the lexicon in a non-destructive manner to maintain prior versions of each one of the plurality of entries in the lexicon; means for attaching a plurality of second time stamps to each one of the plurality of changes to the lexicon and to the plurality of indices to indicate when the changes were made; means for locating information objects in the information database that are affected by the plurality of changes to the lexicon; means for reindexing portions of the plurality of indices representative of the information objects to reflect the changes in the lexicon in a non-destructive manner to maintain prior versions of each one of the plurality of entries in the plurality of indices; means for processing a user query to retrieve information objects from the information database by accessing only those entries of the lexicon and the plurality of indices whose respective one of the plurality of first and second time stamps is less than or equal to the predetermined time variable to provide the user with a consistent view of the lexicon and the plurality of indices; and means for updating the predetermined time variable to equal the value of the time stamp of a most recent one of the plurality of changes to indicate the changes to the lexicon that have been processed.
-
Specification