Method and apparatus for searching multiple data element type files
Abstract
A method and apparatus are disclosed for constructing and updating a searchable database from very large data collections that include diverse types of data. The data can thus be cataloged, data mined, indexed, parsed, cross-referenced and published for rapid location and free-text retrieval of data elements from terabytes of information or millions of elements. The method and apparatus enable full-text searches through an Internet browser, using very complex search criteria, without special skills. The full-text searches are performed on a searchable database developed from the data collections in a large data storage system, in which data type elements are identified, hyperlinked when possible, and stored in a retrievable format. In addition to the searchable database, a second database is constructed that shows graphically the relationships between the module currently being viewed and any other modules that it references or that reference it.
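The relationship database described at the end of the abstract can be pictured as a directed graph of module references. The following is only a minimal sketch under stated assumptions: the function names (`build_reference_graph`, `neighbors`), the input shape (a mapping from each module to the modules it references) and the module names are hypothetical and do not come from the patent.

```python
from collections import defaultdict


def build_reference_graph(references):
    """Build forward and reverse adjacency sets.

    `references` maps a module name to the modules it references
    (a hypothetical input shape chosen for this sketch).
    """
    outgoing = defaultdict(set)  # module -> modules it references
    incoming = defaultdict(set)  # module -> modules that reference it
    for module, targets in references.items():
        for target in targets:
            outgoing[module].add(target)
            incoming[target].add(module)
    return outgoing, incoming


def neighbors(module, outgoing, incoming):
    """Return what a viewer of `module` would see in both directions."""
    return {
        "references": sorted(outgoing.get(module, ())),
        "referenced_by": sorted(incoming.get(module, ())),
    }


# Illustrative module names only.
refs = {"PAYROLL": ["TAXCALC", "REPORT"], "BILLING": ["TAXCALC"]}
outgoing, incoming = build_reference_graph(refs)
print(neighbors("TAXCALC", outgoing, incoming))
```

Keeping both directions precomputed means the "referenced by" view needs no scan of the whole collection at display time.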
20 Claims
1. A method for constructing a searchable database from a collection of information having multiple data types, comprising the steps of:
a. identifying all data types within the collection;
b. identifying all data elements within each data type, each data element comprising a plurality of words;
c. removing non-printable characters from the identified data elements;
d. associating each data element with a unique identifier;
e. saving each data element;
f. hyperlinking all data elements containing a common word; and
g. indexing the data elements.
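Steps a through g of claim 1 can be illustrated with a small in-memory sketch. This is not the patented implementation. It assumes that the collection arrives as `(data_type, raw_text)` pairs, that "non-printable" means anything outside Python's ASCII `string.printable` set, that a "word" is a run of letters, digits or underscores, and that a "hyperlink" is simply a recorded association between element identifiers. All names (`build_database`, `strip_nonprintable`) are hypothetical.

```python
import re
import string
from collections import defaultdict
from itertools import count

PRINTABLE = set(string.printable)  # assumption: ASCII printable characters only


def strip_nonprintable(text):
    """Step c: drop every character outside the printable set."""
    return "".join(ch for ch in text if ch in PRINTABLE)


def build_database(collection):
    """Return (store, index, links) for an iterable of (data_type, raw_text)."""
    ids = count(1)
    store = {}                 # steps d and e: unique id -> saved element
    index = defaultdict(set)   # step g: word -> ids of elements containing it
    for data_type, raw in collection:          # steps a and b
        text = strip_nonprintable(raw)
        element_id = next(ids)
        store[element_id] = {"type": data_type, "text": text}
        for word in set(re.findall(r"[a-z0-9_]+", text.lower())):
            index[word].add(element_id)
    # Step f: associate every pair of elements that share at least one word.
    links = defaultdict(set)
    for members in index.values():
        for element_id in members:
            links[element_id] |= members - {element_id}
    return store, index, links


# Illustrative data types and contents only.
store, index, links = build_database([
    ("job", "RUN PAYROLL\x00 job"),
    ("program", "CALL payroll\x07"),
    ("doc", "unrelated notes"),
])
print(store[1]["text"], sorted(links[1]))
```

In this sketch the word index drives the linking step, so two elements are linked exactly when some index entry lists both of them.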
16. A method for updating a searchable database constructed from a collection of information having multiple data types as of a first point in time with the collection of data as of a second, later point in time, the first and second points in time defining a time interval, comprising the steps of:
a. at the first point in time
i. identifying all data types within the collection;
ii. identifying all data elements within each data type, each data element comprising a plurality of words;
iii. removing nonprintable characters from the identified data elements;
iv. associating each data element with a unique identifier;
v. hyperlinking all data elements containing a common word; and
vi. indexing the data elements; and
b. at the second point in time
i. identifying all data elements within each data type;
ii. removing nonprintable characters from the identified data elements;
iii. identifying those data elements that were removed from the collection during the time interval;
iv. identifying those data elements that were changed in the collection during the time interval;
v. identifying those data elements that were added to the collection during the time interval;
vi. associating each of the removed, changed and added data elements with a unique identifier;
vii. hyperlinking all removed, changed and added data elements containing a common word; and
viii. indexing the removed, changed and added data elements with the previously indexed data elements.
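The second-point-in-time steps of claim 16 amount to comparing two snapshots and folding the differences into the existing index. The sketch below is one possible reading, not the patent's stated mechanism. It assumes each snapshot is a dict from a stable element key (for example a path) to already-cleaned text, that the index is a `defaultdict(set)` mapping words to keys, and that "indexing" a removed element means retiring its old postings. The names `diff_snapshots`, `update_index` and `words` are hypothetical.

```python
import re
from collections import defaultdict


def words(text):
    """Assumed word definition: runs of letters, digits or underscores."""
    return set(re.findall(r"[a-z0-9_]+", text.lower()))


def diff_snapshots(old, new):
    """Classify keys as removed, changed or added between two snapshots."""
    removed = sorted(old.keys() - new.keys())
    added = sorted(new.keys() - old.keys())
    changed = sorted(k for k in old.keys() & new.keys() if old[k] != new[k])
    return removed, changed, added


def update_index(index, old, new):
    """Merge the differences into `index` (a defaultdict(set)) in place."""
    removed, changed, added = diff_snapshots(old, new)
    for key in removed + changed:      # retire postings built from old text
        for w in words(old[key]):
            index[w].discard(key)
            if not index[w]:
                del index[w]
    for key in changed + added:        # add postings for the current text
        for w in words(new[key]):
            index[w].add(key)
    return removed, changed, added


# Illustrative snapshots only.
old = {"a": "alpha beta", "b": "beta gamma", "c": "delta"}
new = {"a": "alpha beta", "b": "beta epsilon", "d": "zeta"}
index = defaultdict(set)
for key, text in old.items():
    for w in words(text):
        index[w].add(key)
print(update_index(index, old, new))
```

Only the removed, changed and added elements are touched, so unchanged elements keep their existing index entries.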
17. An apparatus for constructing a searchable database from a collection of information having multiple data types, comprising:
a. means for identifying all data types within the collection;
b. means for identifying all data elements within each data type, each data element comprising a plurality of words;
c. means for removing nonprintable characters from the identified data elements;
d. means for associating each data element with a unique identifier;
e. means for hyperlinking all data elements containing a common word; and
f. means for indexing the data elements.
18. An apparatus for updating a searchable database constructed from a collection of information having multiple data types as of a first point in time with the collection of data as of a second, later point in time, the first and second points in time defining a time interval, comprising:
a. at the first point in time
i. means for identifying all data types within the collection;
ii. means for identifying all data elements within each data type, each data element comprising a plurality of words;
iii. means for removing nonprintable characters from the identified data elements;
iv. means for associating each data element with a unique identifier;
v. means for hyperlinking all data elements containing a common word; and
vi. means for indexing the data elements; and
b. at the second point in time
i. means for identifying all data elements within each data type at the second point in time;
ii. means for removing nonprintable characters from the identified data elements;
iii. means for identifying those data elements that were removed from the collection during the time interval;
iv. means for identifying those data elements that were changed in the collection during the time interval;
v. means for identifying those data elements that were added to the collection during the time interval;
vi. means for associating each of the removed, changed and added data elements with a unique identifier;
vii. means for hyperlinking all removed, changed and added data elements containing a common word; and
viii. means for indexing the removed, changed and added data elements with the previously indexed data elements.
Specification