Database system and method for data acquisition and perusal
Abstract
A data acquisition and perusal system and method including a database selection module, a database index generator module and a search module. The database selection module enables selection of a plurality of files for inclusion into at least one selectable database. The database index generator module enables generation of a searchable index of the data contained in the selectable database. The search module enables a search to be performed of the searchable index according to search criteria. The data acquisition and perusal system and method may also allow users to view, acquire, and generate single or multiple data sources locally or remotely, and allow users to compile, index, modify, and append the data sources according to default or user-defined criteria. The data acquisition and perusal system and method may also selectively acquire and display data contained within remote databases depending upon the user's access permissions to such databases. Such a system allows for the capture of HTML data, which is automatically indexed without human intervention, and can automatically and accurately locate, or “pinpoint,” and highlight specific text or groups of text designated by the user within the resulting database. Such a system contains a link module that enables custom links to be defined between selected terms of selected files of the selectable database; the searchable index incorporates the custom links and includes only valid links.
150 Claims
1. A data acquisition and perusal system, comprising:
a database selection module that enables selection of a plurality of files including HTML files for inclusion into at least one selectable database;
a link module that enables custom links to be defined between selected terms of selected files of the at least one database;
a database index generator module that enables generation of a searchable index of the data contained in the at least one selectable databases including the custom links, the generator module enabling only valid custom links to be added to the searchable index;
wherein the searchable index stores the word locations of words in the database including word locations of words in the HTML files that can be used to identify and highlight words in HTML files when those files are displayed in HTML format; and
a search module that enables a search of the searchable index to be performed according to a search criterion.
3. (cancelled).
6. (cancelled).
8-9. (cancelled).
18. (cancelled).
20-51. (cancelled).
52. A data acquisition and perusal method for finding, storing and retrieving useful information, comprising the steps of:
locating a plurality of accessible HTML files according to a selected search criteria;
selecting a plurality of the located files containing relevant information for automatic inclusion into at least one selectable database;
defining custom linking relationships between selected terms and designated files of the selected database;
verifying the validity of the custom linking relationships;
generating a searchable index of the data contained in the selected database including the custom linking relationships so that the searchable index includes only valid custom linking relationships;
searching the searchable index according to a selected search criterion to locate words and phrases in the data; and
accurately highlighting the located terms and phrases while displaying at least portions of the HTML files in accordance with formatting control tags of the HTML files.
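The verification and indexing steps of claim 52 can be illustrated with a minimal Python sketch. The claim does not define what makes a custom link "valid", so the rule used here is an assumption: both files must be in the selected database and the linked term must occur in the source file. The function name `build_index`, the tuple layout of a link, and the single-word restriction on terms are likewise hypothetical.

```python
import re
from collections import defaultdict


def build_index(files, custom_links):
    """Build a word index plus the subset of custom links judged valid.

    files: dict mapping a file name to its text.
    custom_links: iterable of (term, source_file, target_file) tuples.
    Validity rule (an assumption, not taken from the claims): source and
    target must both be in `files`, and `term` must occur in the source.
    """
    words = defaultdict(list)
    for name, text in files.items():
        # Record every word with the file it came from and its character offset.
        for m in re.finditer(r"\w+", text.lower()):
            words[m.group(0)].append((name, m.start()))

    valid = [
        (term, source, target)
        for term, source, target in custom_links
        if source in files
        and target in files
        and any(f == source for f, _ in words.get(term.lower(), []))
    ]
    return {"words": dict(words), "links": valid}


files = {"a.txt": "Solar power notes", "b.txt": "Power grid"}
links = [
    ("solar", "a.txt", "b.txt"),        # kept: term and both files exist
    ("solar", "a.txt", "missing.txt"),  # dropped: target not in database
    ("grid", "a.txt", "b.txt"),         # dropped: term absent from source
]
index = build_index(files, links)
```

Filtering before the links are stored means a later search never has to handle a dangling link, which is one plausible reading of "includes only valid custom linking relationships".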
53-55. (cancelled).
57. (cancelled).
59-60. (cancelled).
65-66. (cancelled).
69-71. (cancelled).
74. A data acquisition and perusal method, comprising the steps of:
selecting a plurality of files, including HTML type format, for inclusion into at least one selectable database;
generating a searchable index of the plurality of selected files included in the selectable database;
wherein the searchable index stores the word locations of words in the database, including word locations of words in the HTML files that can be used to identify and highlight words in HTML files when those files are displayed in HTML format; and
searching the searchable index according to search criteria;
said data selection, database index generator and search steps allowing users to view and acquire single or multiple data sources locally and remotely, and allowing users to compile, index, modify, and append the data sources locally and remotely according to default or user defined criteria.
76. (cancelled).
78. A data indexing and perusal system comprising:
an index module that enables generation of a searchable index of a plurality of selected source files, including HTML files;
wherein the searchable index stores word locations of words in the source files, including the word locations of words in HTML files that can be used to identify and highlight words in HTML files while those files are displayed in HTML format;
a search module that enables a search to be performed of the index according to a search criterion to locate words and phrases in the plurality of selected source files;
a display utility that displays at least portions of files in the plurality of selected source files that meet the search criterion; and
an annotation module that enables users to generate annotations of the plurality of selected source files, the annotations being displayable by the display utility.
88. A data indexing and perusal system comprising:
an index module that enables generation of an index of a plurality of selected source files;
a custom link module that enables a user to create links between two of the plurality of selected source files; and
a search module that enables a search to be performed according to a search criterion to locate words and phrases in the plurality of selected source files.
94. A data indexing and perusal system comprising:
an index module that enables generation of a searchable index of a plurality of HTML files;
a search module that enables a search to be performed of the index according to a search criterion to locate words and phrases in the plurality of HTML files; and
a browser for displaying the HTML files that meet the search criterion and which utilizes word locations retrieved from an index of word locations to visually distinguish the searched words and phrases from any surrounding text in the displayed HTML files.
95. A data indexing and perusal system comprising:
a display module operable to display a web page comprising a plurality of links to displayable web elements from the group consisting of web pages, text, images, and graphics, wherein the web page and web elements have original source internet addresses;
a selection module that enables saving of the web page and selective saving of the linked elements to a local computer data storage device, thereby providing the selectively saved linked elements with local addresses;
the selection module being configured to automatically modify the web page's links to the selectively saved linked elements so that they point to the selectively saved linked elements' local addresses;
an index module that enables generation of a searchable index of the saved web page and the selectively saved linked elements; and
a search module that enables a search to be performed of the index according to a search criterion.
97. A method of annotating, indexing, searching, and displaying a plurality of selected source files, the method comprising:
enabling users to generate custom annotations of the plurality of selected source files;
generating a searchable index of the plurality of selected source files;
incorporating any user-generated custom annotations into the index;
searching the searchable index according to a search criterion to locate words and phrases in the plurality of selected source files; and
displaying at least portions of files in the plurality of selected source files that meet the search criterion.
104. A method of linking, indexing, and searching a plurality of selected source files, the method comprising:
enabling users to create custom links between two or more of the plurality of selected source files;
generating a searchable index of the plurality of selected source files;
incorporating any user-created custom links into the index; and
searching the searchable index according to a search criterion to locate words and phrases in the plurality of selected source files.
110. A method of storing, indexing, and searching information from the internet, the method comprising:
displaying a web page comprising a plurality of links to displayable web elements from the group consisting of web pages, text, images, and graphics, wherein the web page and web elements have original source internet addresses;
selecting at least one of the linked elements for indexing;
saving the web page and the selected linked elements as separate and distinct files to a local computer data storage device, thereby providing the web page and selected linked elements with local addresses;
automatically modifying the web page's links to the selected linked elements so that they point to the selected linked elements' local addresses;
generating a searchable index of the web page and selected link elements; and
searching the index according to a search criterion.
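The link-modification step of claim 110 can be sketched in Python as follows. This is an illustration only: the function name `localize_links`, the `saved` mapping from original internet address to local address, and the restriction to double-quoted `href` and `src` attributes are all assumptions, and a production tool would use a real HTML parser rather than a regular expression.

```python
import re

# Matches double-quoted href/src attributes only; a simplification for illustration.
ATTR = re.compile(r'\b(href|src)\s*=\s*"([^"]*)"', re.IGNORECASE)


def localize_links(html, saved):
    """Rewrite links to elements that were saved locally; leave the rest untouched.

    saved: dict mapping an original address to the local address of its saved copy.
    """
    def swap(match):
        attr, url = match.group(1), match.group(2)
        # Elements the user did not select keep their original address.
        return f'{attr}="{saved.get(url, url)}"'

    return ATTR.sub(swap, html)


page = (
    '<a href="http://example.com/a.html">A</a>'
    '<img src="http://example.com/i.png">'
    '<a href="http://example.com/b.html">B</a>'
)
saved = {
    "http://example.com/a.html": "saved/a.html",
    "http://example.com/i.png": "saved/i.png",
}
local_page = localize_links(page, saved)
```

Because only the selected elements appear in `saved`, the unselected link to `b.html` still points at its original source address, matching the "selective saving" language of the claims.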
112. An internet browser application that acts as a client to a remote web server and displays HTML files having formatting control tags in a graphical user interface in accordance with the formatting control tags of the HTML files, the internet browser application comprising:
a database selection module that enables a plurality of HTML files to be downloaded and saved into at least one selectable database;
a database index generation module that enables generation of a common searchable index of all files downloaded and saved into a selected database; and
a search module that enables a search to be performed of an index generated by the database index generation module according to a search criterion to locate words and phrases in the files of the selected database.
118. In an internet browser application that acts as a client to a remote web server and displays HTML files having formatting control tags in a graphical user interface in accordance with the formatting control tags of the HTML files, the improvement comprising:
an index generation module that enables generation of an index of an HTML file displayed in the browser, the index being separate from the downloaded and saved files.
120. In an internet browser application that acts as a client to a remote web server and displays HTML files having formatting control tags in a graphical user interface in accordance with the formatting control tags of the HTML files, the improvement comprising:
a database selection module that enables a plurality of HTML files to be downloaded and saved into at least one selectable database; and
a search module that enables a search to be performed to locate words and phrases in a plurality of HTML files downloaded and saved in a selectable database;
wherein the graphical user interface displays at least portions of the HTML files that meet the search criterion in accordance with the formatting control tags of the HTML files.
121. In an internet browser application that acts as a client to a remote web server and displays HTML files having formatting control tags in a graphical user interface in accordance with the formatting control tags of the HTML files, and which includes a search module enabling a search to be performed to locate words and phrases in a displayed HTML file according to a search criterion, the improvement comprising:
an index of word locations of words in the displayed HTML file;
wherein the internet browser application uses retrieved word locations of words in the HTML files that meet the search criterion in order to highlight the words in the HTML files that meet the search criterion while displaying the HTML file in accordance with the formatting control tags of the HTML files.
122. A method of identifying a search term in an HTML file comprising:
accessing an index that identifies a word location, relative to the beginning of the HTML file, of the search term within the HTML file;
displaying the HTML file in an HTML viewer; and
highlighting characters in the HTML file at the identified word location.
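The highlighting step of claim 122 can be sketched as below. The sketch assumes the word location counts visible characters from the beginning of the file (tags excluded) and that highlighting is done by wrapping the characters in `<mark>` elements; both choices, and the function name `highlight_at`, are assumptions for illustration rather than the claimed implementation.

```python
import re

# Split HTML into tags and runs of visible text (simplified: ignores
# comments, scripts, and character entities).
TOKEN = re.compile(r"<[^>]*>|[^<]+")


def highlight_at(html, location, length):
    """Wrap `length` visible characters starting at visible offset `location` in <mark>."""
    out = []
    visible = 0  # number of visible characters consumed so far
    for token in TOKEN.findall(html):
        if token.startswith("<"):
            out.append(token)  # tags pass through and do not advance the offset
            continue
        # Portion of the highlight range that falls inside this text run.
        start = max(location - visible, 0)
        end = min(location + length - visible, len(token))
        if start < end:
            out.append(token[:start] + "<mark>" + token[start:end] + "</mark>" + token[end:])
        else:
            out.append(token)
        visible += len(token)
    return "".join(out)


marked = highlight_at("<p>Hello <b>big</b> world</p>", 6, 3)
```

Because the offset is tracked across text runs, the existing formatting tags stay in place and the file can still be displayed in HTML format with the word highlighted.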
127. A method of generating an index of word locations of an HTML file having visible characters and control tags, wherein said word locations can be used to pinpoint and highlight a word in a HTML browser displaying the HTML file, the method comprising:
identifying the lengths of contiguous blocks of visible characters in the HTML file that are not interrupted by control tags;
for each control tag encountered in the HTML file, determining whether the control tag is an incrementing control tag, wherein an incrementing control tag causes an HTML viewer to advance a file position pointer when the control tag is encountered; and
for a word in the HTML file, generating a number identifying the location of the word in the HTML file by adding an amount by which any incrementing control tags preceding the word advance the file position pointer to the lengths of contiguous blocks of visible characters preceding the word.
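The three steps of claim 127 can be sketched in Python. The claim does not say which control tags are "incrementing" or by how much they advance the file position pointer, so the `INCREMENTING_TAGS` table below is purely hypothetical, as are the function name `index_word_locations` and the decision to count only opening tags. Comments, scripts, and character entities are ignored for brevity.

```python
import re

# Hypothetical table: which opening tags advance the viewer's position pointer,
# and by how much. The real values depend on the HTML viewer being targeted.
INCREMENTING_TAGS = {"br": 1, "p": 1}

TOKEN = re.compile(r"<[^>]*>|[^<]+")               # a tag, or a run of visible text
OPEN_TAG = re.compile(r"<\s*([A-Za-z][A-Za-z0-9]*)")  # opening tags only
WORD = re.compile(r"\w+")


def index_word_locations(html):
    """Return (word, location) pairs for every word in the visible text.

    location = lengths of the visible-character blocks preceding the word
               + pointer advances of the incrementing tags preceding the word.
    """
    position = 0
    index = []
    for token in TOKEN.findall(html):
        if token.startswith("<"):
            match = OPEN_TAG.match(token)
            name = match.group(1).lower() if match else ""
            # Non-incrementing tags (and closing tags) contribute nothing.
            position += INCREMENTING_TAGS.get(name, 0)
        else:
            for m in WORD.finditer(token):
                index.append((m.group(0), position + m.start()))
            position += len(token)  # length of this contiguous visible block
    return index


locations = index_word_locations("<p>Hello <b>big</b> world</p>")
```

With the table above, `<p>` advances the pointer by one and `<b>` by nothing, so "Hello" lands at 1, "big" at 7, and "world" at 11.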
128. A computer software apparatus comprising:
a user interface that enables a user to enter a Boolean search expression for searching an HTML file, the Boolean search expression comprising a first text string expression, a second text string expression, and a Boolean operator on the first and second text string expressions;
a search module operable to execute the Boolean search expression on the HTML file; and
an HTML viewer adapted to highlight words in the HTML file corresponding to the Boolean search expression while displaying the HTML file in HTML format.
129. A computer software apparatus comprising:
a user interface that enables a user to enter a proximity search expression for searching an HTML file;
a search module operable to execute the proximity search expression on the HTML file; and
an HTML viewer adapted to highlight words in the HTML file corresponding to the proximity search expression while displaying the HTML file in HTML format.
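A proximity search of the kind recited in claim 129 can be illustrated with a short Python sketch. The claims do not specify the syntax or semantics of a proximity expression, so this assumes the common "two terms within N words of each other" form and operates on the visible text of the file; the function name `proximity_matches` is hypothetical.

```python
import re


def proximity_matches(text, first, second, max_distance):
    """Return (i, j) word-index pairs where `first` is at word i, `second` is at
    word j, and the two are at most `max_distance` words apart."""
    words = [w.lower() for w in re.findall(r"\w+", text)]
    firsts = [i for i, w in enumerate(words) if w == first.lower()]
    seconds = [j for j, w in enumerate(words) if w == second.lower()]
    return [(i, j) for i in firsts for j in seconds if abs(i - j) <= max_distance]


text = "the index stores word locations so the viewer can highlight each word"
near = proximity_matches(text, "word", "highlight", 3)
```

The returned word indices identify which occurrences satisfy the expression, which is the information a viewer would need to highlight only the matching words.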
130. A computer software apparatus comprising:
a user interface that enables a user to enter a conflation search expression for searching an HTML file;
a search module operable to execute the conflation search expression on the HTML file; and
an HTML viewer adapted to highlight words in the HTML file corresponding to the conflation search expression while displaying the HTML file in HTML format.
131. A method of displaying words in an HTML file satisfying a Boolean search query performed on the HTML file, wherein the Boolean search expression comprises a first text string expression, a second text string expression, and a Boolean operator on the first and second text string expressions, the method comprising:
for each word satisfying the Boolean search query, accessing an index that identifies the word location, relative to the beginning of the HTML file, of that word;
displaying the HTML file in an HTML viewer; and
highlighting characters in the HTML file at the identified word location.
136. A method of displaying words in an HTML file satisfying a proximity search query performed on the HTML file, the method comprising:
for each word satisfying the proximity search query, accessing an index that identifies the word location, relative to the beginning of the HTML file, of that word;
displaying the HTML file in an HTML viewer; and
highlighting characters in the HTML file at the identified word location.
141. A method of displaying words in an HTML file satisfying a conflation search query performed on the HTML file, the method comprising:
for each word satisfying the conflation search query, accessing an index that identifies the word location, relative to the beginning of the HTML file, of that word;
displaying the HTML file in an HTML viewer; and
highlighting characters in the HTML file at the identified word location.
146. A computer software apparatus comprising:
a user interface that enables a user to enter a search expression for searching an HTML file;
a search module operable to execute the search expression on the HTML file;
an index that identifies a word location, relative to the beginning of the HTML file, of any matching search term within the HTML file; and
an HTML viewer adapted to use the word location to highlight any matching search term in the HTML file corresponding to the search expression while displaying the HTML file in HTML format.
Specification