Automatic system for configuring to dynamic database search forms
Abstract
An automatic system and method for configuring to dynamic database search forms, facilitating the efficient submission of multiple queries to search engines. The system includes a computer system having storage for facilitating the retention of dynamic database content; a communications module for performing bi-directional communications with a network; a query input module for receiving a plurality of queries from a user and transferring the plurality of queries to a plurality of databases; an action string module for determining an appropriate data entry window for use in passing a query to the database; a results module for locating areas on a responsive page where results are placed; a next link module used to locate a link (URL) associated with additional results provided by the database; and an engine file module for storing results such that a general format query is translatable into a database-specific dialect.
43 Claims
1. A method for the automatic configuration of dynamic database search forms comprising:
obtaining a database listing containing the uniform resource locators (URLs) for each one of a plurality of databases to be configured;
accessing each one of said plurality of databases;
capturing a web page from each database of the plurality of databases associated with said URL;
locating data entry windows in said captured web page;
determining a probability score for each of the data entry windows, wherein each probability score represents a probability that a corresponding data entry window is associated with a query request mechanism;
selecting a most probable data entry window from among the data entry windows for passing queries to said at least one database, wherein the selecting is based on the probability score;
searching candidate responses for a next link indicating a next page for additional results from said at least one database in response to a query; and
writing an engine file describing the form layout and requirements based upon said candidate responses and said next link.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
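The "locating data entry windows" step of claim 1 can be illustrated with a minimal sketch. It assumes that a data entry window corresponds to an HTML `<form>` element and that the captured web page is available as a string; the class name `FormCollector`, the function `locate_forms`, and the sample page are hypothetical and do not come from the patent.

```python
from html.parser import HTMLParser


class FormCollector(HTMLParser):
    """Collects every <form> in a page with its action, method, and fields."""

    def __init__(self):
        super().__init__()
        self.forms = []
        self._current = None  # form currently being parsed, if any

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "form":
            self._current = {
                "action": attrs.get("action", ""),
                # HTML forms default to GET when no method is given.
                "method": (attrs.get("method") or "get").lower(),
                "inputs": [],
            }
        elif tag in ("input", "select", "textarea") and self._current is not None:
            self._current["inputs"].append(
                {"name": attrs.get("name", ""), "type": attrs.get("type", "text")}
            )

    def handle_endtag(self, tag):
        if tag == "form" and self._current is not None:
            self.forms.append(self._current)
            self._current = None


def locate_forms(page_source):
    """Return a list of candidate data entry windows found in the page."""
    collector = FormCollector()
    collector.feed(page_source)
    collector.close()
    return collector.forms


# Hypothetical captured page with two forms.
SAMPLE_PAGE = """
<html><body>
<form action="/subscribe" method="post"><input type="text" name="email"></form>
<form action="/cgi-bin/search"><input type="text" name="q">
<input type="submit" value="Search"></form>
</body></html>
"""

forms = locate_forms(SAMPLE_PAGE)
for form in forms:
    print(form["method"], form["action"], [field["name"] for field in form["inputs"]])
```

Each collected form would then be scored, and the highest-scoring one selected as the most probable data entry window.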
9. A method for the automatic configuration of dynamic database search forms comprising:
obtaining a database listing containing the uniform resource locators (URLs) for each one of a plurality of databases to be configured;
accessing each one of said plurality of databases;
capturing a web page from each database of the plurality of databases associated with said URL;
locating data entry windows in said captured web page;
determining a probability score for each of the data entry windows, wherein each probability score represents a probability that a corresponding data entry window is associated with a query request mechanism;
selecting a most probable data entry window from among the data entry windows for passing queries to said at least one database, wherein the selecting is based on the probability score;
searching candidate responses for a next link indicating a next page for additional results from said at least one database in response to a query; and
writing an engine file describing the form layout and requirements based upon said candidate responses and said next link;
wherein the step of locating data entry windows in said captured web page comprises:
saving information captured from the web page as a source version of the web page;
filtering said source version into additional listings of URLs and text portions;
examining said text portions for occurrences of a form label;
collecting each form tagged with the form label;
wherein the step of determining a probability score comprises:
scoring each one of said forms to develop a numerical representation of a likelihood that any one form is a query input form;
wherein the step of selecting a most probable data entry window from among the data entry windows comprises:
selecting one of said forms based on said form having a higher numerical representation than any other of said forms;
storing an action string associated with said form, said action string comprising a URL having at least a domain portion, a program portion, and a query portion;
storing a get-post indicator associated with said database;
wherein the step of scoring each one of said forms further comprises:
locating an action string associated with said data entry window;
obtaining a listing of bad action strings;
comparing said action string with said listing of bad action strings and determining if a portion of said action string matches any bad action strings of said listing of bad action strings, setting said numerical representation to zero and terminating said step of scoring if a portion of said action string matches any of said bad action strings within a predefined window determined by a binding factor;
setting at least one metric selected from the group consisting of: a name matching metric; an undesirable link text metric; an undesirable value metric; a desirable link text metric; and a null text metric; and
computing said numerical representation.
Dependent claims: 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21
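As a rough illustration of the action string handling in claim 9, the sketch below splits an action string into its domain, program, and query portions and zeroes the score when the string matches a bad action string. Several things here are assumptions rather than the claimed method: the bad-action check is a plain substring match (the "predefined window determined by a binding factor" is not modeled), the metrics are combined by simple summation, and all names, URLs, and metric values are hypothetical.

```python
from urllib.parse import urlsplit


def split_action_string(action_url):
    """Split an action string into domain, program, and query portions."""
    parts = urlsplit(action_url)
    return {"domain": parts.netloc, "program": parts.path, "query": parts.query}


def score_form(action_url, bad_action_strings, metrics):
    """Return 0 if the action string matches a bad action string;
    otherwise combine the metric values (summation is an assumption)."""
    lowered = action_url.lower()
    if any(bad.lower() in lowered for bad in bad_action_strings):
        return 0  # scoring terminates early for a bad action string
    return sum(metrics.values())


# Hypothetical inputs for illustration only.
BAD_ACTION_STRINGS = ["/login", "/subscribe"]
METRICS = {"name_matching": 2, "desirable_link_text": 1, "undesirable_value": -1}

portions = split_action_string("http://db.example/cgi-bin/search.cgi?q=test&lang=en")
good_score = score_form(
    "http://db.example/cgi-bin/search.cgi?q=test", BAD_ACTION_STRINGS, METRICS
)
bad_score = score_form("http://db.example/login.cgi", BAD_ACTION_STRINGS, METRICS)
print(portions, good_score, bad_score)
```

The form with the highest resulting score would be the one selected, for example with `max(forms, key=...)` over the scored candidates.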
22. A method for the automatic configuration of dynamic database search forms comprising:
obtaining a database listing containing the uniform resource locators (URLs) for each one of a plurality of databases to be configured;
accessing each one of said plurality of databases;
capturing a web page from the database associated with said URL;
locating data entry windows in said captured web page;
selecting a most probable data entry window of data entry windows in said captured web page based on a probability score of each data entry window for passing queries to said database;
searching candidate responses for a next link indicating a next page for additional results from said database in response to a query; and
writing an engine file describing the form layout and requirements based upon said candidate responses and said next link;
determining a location of each one of a plurality of results or non result item locations on a responsive web page where results from a query are posted;
selecting a plurality of validation queries;
submitting a first one of said plurality of validation queries to said database using said action string;
capturing a first responsive web page returned in response to said first one of said plurality of validation queries;
resubmitting said first one of said plurality of validation queries to said database using said action string;
capturing a second responsive web page returned in response to said second submission of said first one of said plurality of validation queries;
comparing said first responsive web page with said second responsive web page, wherein any differences between said first and second responsive web pages are extraneous responses and are ignored, storing any duplicates between said first and second responsive web pages as candidate responses to said validation query;
storing said candidate responses;
submitting a second one of said plurality of validation queries to said database using said action string;
capturing a responsive web page returned in response to said second validation query;
repeating submission of additional validation queries and capture of additional responsive web pages until all validation queries have been submitted;
comparing said first responsive web page to each of said additional responsive web pages, ignoring any duplicates between said first responsive and additional responsive web pages as extraneous responses, storing any differences between said first responsive and said additional responsive web pages as candidate responses to said validation query;
comparing each one of said responsive web pages to all other said responsive web pages, ignoring any duplicates between said responsive web pages as extraneous responses, storing any differences between said responsive web pages as candidate responses to said validation query; and
searching candidate responses for a next link indicating a next page for additional results from said database in response to said query.
Dependent claims: 23, 24, 25, 26, 27, 28, 29, 30, 31
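The two comparison passes in claim 22 can be sketched as follows. The sketch assumes each responsive web page has already been reduced to a list of text items (one per line or element), which is an assumption of this example; the function names and the sample pages are hypothetical. Submitting the same query twice keeps only what is stable (differences such as rotating advertisements are extraneous), and comparing against pages for other queries drops what is common to all of them (page chrome), leaving query-specific items as candidate responses.

```python
def stable_items(first_page, second_page):
    """Same query submitted twice: keep items present in both pages.
    Items that differ between the two pages are treated as extraneous."""
    second = set(second_page)
    return [item for item in first_page if item in second]


def candidate_responses(stable_first, other_pages):
    """Different queries: drop items that also appear on pages returned
    for other queries; what remains is specific to the first query."""
    seen_elsewhere = set()
    for page in other_pages:
        seen_elsewhere.update(page)
    return [item for item in stable_first if item not in seen_elsewhere]


# Hypothetical pages: the first validation query submitted twice, then a second query.
page_a1 = ["Home | About", "Ad: buy widgets", "Result: alpha one",
           "Result: alpha two", "Next: /search?q=alpha&page=2"]
page_a2 = ["Home | About", "Ad: cheap flights", "Result: alpha one",
           "Result: alpha two", "Next: /search?q=alpha&page=2"]
page_b = ["Home | About", "Ad: buy widgets", "Result: beta one",
          "Next: /search?q=beta&page=2"]

stable = stable_items(page_a1, page_a2)
candidates = candidate_responses(stable, [page_b])
print(candidates)
```

In this example the next link survives both passes because its URL contains the query term, so it remains among the candidate responses that are later searched for a next link.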
32. A system for the automatic configuration of dynamic database search forms comprising:
a computer system having a storage means for facilitating the retention and recall of dynamic database content, said computer system having a communications means for performing bi-directional communications between said computer system and a network;
a query input means for receiving a plurality of queries from a user and transferring the plurality of queries to a plurality of databases;
an action string module interfaced to said computer system and configured to automatically determine a format associated with an entry page for a database from said entry page, said action string module being configured to automatically determine an appropriate data entry window on said entry page for use in passing a query to said database;
a results module interfaced to said computer system and said action string module, said results module locating areas on a responsive page returned by said database in response to said query where results are placed;
a next link module interfaced to each one of said computer system, action string module, and results module, said next link module locating a link associated with additional results provided by said database in response to said query;
an engine file module interfaced to said computer system and said modules for storing results produced by each module such that a general format query is translatable into a database specific format allowing a common query to be submitted to multiple databases each requiring different formats.
Dependent claims: 33, 34
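To make the role of the engine file concrete, the sketch below shows one way a stored per-database description could be used to translate a general format query into a database specific request. The JSON layout, field names (`action`, `method`, `query_field`, `fixed_params`), and example hosts are all hypothetical; the claim does not specify the engine file format.

```python
import json
from urllib.parse import urlencode

# Hypothetical engine descriptions for two databases with different formats.
ENGINES = [
    {"name": "db-one", "action": "http://db-one.example/search",
     "method": "get", "query_field": "q", "fixed_params": {"lang": "en"}},
    {"name": "db-two", "action": "http://db-two.example/cgi-bin/find",
     "method": "post", "query_field": "searchterm", "fixed_params": {}},
]


def build_request(engine, query):
    """Translate a general query into the request this database expects."""
    params = dict(engine.get("fixed_params", {}))
    params[engine["query_field"]] = query
    encoded = urlencode(params)
    if engine["method"] == "get":
        return {"method": "GET", "url": engine["action"] + "?" + encoded, "body": None}
    return {"method": "POST", "url": engine["action"], "body": encoded}


# Round-trip through JSON text to mimic writing and re-reading an engine file.
engine_file_text = json.dumps(ENGINES, indent=2)
loaded_engines = json.loads(engine_file_text)

requests_out = [build_request(engine, "solar panels") for engine in loaded_engines]
for request in requests_out:
    print(request)
```

The same query string is submitted to both databases, while the field name and the GET or POST choice come from each stored description.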
35. A system for the automatic configuration of dynamic database search forms comprising:
a computer system having a storage means for facilitating the retention and recall of dynamic database content, said computer system having a communications means for performing bi-directional communications between said computer system and a network;
a query input means for receiving a plurality of queries from a user and transferring the plurality of queries to a plurality of databases;
an action string module interfaced to said computer system for determining a format associated with an entry page for a database, said action string module being for determining an appropriate data entry window for use in passing a query to said database;
a results module interfaced to said computer system and said action string module, said results module locating areas on a responsive page returned by said database in response to said query where results are placed;
a next link module interfaced to each one of said computer system, action string module, and results module, said next link module locating a link associated with additional results provided by said database in response to said query;
an engine file module interfaced to said computer system and every other module for storing results produced by each module such that a general format query is translatable into a database specific format allowing a common query to be submitted to multiple databases each requiring different formats;
a data comparison portion providing user specific information to each of said modules for facilitating analysis of said databases;
wherein said data comparison portion further comprises:
a database listing providing a URL for each of said databases to be analyzed;
a bad action string listing providing URLs for known databases which are not to be included in the analysis of said databases;
a desirable text link listing providing a plurality of desirable terms for use in analysis of said databases, a presence of any one of said plurality of desirable terms increasing a score associated with a data entry window on one of said responsive pages;
an undesirable text link listing providing a plurality of undesirable terms for use in analysis of said databases, a presence of any one of said plurality of undesirable terms setting a score associated with a data entry window on one of said responsive pages to 0 and ending analysis of said data entry window; and
an undesirable value listing providing a plurality of undesirable values for use in analysis of said databases, a presence of any one of said plurality of undesirable values decreasing a score associated with a data entry window on one of said responsive pages.
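The effect of the three term listings in claim 35 on a data entry window's score might look like the sketch below. The baseline score of 1, the step size of 1 per match, and all listing contents are assumptions made for illustration; the claim states only the direction of each adjustment.

```python
def score_data_entry_window(text, values, desirable_terms,
                            undesirable_terms, undesirable_values):
    """Score a data entry window from its surrounding text and field values."""
    lowered = text.lower()
    # An undesirable term sets the score to 0 and ends analysis.
    for term in undesirable_terms:
        if term in lowered:
            return 0
    score = 1  # hypothetical baseline
    # Each desirable term present increases the score.
    score += sum(1 for term in desirable_terms if term in lowered)
    # Each undesirable value present decreases the score.
    score -= sum(1 for value in values if value.lower() in undesirable_values)
    return score


# Hypothetical listings.
DESIRABLE_TERMS = ["search", "find"]
UNDESIRABLE_TERMS = ["subscribe", "login"]
UNDESIRABLE_VALUES = {"sign up"}

search_score = score_data_entry_window(
    "Search our catalog", ["Go"], DESIRABLE_TERMS, UNDESIRABLE_TERMS, UNDESIRABLE_VALUES)
newsletter_score = score_data_entry_window(
    "Subscribe to newsletter", ["Go"], DESIRABLE_TERMS, UNDESIRABLE_TERMS, UNDESIRABLE_VALUES)
mixed_score = score_data_entry_window(
    "Find a member", ["Sign Up"], DESIRABLE_TERMS, UNDESIRABLE_TERMS, UNDESIRABLE_VALUES)
print(search_score, newsletter_score, mixed_score)
```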
36. A system for the automatic configuration of dynamic database search forms comprising:
a computer system having a storage means for facilitating the retention and recall of dynamic database content, said computer system having a communications means for performing bi-directional communications between said computer system and a network;
a query input means for receiving a plurality of queries from a user and transferring the plurality of queries to a plurality of databases;
an action string module interfaced to said computer system for determining a format associated with an entry page for a database, said action string module being for determining an appropriate data entry window for use in passing a query to said database;
a results module interfaced to said computer system and said action string module, said results module locating areas on a responsive page returned by said database in response to said query where results are placed;
a next link module interfaced to each one of said computer system, action string module, and results module, said next link module locating a link associated with additional results provided by said database in response to said query;
an engine file module interfaced to said computer system and every other module for storing results produced by each module such that a general format query is translatable into a database specific format allowing a common query to be submitted to multiple databases each requiring different formats;
a database listing providing a URL for each of said databases to be analyzed;
a bad action string listing providing URLs for known databases which are not to be included in the analysis of said databases;
a desirable text link listing providing a plurality of desirable terms for use in analysis of said databases, a presence of any one of said plurality of desirable terms increases a score associated with a data entry window on one of said responsive pages;
an undesirable text link listing providing a plurality of undesirable terms for use in analysis of said databases, a presence of any one of said plurality of undesirable terms sets a score associated with a data entry window on one of said responsive pages to 0 and ends analysis of said data entry window; and
an undesirable value listing providing a plurality of undesirable values for use in analysis of said databases, a presence of any one of said plurality of undesirable values decreases a score associated with a data entry window on one of said responsive pages;
a next link listing providing said next link module with a plurality of candidate terms for facilitating selection of a URL associated with a link to additional responses provided by said database in response to said query.
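The next link listing of claim 36 suggests a term-driven search over the links on a responsive page. The sketch below is one possible reading: it collects anchors from the page and returns the URL of the first anchor whose text contains a candidate term, trying the terms in listing order. The class and function names, the candidate terms, and the sample page are hypothetical.

```python
from html.parser import HTMLParser


class AnchorCollector(HTMLParser):
    """Collects (href, link text) pairs for every <a href=...> in a page."""

    def __init__(self):
        super().__init__()
        self.anchors = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.anchors.append((self._href, "".join(self._text).strip()))
            self._href = None


def find_next_link(page_source, candidate_terms):
    """Return the URL of the first anchor whose text contains a candidate term."""
    collector = AnchorCollector()
    collector.feed(page_source)
    collector.close()
    for term in candidate_terms:
        for href, text in collector.anchors:
            if term.lower() in text.lower():
                return href
    return None


# Hypothetical responsive page and next link listing.
RESULTS_PAGE = (
    '<a href="/help">Help</a> '
    '<a href="/results/alpha/page/1">Previous</a> '
    '<a href="/results/alpha/page/2">Next 10 results</a>'
)
NEXT_LINK_TERMS = ["next", "more results"]

next_url = find_next_link(RESULTS_PAGE, NEXT_LINK_TERMS)
missing = find_next_link('<a href="/help">Help</a>', NEXT_LINK_TERMS)
print(next_url, missing)
```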
37. A method for automatic configuration of dynamic search forms for a database, said method comprising:
accessing a web page from a web site providing access to the database;
capturing said web page;
locating a plurality of data entry windows in said captured web page;
determining, for each data entry window, a probability that the data entry window is associated with a query request mechanism;
selecting, based on the probability that a data entry window is associated with a query request mechanism, a most probable data entry window from among the data entry windows for passing queries to said at least one database;
storing an identification of said most probable data entry window in association with an identification of said database for use in submitting queries to said database.
Dependent claims: 38, 39, 40, 41, 42, 43
Specification