Parallel loading of markup language data files and documents into a computer database
Abstract
An XML Data Loader which employs the SAX parsing technology, provides an extendible architecture to generate SQL for each type of XML document to be loaded, and leverages multithreading and decoupling of processes to parse an XML file and to update a database. By using object-oriented programming methodologies, the system and method create new instances of the XML data loader for each XML file to be loaded into a database. Thus, many XML data files may be processed and loaded simultaneously, which minimizes system memory requirements, improves system reliability and memory management, and reduces processing time required from the start of processing an XML file to completing its loading into a database. The invention is applicable to other types of markup language documents, as well.
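As a rough illustration of the "new instance per XML file" idea in the abstract, the sketch below parses several documents at once, giving each one its own SAX handler instance so that no state is shared between files. It is not taken from the patent: the names `ElementLogger`, `load_document`, and `load_all` are hypothetical, and Python's standard `xml.sax` module stands in for whatever SAX implementation the inventors used.

```python
import xml.sax
from concurrent.futures import ThreadPoolExecutor


class ElementLogger(xml.sax.ContentHandler):
    """Hypothetical per-document handler: records element names as SAX reports them."""

    def __init__(self):
        super().__init__()
        self.seen = []

    def startElement(self, name, attrs):
        # SAX calls this for each element as it is encountered;
        # no document tree is built in memory.
        self.seen.append(name)


def load_document(xml_text):
    """Parse one document with a fresh handler instance (no shared state)."""
    handler = ElementLogger()
    xml.sax.parseString(xml_text.encode("utf-8"), handler)
    return handler.seen


def load_all(documents, workers=4):
    """Process several documents concurrently, one handler instance each."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map returns results in the same order as the inputs.
        return list(pool.map(load_document, documents))
```

Calling `load_all(["<a><b/></a>", "<x><y/></x>"])` returns one list of element names per input document. A thread pool is only one possible way to run the instances side by side; the abstract does not prescribe a particular mechanism.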
17 Claims
1. A method for loading markup language data files and documents into a computer database, said markup language data files containing data items and elements, said database having an application programming interface for modifying contents of the database through database command statements, said method comprising the steps of:
parsing a markup language data file one element at a time according to a format description, each of said elements being immediately available for further processing without waiting for subsequent elements to be parsed;
generating one or more database command statements immediately responsive to the availability of said elements such that said command generation and parsing occur substantially simultaneously;
storing each of said generated database command statements in a database command statement queue immediately upon their generation; and
executing said queued database command statements such that a database is updated substantially simultaneously with said step of parsing a data file in order to achieve loading of data from said markup language data file into a database.
Dependent claims: 2, 3, 4, 5
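The four steps of claim 1 (parse element by element, generate statements, queue them, execute them while parsing continues) can be sketched as a producer-consumer pipeline. This is a minimal sketch under stated assumptions, not the patented implementation: the `items` table, the `<item sku="..." qty="...">` format, and the names `StatementGenerator`, `run_updater`, and `load_xml` are all invented for illustration, and SQLite stands in for the target database.

```python
import queue
import sqlite3
import threading
import xml.sax

SENTINEL = None  # marks the end of the statement stream


class StatementGenerator(xml.sax.ContentHandler):
    """Hypothetical generator: emits an INSERT as soon as SAX reports an <item>."""

    def __init__(self, statements):
        super().__init__()
        self.statements = statements

    def startElement(self, name, attrs):
        if name == "item":
            # Queue a parameterized statement: (SQL text, bound values).
            self.statements.put((
                "INSERT INTO items (sku, qty) VALUES (?, ?)",
                (attrs["sku"], int(attrs["qty"])),
            ))


def run_updater(db_path, statements):
    """Consumer thread: executes queued statements while parsing continues."""
    conn = sqlite3.connect(db_path)
    while True:
        entry = statements.get()
        if entry is SENTINEL:
            break
        sql, params = entry
        conn.execute(sql, params)
    conn.commit()
    conn.close()


def load_xml(xml_text, db_path):
    """Parse in the calling thread; update the database in a second thread."""
    statements = queue.Queue()
    updater = threading.Thread(target=run_updater, args=(db_path, statements))
    updater.start()
    try:
        handler = StatementGenerator(statements)
        xml.sax.parseString(xml_text.encode("utf-8"), handler)
    finally:
        statements.put(SENTINEL)  # always release the updater thread
        updater.join()
```

The queue is what decouples the two activities: the parser never waits for the database, and the updater starts work on the first statement before the last element has been read. The sketch assumes the `items` table already exists in the database file at `db_path`.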
6. A computer readable medium having program code stored therein for loading markup language data files and documents into a computer database in a computer system, said markup language data files containing data items and elements, said database having an application programming interface for modifying contents of the database through database command statements, said program code when executed by a computer system causing the computer system to perform the steps of:
parsing a markup language data file one element at a time according to a format description, each of said elements being immediately available for further processing without waiting for subsequent elements to be parsed;
generating one or more database command statements immediately responsive to the availability of said elements such that said command generation and parsing occur substantially simultaneously;
storing each of said generated database command statements in a database command statement queue immediately upon their generation; and
executing said queued database command statements such that a database is updated substantially simultaneously with said step of parsing a data file in order to achieve loading of data from said markup language data file into a database.
Dependent claims: 7, 8, 9, 10
11. A system for loading markup language data files and documents into a computer database in a computer, said markup language data files containing data items and elements, said database having an application programming interface for modifying contents of the database through database command statements, said system comprising:
a markup language data file loader which is adapted to parse markup language data files on an element-by-element basis, each of said elements being immediately available for further processing without waiting for subsequent elements to be parsed;
said loader being adapted to select and operate an operator extension;
at least one operator extension operable by said loader, adapted to immediately evaluate each element as it is available from said loader and to generate database application programming interface command statements such that said command generation and parsing occur substantially simultaneously; and
a database updater adapted to execute said generated database application programming interface command statements in order to modify contents of a database substantially simultaneously with said loading and command generation.
Dependent claims: 12, 13, 14, 15, 16, 17
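The three components named in claim 11 (a loader, selectable operator extensions, and a database updater) might be arranged as in the sketch below. Everything here is an assumption made for illustration: the claim does not say how an extension is selected, so this sketch picks one from the document's root element, and the class names, table names, and XML formats are hypothetical. For brevity the updater runs inline with the parser; the queue-and-thread decoupling is left out.

```python
import sqlite3
import xml.sax


class OrderExtension:
    """Hypothetical operator extension for <order> documents."""

    def statements_for(self, name, attrs):
        if name == "item":
            yield ("INSERT INTO items (sku, qty) VALUES (?, ?)",
                   (attrs["sku"], int(attrs["qty"])))


class CustomerExtension:
    """Hypothetical operator extension for <customers> documents."""

    def statements_for(self, name, attrs):
        if name == "customer":
            yield ("INSERT INTO customers (name) VALUES (?)", (attrs["name"],))


# Assumed selection rule: root element name -> extension class.
EXTENSIONS = {"order": OrderExtension, "customers": CustomerExtension}


class DatabaseUpdater:
    """Executes generated statements against an open connection."""

    def __init__(self, conn):
        self.conn = conn

    def execute(self, sql, params):
        self.conn.execute(sql, params)


class Loader(xml.sax.ContentHandler):
    """Selects an extension from the root element, then feeds it each element."""

    def __init__(self, updater):
        super().__init__()
        self.updater = updater
        self.extension = None

    def startElement(self, name, attrs):
        if self.extension is None:
            # First element seen is the root: choose the extension by type.
            self.extension = EXTENSIONS[name]()
            return
        for sql, params in self.extension.statements_for(name, attrs):
            self.updater.execute(sql, params)


def load_with_extension(xml_text, conn):
    loader = Loader(DatabaseUpdater(conn))
    xml.sax.parseString(xml_text.encode("utf-8"), loader)
    conn.commit()
```

Supporting a new document type in this arrangement means adding one extension class and one entry in `EXTENSIONS`; the loader and updater stay unchanged, which is one reading of the "extendible architecture" the abstract mentions.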
Specification