Processing structured documents stored in a database
First Claim
Patent Images
1. A method for processing a structured document, the method comprising:
- identifying a plurality of strings processed by a database;
assigning an identifier to each of the plurality of strings;
storing each of the plurality of strings and its assigned identifier in a table in the database, wherein at least one string of the plurality of strings in the table in the database is derived from a source other than the structured documents stored in the database, and wherein at least another string of the plurality of strings in the table in the database is derived from the one or more structured documents stored in the database; and
utilizing the table during document processing at least by;
receiving a request to store a structured document in the database, wherein the structured document comprises a plurality of element, attribute, and namespace strings;
transforming the structured document by replacing the plurality of element, attribute, and namespace strings with their respective identifiers at least by;
requesting an identifier for one of the plurality of element, attribute, and namespace strings in the structured document;
performing a lookup on the table for the one string;
returning the identifier assigned to the one string when the one string is located in the table; and
assigning a new identifier to the one string, storing the one string and the new identifier in the table, designating a status of the one string as permanent, and returning the new identifier when the one string is not located in the table; and
storing the transformed structured document in the database.
0 Assignments
0 Petitions
Accused Products
Abstract
A method, computer readable medium, and system for processing a structured document are disclosed. The method, computer readable medium, and system include identifying a plurality of strings processed by a database, assigning an identifier to each of the plurality of strings, storing each of the plurality of strings and its assigned identifier in a table in the database, and utilizing the table during document processing.
-
Citations
22 Claims
-
1. A method for processing a structured document, the method comprising:
-
identifying a plurality of strings processed by a database; assigning an identifier to each of the plurality of strings; storing each of the plurality of strings and its assigned identifier in a table in the database, wherein at least one string of the plurality of strings in the table in the database is derived from a source other than the structured documents stored in the database, and wherein at least another string of the plurality of strings in the table in the database is derived from the one or more structured documents stored in the database; and utilizing the table during document processing at least by; receiving a request to store a structured document in the database, wherein the structured document comprises a plurality of element, attribute, and namespace strings; transforming the structured document by replacing the plurality of element, attribute, and namespace strings with their respective identifiers at least by; requesting an identifier for one of the plurality of element, attribute, and namespace strings in the structured document; performing a lookup on the table for the one string; returning the identifier assigned to the one string when the one string is located in the table; and assigning a new identifier to the one string, storing the one string and the new identifier in the table, designating a status of the one string as permanent, and returning the new identifier when the one string is not located in the table; and storing the transformed structured document in the database. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A non-transitory computer readable medium containing a computer program for processing a structured document stored thereon, the computer program comprising instructions for:
-
identifying a plurality of strings processed by a database; assigning an identifier to each of the plurality of strings; storing each of the plurality of strings and its assigned identifier in a table in the database, wherein at least one string of the plurality of strings in the table in the database is derived from a source other than the structured documents stored in the database, and wherein at least another string of the plurality of strings in the table in the database is derived from the one or more structured documents stored in the database; and utilizing the table during document processing at least by; receiving a request to store a structured document in the database, wherein the structured document comprises a plurality of element, attribute, and namespace strings; transforming the structured document by replacing the plurality of element, attribute, and namespace strings with their respective identifiers at least by; requesting an identifier for one of the plurality of element, attribute, and namespace strings in the structured document; performing a lookup on the table for the one string; returning the identifier assigned to the one string when the one string is located in the table; and assigning a new identifier to the one string, storing the one string and the new identifier in the table, designating a status of the one string as permanent, and returning the new identifier when the one string is not located in the table; and storing the transformed structured document in the database. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A system for processing a structured document, the system comprising:
-
a string identifier generator operable to; identify a plurality of strings processed by the database, and assign an identifier to each of the plurality of strings; a database in communication with the string identifier generator, the database including a table operable to store each of the plurality of strings and its assigned identifier, wherein at least one string of the plurality of strings in the table in the database is derived from a source other than the structured documents stored in the database, and wherein at least another string of the plurality of strings in the table in the database is derived from the one or more structured documents stored in the database; and a cache in communication with the string identifier generator and the database, the cache being operable to store a portion of the table, wherein the table and the cache are utilized during document processing at least by; receiving a request to store a structured document in the database, wherein the structured document comprises a plurality of element, attribute, and namespace strings; the transforming the structured document by replacing the plurality of element, attribute, and namespace strings with their respective identifiers at least by; requesting an identifier for one of the plurality of element, attribute, and namespace strings in the structured document; performing a lookup on the table for the one string; returning the identifier assigned to the one string when the one string is located in the table; and assigning a new identifier to the one string, storing the one string and the new identifier in the table, designating a status of the one string as permanent, and returning the new identifier when the one string is not located in the table; and storing the transformed structured document in the database. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22)
-
Specification