System and method facilitating unified framework for structured/unstructured data
First Claim
1. At a computer system including a processor and system memory, a computer-implemented method for accessing data, comprising:
- parsing a hierarchical source document into a data document that comprises a hierarchical representation of the source document;
loading, by the processor, information associated with the parsed hierarchical source document into a hierarchical, unstructured data document such that the data document comprises an unstructured hierarchical representation of the information associated with the hierarchical source document;
parsing the same hierarchical source document into a data set that comprises a relational representation of the source document;
loading information associated with the parsed hierarchical source document into the relational, structured data set such that the data set comprises a structured relational representation of at least some of the information associated with the hierarchical source document, wherein the structured relational representation is defined by a schema that is inferred based on information associated with the hierarchical source document; and
accessing the information in the hierarchical source document in at least one of the following manners using a software service;
as structured data via the structured data set or as unstructured, hierarchical data via the unstructured data document, wherein any changes to the data document are only reflected in the information stored in the data set if the changed data is associated with the relational data stored in the data set, such that integrity of the source document is maintained in the data stored in the data document while the data set maintains at least a subset of the information associated with the source document.
2 Assignments
0 Petitions
Accused Products
Abstract
A system and method facilitating a unified framework for accessing structured and unstructured data is provided. The invention includes a source document having data that is parsed into a data document component providing a hierarchical representation of data associated with the source document and a data set component providing a relational representation of at least a portion of the data associated with the source document. The invention further provides for a schema defining a structure of the relational representation to be associated with the source document and/or inferred by the data set component. Data stored in the data document component and the data set component are synchronized, thus a change made to data stored in the data set component is reflected in data stored in the data document component. Further, a change made to data stored in the data document component is reflected in data stored in the data set component if utilized according to the schema. The invention further provides for a service to access the hierarchical representation of data associated with the source document and/or a designer to access the relational representation of data associated with the source document.
40 Citations
17 Claims
-
1. At a computer system including a processor and system memory, a computer-implemented method for accessing data, comprising:
-
parsing a hierarchical source document into a data document that comprises a hierarchical representation of the source document; loading, by the processor, information associated with the parsed hierarchical source document into a hierarchical, unstructured data document such that the data document comprises an unstructured hierarchical representation of the information associated with the hierarchical source document; parsing the same hierarchical source document into a data set that comprises a relational representation of the source document; loading information associated with the parsed hierarchical source document into the relational, structured data set such that the data set comprises a structured relational representation of at least some of the information associated with the hierarchical source document, wherein the structured relational representation is defined by a schema that is inferred based on information associated with the hierarchical source document; and accessing the information in the hierarchical source document in at least one of the following manners using a software service;
as structured data via the structured data set or as unstructured, hierarchical data via the unstructured data document, wherein any changes to the data document are only reflected in the information stored in the data set if the changed data is associated with the relational data stored in the data set, such that integrity of the source document is maintained in the data stored in the data document while the data set maintains at least a subset of the information associated with the source document. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system facilitating access to data, comprising:
-
a computer processor; a system memory; a first storing module for storing a hierarchical source document as a hierarchical, unstructured data document such that the data document comprises an unstructured hierarchical representation of data associated with the hierarchical source document; a second storing module for storing at least a portion of the same hierarchical source document as a relational, parsed, structured data set such that the data set comprises a structured relational representation of data, wherein the structured relational representation is defined by a schema that is inferred based on information associated with the hierarchical source document; an accessing module comprising a software service for accessing at least one of the hierarchical, unstructured representation of data and the relational, structured representation of data in at least one of the following manners;
as structured data via the structured data set or as unstructured, hierarchical data via the unstructured data document; anda synchronizing module for synchronizing a change to the relational, structured representation of data with the hierarchical, unstructured representation of data, wherein any changes to the data document are only reflected in the information stored in the data set if the changed data is associated with the relational data stored in the data set, such that integrity of the source document is maintained in the data stored in the data document while the data set maintains at least a subset of the information associated with the source document. - View Dependent Claims (10, 11)
-
-
12. At a computer system including a processor and system memory, a computer-implemented method for accessing data, comprising:
-
parsing a hierarchical source document into a data document that comprises a hierarchical representation of the source document; loading, by the processor, information associated with the parsed hierarchical source document into a hierarchical, unstructured data document such that the data document comprises an unstructured hierarchical representation of the information associated with the hierarchical source document; parsing the same hierarchical source document into a data set that comprises a relational representation of the source document; loading information associated with the parsed hierarchical source document into the relational, structured data set such that the data set comprises a structured relational representation of at least some of the information associated with the hierarchical source document, wherein the structured relational representation is defined by a schema that is inferred based on information associated with the hierarchical source document; associating a region of the hierarchical, unstructured representation of data with a row of the relational, structured data set; correlating a subset of the region with a state, the state being one of;
(1) the subset being mapped to an element of the data set;
(2) the subset being mapped to the row of the data set; and
(3) the subset being unmapped to the data set, a topmost node of the associated region is an element with a local name and namespace as at least one corresponding property of an associated data table in a data set component; andaccessing the associated region of a hierarchical representation of data in at least one of the following manners using a software service;
as structured data via the structured data set or as unstructured, hierarchical data via the unstructured data document, wherein any changes to the data document are only reflected in the information stored in the data set if the changed data is associated with the relational data stored in the data set, such that integrity of the source document is maintained in the data stored in the data document while the data set maintains at least a subset of the information associated with the source document. - View Dependent Claims (13, 14, 15, 16, 17)
-
Specification