Integrating diverse data sources using a mark-up language
First Claim
Patent Images
1. A method for processing source data from a plurality of diverse sources in a selected data domain, comprising:
- specifying a unified schema that is defined specifically for the selected data domain, from among a multiplicity of schemata that are specific to different data domains selected from a group of domains consisting of computer system performance evaluation, customer relationship management, healthcare, and telecommunications, the specified schema listing markup tags in the selected data domain that can exist in a document in the markup language;
defining correspondences of data fields from the sources to the markup tags listed by the selected schema; and
mapping the source data in accordance with the correspondences to generate unified data in the markup language,wherein at least some of the source data are represented in a language other than the markup language, and wherein mapping the source data comprises transforming the data to the markup language.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for processing source data from a plurality of diverse sources in a selected data domain includes specifying a unified schema that lists markup tags in the selected data domain that can exist in a document in the markup language and defining correspondences of data fields from the sources to the markup tags listed by the schema. The source data are mapped in accordance with the correspondences to generate unified data in the markup language.
-
Citations
29 Claims
-
1. A method for processing source data from a plurality of diverse sources in a selected data domain, comprising:
-
specifying a unified schema that is defined specifically for the selected data domain, from among a multiplicity of schemata that are specific to different data domains selected from a group of domains consisting of computer system performance evaluation, customer relationship management, healthcare, and telecommunications, the specified schema listing markup tags in the selected data domain that can exist in a document in the markup language; defining correspondences of data fields from the sources to the markup tags listed by the selected schema; and mapping the source data in accordance with the correspondences to generate unified data in the markup language, wherein at least some of the source data are represented in a language other than the markup language, and wherein mapping the source data comprises transforming the data to the markup language. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method for processing source data from a plurality of diverse sources in a selected data domain, comprising:
-
specifying a unified schema that is defined specifically for the selected data domain, from among a multiplicity of schemata that are specific to different data domains selected from a group of domains consisting of computer system performance evaluation, customer relationship management, healthcare, and telecommunications, the specified schema listing markup tags in the selected data domain that can exist in a document in the markup language; defining correspondences of data fields from the sources to the markup tags listed by the selected schema; mapping the source data in accordance with the correspondences to generate unified data in the markup language; and querying the sources by addressing a query to the unified data in the markup language, wherein mapping the source data comprises mapping the source data responsive to the query.
-
-
11. Apparatus for processing source data from a plurality of diverse sources in a selected data domain, comprising a data integration processor, which is adapted to receive and store a unified schema that is defined specifically for the selected data domain, from among a multiplicity of schemata that are specific to different data domains selected from a group of domains consisting of computer system performance evaluation, customer relationship management, healthcare, and telecommunications, the schema listing markup tags in the selected data domain that can exist in a document in the markup language, and which is further adapted to receive and store definitions of correspondences of data fields from the sources to the markup tags listed by the schema, and to map the source data in accordance with the correspondences to generate unified data in the markup language,
wherein at least some of the source data are represented in a language other than the markup language, and wherein the processor is adapted to transform the data to the markup language.
-
19. Apparatus for processing source data from a plurality of diverse sources in a selected data domain, comprising a data integration processor, which is adapted to receive and store a unified schema that is defined specifically for the selected data domain, from among a multiplicity of schemata that are specific to different data domains selected from a group of domains consisting of computer system performance evaluation, customer relationship management, healthcare, and telecommunications, the schema listing markup tags in the selected data domain that can exist in a document in the markup language, and which is further adapted to receive and store definitions of correspondences of data fields from the sources to the markup tags listed by the schema, and to map the source data in accordance with the correspondences to generate unified data in the markup language,
wherein the processor is adapted to receive and respond to a query addressed to the unified data in the markup language, and wherein the processor is adapted to map the source data responsive to the query.
-
20. A computer software product for processing source data from a plurality of diverse sources in a selected data domain, the product comprising a computer-readable medium in which program instructions are stored, which instructions, when read by a computer, cause the computer to receive a unified schema that is defined specifically for the selected data domain, from among a multiplicity of schemata that are specific to different data domains selected from a group of domains consisting of computer system performance evaluation, customer relationship management, healthcare, and telecommunications, the schema listing markup tags in the selected data domain that can exist in a document in the markup language, the instructions further causing the computer to receive definitions of correspondences of data fields from the sources to the markup tags listed by the schema, and to map the source data in accordance with the correspondences to generate unified data in the markup language,
wherein the instructions further cause the computer to accept and respond to a query addressed to the unified data in the markup language, wherein the product comprises middleware, which causes the computer to map the source data responsive to the query.
-
26. Apparatus for processing source data from a plurality of diverse sources in diverse formats in a selected data domain, comprising a data integration processor, which is adapted to receive and store a unified schema that lists markup tags in the selected data domain that can exist in a document in the markup language, and to receive and store definitions of correspondences of data fields from the sources to the markup tags listed by the schema, and
which is further adapted to receive a query with respect to the sources in the markup language, and upon receiving the query, to map the source data from at least one of the diverse formats to the unified schema responsively to the query in accordance with the correspondences so as to generate a reply to the query, so that the reply comprises unified data in the markup language.
-
27. A computer software product for processing source data from a plurality of diverse sources in diverse formats in a selected data domain, the product comprising a computer-readable medium in which program instructions are stored, which instructions, when read by a computer, cause the computer to receive a unified schema that lists markup tags in the selected data domain that can exist in a document in the markup language and to receive definitions of correspondences of data fields from the sources to the markup tags listed by the schema,
the instructions further causing the computer to receive a query with respect to the sources in the markup language, and upon receiving the query, to map the source data from at least one of the diverse formats to the unified schema responsively to the query in accordance with the correspondences so as to generate a reply to the query, so that the reply comprises unified data in the markup language.
Specification