Systems and Methods for Providing Performance Metadata in Interest-Driven Business Intelligence Systems
First Claim
1. An interest-driven business intelligence server system, comprising:
- a processor;
a memory connected to the processor and configured to store an interest-driven business intelligence application;
raw data storage configured to store raw data, where a portion of the raw data comprises unstructured data;
metadata storage configured to store data description metadata describing the raw data; and
report storage configured to store previously generated reports comprising previous reporting data and previous reporting data requirements;
wherein the interest-driven business intelligence application configures the processor to;
receive a report specification, where the report specification comprises at least one reporting data requirement;
determine performance metadata information for an interest-driven data pipeline that is utilized to generate reporting data based on the report specification; and
provide the performance metadata information for the interest-driven data pipeline to an interest-driven user visualization system.
2 Assignments
0 Petitions
Accused Products
Abstract
Interest-driven business intelligence server systems that provide performance metadata are described. In the disclosed embodiments, an interest-driven business intelligence server system receives a report specification. The report specification includes at least reporting data requirement. The interest-driven business intelligence server determines performance metadata information for an interest-driven data pipeline that is utilized to generate reporting data based on the report specification. The performance metadata information for the interest-driven data pipeline is transmitted to an interest-driven user visualization system by the interest-driven business intelligence server system.
-
Citations
24 Claims
-
1. An interest-driven business intelligence server system, comprising:
-
a processor; a memory connected to the processor and configured to store an interest-driven business intelligence application; raw data storage configured to store raw data, where a portion of the raw data comprises unstructured data; metadata storage configured to store data description metadata describing the raw data; and report storage configured to store previously generated reports comprising previous reporting data and previous reporting data requirements; wherein the interest-driven business intelligence application configures the processor to; receive a report specification, where the report specification comprises at least one reporting data requirement; determine performance metadata information for an interest-driven data pipeline that is utilized to generate reporting data based on the report specification; and provide the performance metadata information for the interest-driven data pipeline to an interest-driven user visualization system. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16)
-
-
15. The system of 14 wherein the interest-driven business intelligence application determines estimates of performance metadata for the reporting data satisfying the reporting data requirements by configuring the processor to:
-
retrieve historical performance metadata for the reporting data satisfying the reporting data requirements; and wherein the historical performance metadata is used to determine the estimate of the performance metadata for the reporting data satisfying the reporting data requirements.
-
-
17. A method performed by an interest-driven business intelligence server system to provide performance metadata for an interest-driven data pipeline comprising:
-
receiving a report specification using an interest-driven business intelligence server system, where the report specification comprises at least one reporting data requirement; determining performance metadata information for an interest-driven data pipeline that is utilized to generate reporting data based on the report specification wherein the performance metadata information includes at least one of estimates of performance metadata for the interest-driven data pipeline and historical performance metadata of the interest driven data pipeline; and transmitting the performance metadata information for the interest-driven data pipeline to an interest-driven user visualization system using the interest-driven business intelligence server system. - View Dependent Claims (18, 19, 20, 21, 22)
-
-
23. A method performed by an interest-driven business intelligence server system to provide metric data for an interest-driven data pipeline comprising:
-
receiving a report specification using an interest-driven business intelligence server system, where the report specification comprises at least one reporting data requirement; determining performance metadata information for an interest-driven data pipeline that is utilized to generate reporting data based on the report specification wherein the performance metadata information includes estimates of performance metadata for the interest-driven data pipeline including performance metadata for raw data wherein the estimates of the performance metadata for the raw data are determined by; generating a mini job that is an ETL processing job that extracts a sufficient amount to provide an accurate sample of the raw data available using the interest-driven business intelligence server system, providing the mini job to a distributed storage system using the interest-driven business intelligence server system, receiving the sample of raw data from the distributed storage system in response to the mini job using the interest-driven business intelligence server system, determining performance metadata of the sample raw data using the interest-driven business intelligence server system wherein the performance metadata for the sample of raw data includes performance metadata for the mini job and the estimates of the performance metadata includes estimates for an ETL processing job for obtaining raw data to satisfy the reporting requirement, determining estimates of performance metadata for the raw data needed to satisfy the reporting requirements based on the determined performance metadata of the sample of raw data using the interest-driven business intelligence server system; and transmitting the performance metadata information for the interest-driven data pipeline to an interest-driven user visualization system using the interest-driven business intelligence server system.
-
-
24. An interest-driven business intelligence server system, comprising:
-
a processor; a memory connected to the processor and configured to store an interest-driven business intelligence application; raw data storage configured to store raw data, where a portion of the raw data comprises unstructured data; metadata storage configured to store data description metadata describing the raw data; and report storage configured to store previously generated reports comprising previous reporting data and previous reporting data requirements; wherein the interest-driven business intelligence application configures the processor to; receive a report specification, where the report specification comprises at least one reporting data requirement; determine performance metadata information for an interest-driven data pipeline that is utilized to generate reporting data based on the report specification wherein the interest-driven data pipeline includes transformation processes that transform a set of raw data into a set of reporting data and intermediate sets of data generated by the transformation processes and wherein the performance metadata information for interest-driven data pipeline includes performance metadata for raw data, reporting data, and source data of the interest-driven data pipeline and wherein the performance metadata for the raw data by configuring the processor to; generate a mini job that is an ETL processing job that extracts a sufficient amount to data provide an accurate sample of the raw data available, provide the mini job to an distributed storage system, receive the sample of raw data from the distributed storage system in response to the mini job, determine performance metadata of the sample raw data; and determine estimates of performance metadata for the raw data needed to satisfy the reporting requirements based on the determined performance metadata of the sample of raw data, wherein the performance metadata for the sample of raw data includes performance metadata for the mini job and the estimates of the performance metadata includes estimates for an ETL processing job for obtaining raw data to satisfy the reporting requirements, wherein the interest-driven business intelligence application determines estimates of performance metadata for the raw data satisfying the reporting data requirements by configuring the processor to retrieve historical performance metadata for the raw data satisfying the reporting data requirements, and wherein the historical information is used to determine the estimate of the performance metadata for the raw data satisfying the reporting data requirements; and provide the performance metadata information for the interest-driven data pipeline to an interest-driven user visualization system.
-
Specification