Data relationships storage platform
First Claim
Patent Images
1. A data relationships storage platform, comprising:
- a data processing system communicatively coupled to one or more data sources and one or more big-data databases, wherein the data processing system is programmed to;
collect data pieces from the one or more data sources;
analyze the collected data pieces to determine whether one or more relationships exist between the collected data pieces;
determine correlation information corresponding to the data pieces, wherein the correlation information comprises information indicating how two or more data pieces relate to each other;
create one or more data globs that include one or more of the data pieces and relationship information;
create reusable data globs that include the data pieces and relationship information;
communicate one or more data globs to the one or more big-data databases so that the big-data databases store the data globs, anddeduplicate collected data pieces according to a level of similarity the collected data pieces have with stored information associated with one or more data globs,wherein the data processing system includes a storage medium having stored therein;
one or more collector modules that perform the collecting, where at least one collector module is designated to collect data pieces from each data source;
one or more analyzer modules that perform the analyzing and the creation of the data globs, where the analyzer modules use an intensity algorithm to determine the degree of correlation between data pieces; and
one or more data services that manage the communication of the data globs to the big-data databases.
1 Assignment
0 Petitions
Accused Products
Abstract
A data relationships storage platform for analysis of one or more data sources is described herein. A data processing system may be communicatively coupled to one or more data sources and one or more big-data databases. One or more collectors may collect data pieces from the one or more data sources. One or more analyzer may analyze the collected data pieces to determine whether one or more relationships exist between the collected data pieces. The analysis results in one or more data globs that include one or more of the data pieces and relationship information, such as tags. The tagged data globs may be communicated to and stored in one or more big-data databases.
53 Citations
22 Claims
-
1. A data relationships storage platform, comprising:
a data processing system communicatively coupled to one or more data sources and one or more big-data databases, wherein the data processing system is programmed to; collect data pieces from the one or more data sources; analyze the collected data pieces to determine whether one or more relationships exist between the collected data pieces; determine correlation information corresponding to the data pieces, wherein the correlation information comprises information indicating how two or more data pieces relate to each other; create one or more data globs that include one or more of the data pieces and relationship information; create reusable data globs that include the data pieces and relationship information; communicate one or more data globs to the one or more big-data databases so that the big-data databases store the data globs, and deduplicate collected data pieces according to a level of similarity the collected data pieces have with stored information associated with one or more data globs, wherein the data processing system includes a storage medium having stored therein; one or more collector modules that perform the collecting, where at least one collector module is designated to collect data pieces from each data source; one or more analyzer modules that perform the analyzing and the creation of the data globs, where the analyzer modules use an intensity algorithm to determine the degree of correlation between data pieces; and one or more data services that manage the communication of the data globs to the big-data databases. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
10. A data relationships storage platform, comprising:
-
a big-data database communicatively coupled to a data processing system that is communicatively coupled to one or more data sources, wherein the data processing system communicates one or more data globs to the big-data database so that the big-data database stores the data globs, wherein the data globs are generated by the data processing system by; collecting data pieces from the one or more data sources, analyzing the collected data pieces to determine whether one or more relationships exist between the collected data pieces; determining correlation information corresponding to the data pieces, wherein the correlation information comprises information indicating how two or more data pieces relate to each other; creating the one or more data globs out of the one or more of the data pieces and relationship information; creating reusable data globs that include the data pieces and relationship information; communicating one or more data globs to the one or more big-data databases so that the big-data databases store the data globs; and deduplicating collected data pieces according to a level of similarity the collected data pieces have with stored information associated with data globs in the big-data database, wherein the data processing system includes a storage medium having stored therein; one or more collector modules that perform the collecting, where at least one collector module is designated to collect data pieces from each data source; one or more analyzer modules that perform the analyzing and the creation of the data globs, where the analyzer modules use an intensity algorithm to determine the degree of correlation between data pieces; and one or more data services that manage the communication of the data globs to the big-data databases. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
Specification