System and method for generating random linked data according to an RDF dataset profile
First Claim
Patent Images
1. A computer program product residing on a non-transitory computer readable storage medium having a plurality of instructions stored thereon which, when executed by a processor, cause the processor to perform operations comprising:
- gathering statistics for a plurality of characteristics of a set of resources associated with a framework, wherein gathering statistics includes reading the set of resources in an N-Quads format, wherein the N-Quads format includes a graph URI, wherein the statistics include unique rdf;
type objects in the set of resources and further include predicates that appear in resources for the unique rdf;
type objects;
generating a profile based upon, at least in part, the gathered statistics, wherein the profile generated includes, at least in part, a minimum, maximum, average, and standard deviation of at least one of a string value and a numerical value for each predicate;
selecting a data set for generation of a new resource associated with the framework with one or more of the plurality of characteristics of the set of resources based upon, at least in part, the graph URI, wherein the data set includes an existing dataset for augmentation;
generating the new resource associated with the framework with the one or more of the plurality of characteristics of the set of resources using the profile generated based upon the gathered statistics, including selecting a graph type to create based on a distribution included within the profile, wherein generating the new resource associated with the framework includes generating new, random data with the one or more of the plurality of characteristics of the set of resources using the profile, wherein, upon selecting the existing dataset for augmentation, generating the new resource associated with the framework includes randomly selecting a resource from the existing dataset for augmentation to which to link, and producing a single fully linked dataset; and
testing an application with the new resource associated with the framework.
1 Assignment
0 Petitions
Accused Products
Abstract
A method, computer program product, and computer system for gathering statistics, by a computing device, for a set of resources associated with a framework. A profile is generated based upon, at least in part, the gathered statistics. A data set is selected for generation of a new resource. The new resource is generated using the profile generated based upon the gathered statistics.
-
Citations
10 Claims
-
1. A computer program product residing on a non-transitory computer readable storage medium having a plurality of instructions stored thereon which, when executed by a processor, cause the processor to perform operations comprising:
-
gathering statistics for a plurality of characteristics of a set of resources associated with a framework, wherein gathering statistics includes reading the set of resources in an N-Quads format, wherein the N-Quads format includes a graph URI, wherein the statistics include unique rdf;
type objects in the set of resources and further include predicates that appear in resources for the unique rdf;
type objects;generating a profile based upon, at least in part, the gathered statistics, wherein the profile generated includes, at least in part, a minimum, maximum, average, and standard deviation of at least one of a string value and a numerical value for each predicate; selecting a data set for generation of a new resource associated with the framework with one or more of the plurality of characteristics of the set of resources based upon, at least in part, the graph URI, wherein the data set includes an existing dataset for augmentation; generating the new resource associated with the framework with the one or more of the plurality of characteristics of the set of resources using the profile generated based upon the gathered statistics, including selecting a graph type to create based on a distribution included within the profile, wherein generating the new resource associated with the framework includes generating new, random data with the one or more of the plurality of characteristics of the set of resources using the profile, wherein, upon selecting the existing dataset for augmentation, generating the new resource associated with the framework includes randomly selecting a resource from the existing dataset for augmentation to which to link, and producing a single fully linked dataset; and testing an application with the new resource associated with the framework. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computing system including a processor and a memory configured to perform operations comprising:
-
gathering statistics for a plurality of characteristics of a set of resources associated with a framework, wherein gathering statistics includes reading the set of resources in an N-Quads format, wherein the N-Quads format includes a graph URI, wherein the statistics include unique rdf;
type objects in the set of resources and further include predicates that appear in resources for the unique rdf;
type objects;generating a profile based upon, at least in part, the gathered statistics, wherein the profile generated includes, at least in part, a minimum, maximum, average, and standard deviation of at least one of a string value and a numerical value for each predicate; selecting a data set for generation of a new resource with one or more of the plurality of characteristics of the set of resources based upon, at least in part, the graph URI, wherein the data set includes an existing dataset for augmentation; generating the new resource associated with the framework with the one or more of the plurality of characteristics of the set of resources using the profile generated based upon the gathered statistics, including selecting a graph type to create based on a distribution included within the profile, wherein generating the new resource associated with the framework includes generating new, random data with the one or more of the plurality of characteristics of the set of resources using the profile, wherein, upon selecting the existing dataset for augmentation, generating the new resource associated with the framework includes randomly selecting a resource from the existing dataset for augmentation to which to link, and producing a single fully linked dataset; and testing an application with the new resource associated with the framework. - View Dependent Claims (8, 9, 10)
-
Specification