Incrementally retrieving data for objects to provide a desired level of detail
First Claim
Patent Images
1. A system for retrieving data and metadata comprising:
- a memory;
a computing device comprising;
a user interface that;
allows a user to select an information set that includes a limited amount of information from a repository index that includes information about an indexed repository including one or more data server systems, wherein the information set includes references to member objects in the repository index;
allows the user to select an action to apply to the information set wherein the selected action requires extra information for the information set beyond the information within the repository index;
allows the user to define one or more data harvesting criteria for a data expansion operation to be performed for the information set on the indexed repository wherein the data expansion operation provides extra information for performance of the selected action; and
a processor that;
determines data server systems of the indexed repository to participate in the data expansion operation;
causes each participating data server system to execute the data expansion operation against data and metadata on that particular data server system according to the data harvesting criteria while allowing the data and metadata to be left intact on that particular data server system;
utilizes natural language processing and artificial intelligence to classify the data and metadata on each particular data server system and to refine the data harvesting criteria;
converts a characterization included in the data harvesting criteria to adaptors with filters to retrieve the data and metadata on each particular data server system according to a user request;
transforms the repository index for the indexed repository to include additional information from the data expansion operation, wherein the transforming adds the additional information to the information set to incrementally update the limited amount of information included in the information set until a desired level of detail is attained, and wherein the transforming ensures that subsequently generated information sets created based on the data and metadata subject to the data expansion operation include the additional information;
the additional information comprises a plurality of subsets, and wherein transforming the repository index comprises;
training a classifier to learn one or more classifications based on machine learning techniques;
classifying each of the subsets into one or more learned classifications during the data expansion operation via the trained classifier;
determining inclusion of each subset in the desired level of detail according to the classification of the subset by the trained classifier; and
applies the selected action to the information set based on the additional information from the data expansion operation.
2 Assignments
0 Petitions
Accused Products
Abstract
A computer-implemented method is provided, for retrieving data and metadata according to a data harvesting criteria into an indexed repository, comprising providing a user interface allowing a user to define data harvesting criteria for adjusting a subset of data and metadata on an indexed repository. Responsive to a user utilizing the user interface by defining the data harvesting criteria, the subset of data and metadata on the indexed repository is adjusted according to the data harvesting criteria.
79 Citations
9 Claims
-
1. A system for retrieving data and metadata comprising:
-
a memory; a computing device comprising; a user interface that; allows a user to select an information set that includes a limited amount of information from a repository index that includes information about an indexed repository including one or more data server systems, wherein the information set includes references to member objects in the repository index; allows the user to select an action to apply to the information set wherein the selected action requires extra information for the information set beyond the information within the repository index; allows the user to define one or more data harvesting criteria for a data expansion operation to be performed for the information set on the indexed repository wherein the data expansion operation provides extra information for performance of the selected action; and a processor that; determines data server systems of the indexed repository to participate in the data expansion operation; causes each participating data server system to execute the data expansion operation against data and metadata on that particular data server system according to the data harvesting criteria while allowing the data and metadata to be left intact on that particular data server system; utilizes natural language processing and artificial intelligence to classify the data and metadata on each particular data server system and to refine the data harvesting criteria; converts a characterization included in the data harvesting criteria to adaptors with filters to retrieve the data and metadata on each particular data server system according to a user request; transforms the repository index for the indexed repository to include additional information from the data expansion operation, wherein the transforming adds the additional information to the information set to incrementally update the limited amount of information included in the information set until a desired level of detail is attained, and wherein the transforming ensures that subsequently generated information sets created based on the data and metadata subject to the data expansion operation include the additional information; the additional information comprises a plurality of subsets, and wherein transforming the repository index comprises; training a classifier to learn one or more classifications based on machine learning techniques; classifying each of the subsets into one or more learned classifications during the data expansion operation via the trained classifier; determining inclusion of each subset in the desired level of detail according to the classification of the subset by the trained classifier; and applies the selected action to the information set based on the additional information from the data expansion operation. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer program product for providing information to a user, comprising a non-transitory computer readable storage medium having computer readable program code embodied therewith, the computer readable program code, when executed by a processor, causes the processor to:
-
generate an information set that includes a limited amount of information based on a repository index, wherein the repository index includes information about an indexed repository that includes one or more data server systems, and wherein the information set includes references to member objects in the repository index; receive user input via a user interface, the user interface allowing a user to; select an action to apply to the information set, wherein the selected action requires extra information for the information set beyond the information within the repository index; and define one or more data harvesting criteria for a data expansion operation to be performed for the information set on the indexed repository, wherein the data expansion provides the extra information for performance of the selected action; determine data server systems of the indexed repository to participate in the data expansion operation; cause each of the participating data server systems to execute the data expansion operation against data and metadata on that particular data server system according to the data harvesting criteria, while allowing the data and metadata to be left intact on that particular data server system; utilize natural language processing and artificial intelligence to classify the data and metadata on each particular data server system and to refine the data harvesting criteria; convert a characterization included in the data harvesting criteria to adaptors with filters to retrieve the data and metadata on each particular data server system according to a user request; transform the repository index for the indexed repository to include additional information from the data expansion operation, wherein the transforming adds the additional information to the information set to incrementally update the limited amount of information included in the information set until a desired level of detail is attained, and wherein the transforming ensures that subsequently generated information sets created based on the data and metadata subject to the data expansion operation include the additional information; the additional information comprises a plurality of subsets, and wherein transforming the repository index comprises; training a classifier to learn one or more classifications based on machine learning techniques; classifying each of the subsets into one or more learned classifications during the data expansion operation via the trained classifier; determining inclusion of each subset in the desired level of detail according to the classification of the subset by the trained classifier; and apply the selected action to the information set based on the additional information from the data expansion operation. - View Dependent Claims (7, 8, 9)
-
Specification