DATA PRUNING BASED ON METADATA
First Claim
1. A method for managing query operations in a computer system having file sets stored in computer accessible storage comprising:
- receiving a query directed to a set of files comprising data, wherein the query comprises a plurality of predicates;
- determining whether data in each file match at least one predicate of the plurality of predicates based on file metadata without accessing the set of files;
- removing files that do not match at least one predicate from the set of files to create a reduced set of files;
- identifying, based on the metadata, one or more predicates of the query that do not fully match any file in the set of files without accessing the set of files;
- removing the one or more predicates that do not fully match any file in the set of files from the query to create a modified query;
- executing the modified query against the reduced set of files to create a final set of files; and
- returning the final set of files in response to the query.
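The steps of this claim can be illustrated with a minimal sketch. It is not the patented implementation. It assumes that the metadata is a per-file minimum and maximum for a single column, that the predicates are inclusive ranges combined with OR, and that "does not fully match any file" means the metadata rules the predicate out for every file. The names `FileMeta`, `RangePredicate`, `prune`, and `execute` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FileMeta:
    """Hypothetical per-file metadata: min/max of one column."""
    name: str
    column: str
    min_value: int
    max_value: int


@dataclass(frozen=True)
class RangePredicate:
    """Hypothetical predicate: low <= column <= high."""
    column: str
    low: int
    high: int

    def may_match(self, meta: FileMeta) -> bool:
        # Decided from metadata alone: do the two ranges overlap?
        return (meta.column == self.column
                and meta.min_value <= self.high
                and meta.max_value >= self.low)

    def matches_value(self, value: int) -> bool:
        return self.low <= value <= self.high


def prune(files, predicates):
    """Return (reduced files, modified predicates) using metadata only."""
    # Keep only files that may match at least one predicate.
    reduced = [f for f in files if any(p.may_match(f) for p in predicates)]
    # Drop predicates that the metadata rules out for every file.
    modified = [p for p in predicates if any(p.may_match(f) for f in files)]
    return reduced, modified


def execute(reduced, modified, file_data):
    """Only this step reads file contents (file_data: name -> values)."""
    return [f.name for f in reduced
            if any(p.matches_value(v) for v in file_data[f.name] for p in modified)]


files = [
    FileMeta("a", "price", 0, 10),
    FileMeta("b", "price", 50, 60),
    FileMeta("c", "price", 100, 200),
]
predicates = [
    RangePredicate("price", 5, 8),
    RangePredicate("price", 55, 58),
    RangePredicate("price", 300, 400),  # overlaps no file
]
file_data = {"a": [0, 10], "b": [55], "c": [150]}

reduced, modified = prune(files, predicates)
final = execute(reduced, modified, file_data)
```

In this example, file `c` and the third predicate are pruned without reading any file. File `a` survives pruning because its range overlaps the first predicate, but the execution step still excludes it because none of its actual values match.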
Abstract
A system, apparatus, and method for processing queries, wherein a query includes a request to access or delete data. Processing includes accessing metadata associated with the set of data, the metadata defining data characteristics of the set of data, and identifying, based on the metadata and without accessing the actual data in the set of data, at least the sets of data that need or do not need to be accessed or deleted. Also disclosed are methods to optimize the processing of some operations based on the metadata collected on the data.
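As a rough illustration of deciding from metadata which sets of data need to be accessed for a delete request, the sketch below sorts a file into one of three outcomes. It assumes the metadata holds the exact minimum and maximum of the column being filtered and that the delete condition is an inclusive range. Both are assumptions, and `Action` and `classify_for_delete` are hypothetical names.

```python
from enum import Enum


class Action(Enum):
    SKIP = "skip"                        # no row can match; do not access the file
    DROP_WHOLE_FILE = "drop_whole_file"  # every row matches; delete without reading
    SCAN = "scan"                        # some rows may match; the file must be read


def classify_for_delete(file_min: int, file_max: int, low: int, high: int) -> Action:
    """Classify one file for DELETE WHERE low <= value <= high, using min/max only."""
    if file_max < low or file_min > high:
        return Action.SKIP
    if low <= file_min and file_max <= high:
        return Action.DROP_WHOLE_FILE
    return Action.SCAN
```

Only files classified as `SCAN` have their contents read. The other two outcomes are decided from the metadata alone.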
58 Claims
1. A method for managing query operations in a computer system having file sets stored in computer accessible storage comprising:
- receiving a query directed to a set of files comprising data, wherein the query comprises a plurality of predicates;
- determining whether data in each file match at least one predicate of the plurality of predicates based on file metadata without accessing the set of files;
- removing files that do not match at least one predicate from the set of files to create a reduced set of files;
- identifying, based on the metadata, one or more predicates of the query that do not fully match any file in the set of files without accessing the set of files;
- removing the one or more predicates that do not fully match any file in the set of files from the query to create a modified query;
- executing the modified query against the reduced set of files to create a final set of files; and
- returning the final set of files in response to the query.
Dependent claims: 2 to 18.
19. A networked computer system having processors, memory, and storage for processing and storing distributed sets of digital files that are distributed over a plurality of storages, wherein computer executable instructions for query operations cause the computer hardware to perform the instructions, the system comprising:
- a receiving module configured to receive a query directed to a set of files wherein the query comprises a plurality of predicates;
- a metadata module configured to access metadata associated with the set of files, determine whether data in each file match at least one predicate of the plurality of predicates based on the file metadata without accessing the set of files, and remove files that do not match at least one predicate from the set of files to create a reduced set of files;
- a comparison module configured to identify, based on the metadata, one or more predicates of the query that do not fully match any file in the set of files without accessing the set of files; and
- a modification module configured to remove the one or more predicates that do not fully match any file in the set of files from the query to create a modified query.
Dependent claims: 20 to 37.
38. An apparatus comprising:
- a means for receiving a query directed to a set of files wherein the query comprises a plurality of predicates for defining data;
- a means for accessing metadata associated with the set of files, determining whether data in each file match at least one predicate of the plurality of predicates based on the file metadata without accessing the set of files, and removing files that do not match at least one predicate from the set of files to create a reduced set of files;
- a modification means for identifying, based on the metadata, one or more predicates of the query that do not fully match any file in the set of files without accessing the set of files;
- the modification means further for removing the one or more predicates that do not fully match any file in the set of files from the query to create a modified query.
Dependent claims: 39 to 58.
Specification