Adaptive monitoring and processing of new data files and changes to existing data files
First Claim
1. A method, comprising:
- observing a plurality of files in at least a portion of a filesystem for newly added files and changes to existing files;
upon detecting a new file in the at least the portion of the filesystem;
creating a record for the new file;
processing data in the new file; and
setting a time period for the new file that indicates when the observing step is to evaluate whether changes have been made to the new file;
upon detecting a change to an existing file in the at least the portion of the filesystem;
determining if a current size of the existing file is different than a previously saved size for the existing file;
based on determining that the current size of the existing file is different than the previously saved size;
identifying new data in the existing file by comparing a previous end of item identifier associated with the existing file with a current end of item identifier of the existing file, the current end of item identifier is calculated using a number of bytes from the end of the existing file; and
processing the new data in the existing file; and
setting a time period for the existing file that indicates when the observing step is to reevaluate whether changes have been made to the existing file.
1 Assignment
0 Petitions
Accused Products
Abstract
Embodiments are directed towards a dynamic change evaluation mechanism, whereby items having a detected possible change are scheduled for re-evaluation for possible changes at a higher frequency than items detected to not have previously changed, while those items detected as not to have changed are dynamically scheduled for re-evaluation based on an evaluation backlog that may be in turn based, in part, on a time from when an item is assigned an expiration time to when the item is evaluated. In one embodiment, a possibly changed item may be assigned a new expiration time independent of the evaluation backlog. In another embodiment, if no change is detected, then the item may be assigned a new expiration time as a function of a previous expiration time and on the evaluation backlog.
84 Citations
21 Claims
-
1. A method, comprising:
-
observing a plurality of files in at least a portion of a filesystem for newly added files and changes to existing files; upon detecting a new file in the at least the portion of the filesystem; creating a record for the new file; processing data in the new file; and setting a time period for the new file that indicates when the observing step is to evaluate whether changes have been made to the new file; upon detecting a change to an existing file in the at least the portion of the filesystem; determining if a current size of the existing file is different than a previously saved size for the existing file; based on determining that the current size of the existing file is different than the previously saved size; identifying new data in the existing file by comparing a previous end of item identifier associated with the existing file with a current end of item identifier of the existing file, the current end of item identifier is calculated using a number of bytes from the end of the existing file; and processing the new data in the existing file; and setting a time period for the existing file that indicates when the observing step is to reevaluate whether changes have been made to the existing file. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. An apparatus, comprising:
-
a file observation subsystem, implemented at least partially in hardware, that observes a plurality of files in at least a portion of a filesystem for newly added files and changes to existing files; wherein the file observation subsystem, upon detecting a new file in the at least the portion of the filesystem; creates a record for the new file; processes data in the new file; and sets a time period for the new file that indicates when the file observation subsystem is to evaluate whether changes have been made to the new file; a file processor subsystem, implemented at least partially in hardware, that, upon the file observation subsystem detecting a change to an existing file in the at least the portion of the filesystem; determines if a current size of the existing file is different than a previously saved size for the existing file; based on a determination that the current size of the existing file is different than the previously saved size, identifies new data in the existing file by comparing a previous end of item identifier associated with the existing file with a current end of item identifier of the existing file, and processes the new data in the existing file, the current end of item identifier is calculated using a number of bytes from the end of the existing file; and sets a time period for the existing file that indicates when the file observation subsystem is to reevaluate whether changes have been made to the existing file. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer readable medium, storing software instructions, which when executed by one or more processors cause performance of:
-
observing a plurality of files in at least a portion of a filesystem for newly added files and changes to existing files; upon detecting a new file in the at least the portion of the filesystem; creating a record for the new file; processing data in the new file; and setting a time period for the new file that indicates when the observing step is to evaluate whether changes have been made to the new file; upon detecting a change to an existing file in the at least the portion of the filesystem; determining if a current size of the existing file is different than a previously saved size for the existing file; based on determining that the current size of the existing file is different than the previously saved size; identifying new data in the existing file by comparing a previous end of item identifier associated with the existing file with a current end of item identifier of the existing file, the current end of item identifier is calculated using a number of bytes from the end of the existing file; and processing the new data in the existing file; and setting a time period for the existing file that indicates when the observing step is to reevaluate whether changes have been made to the existing file. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
Specification