Methods and systems for quick and efficient data management and/or processing
First Claim
1. A method of data management implemented in and performed by a computing device utilizing a storage device, the method comprising:
- segmenting a data stream using a combination of a first data segmentation procedure and a second data segmentation procedure, wherein an expected average data group size of the first data segmentation procedure and the second data segmentation procedure are different,wherein said first segmentation procedure breaks said data stream into a plurality of data groups,wherein the second data segmentation procedure combines at least two small data groups, from said plurality of data groups, together to make a larger data group having an average size that is larger than average sizes of the small data groups, andwherein the second data segmentation procedure refrains from combining at least two other small data groups together in response to determining that the at least two other small data groups are new and non-duplicate of data previously occurring in the data stream and in response to determining that a data group that is sequentially adjacent to the at least two other small data groups is a duplicate of data previously occurring in the data stream, wherein the at least two other small data groups are consecutive data groups in the plurality of data groups and wherein the second data segmentation procedure combines the at least two small data groups into the larger data group in response to determining that the at least two small data groups are new and non-duplicate of data that has previously occurred in the data stream.
3 Assignments
0 Petitions
Accused Products
Abstract
System(s) and method(s) are provided for data management and data processing. For example, various embodiments may include systems and methods relating to relatively larger groups of data being selected with comparable or better performing selection results (e.g., high data redundancy elimination and/or average chunk size). In various embodiments, the system(s) and method(s) may include, for example a data group, block, or chunk combining technique or/and a data group, block, or chunk splitting technique. Various embodiments may include a first standard or typical data grouping, blocking, or chunking technique and/or data group, block, or chunk combining technique or/and a data group, block, or chunk splitting technique. Exemplary system(s) and method(s) may relate to data hashing and/or data elimination. Embodiments may include a look-ahead buffer and determine whether to emit small chunks or large chunks based on characteristics of underlying data and/or particular application of the invention (e.g., for backup).
-
Citations
11 Claims
-
1. A method of data management implemented in and performed by a computing device utilizing a storage device, the method comprising:
-
segmenting a data stream using a combination of a first data segmentation procedure and a second data segmentation procedure, wherein an expected average data group size of the first data segmentation procedure and the second data segmentation procedure are different, wherein said first segmentation procedure breaks said data stream into a plurality of data groups, wherein the second data segmentation procedure combines at least two small data groups, from said plurality of data groups, together to make a larger data group having an average size that is larger than average sizes of the small data groups, and wherein the second data segmentation procedure refrains from combining at least two other small data groups together in response to determining that the at least two other small data groups are new and non-duplicate of data previously occurring in the data stream and in response to determining that a data group that is sequentially adjacent to the at least two other small data groups is a duplicate of data previously occurring in the data stream, wherein the at least two other small data groups are consecutive data groups in the plurality of data groups and wherein the second data segmentation procedure combines the at least two small data groups into the larger data group in response to determining that the at least two small data groups are new and non-duplicate of data that has previously occurred in the data stream. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method of data management, implemented in and performed by a computing device utilizing a storage device, the method comprising:
-
segmenting a data stream using a combination of a first data segmentation procedure and a second data segmentation procedure, wherein an expected average data group size of the first data segmentation procedure and the second data segmentation procedure are different, wherein said first segmentation procedure breaks said data stream into a plurality of data groups, wherein the second data segmentation procedure refrains from breaking apart a given data group of said plurality of data groups in response to determining that said given data group is non-duplicate of data previously occurring in the data stream, and wherein the second data segmentation procedure further breaks apart at least one of the plurality of data groups that is sequentially adjacent to said given data group into smaller data groups in response to determining that said at least one of the plurality of data groups is non-duplicate of data previously occurring in the data stream and in response to determining that a data group that is sequentially adjacent to said at least one of the plurality of data groups is a duplicate of data previously occurring in the data stream. - View Dependent Claims (10, 11)
-
Specification