CONTENT BASED FILE CHUNKING
First Claim
1. A method performed by data processing apparatus, the method comprising:
- identifying a data item to be chunked;
determining the type of the data item;
determining whether the type of the data item is one of a specified one or more types;
if it is determined that the type of the data item is not one of the specified one or more types, performing a first chunking of the data item; and
if it is determined that the type of the data item is one of the specified one or more types, performing a second chunking of the data item that is based on the particular content portions of the data item.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for transferring electronic data. In general, one aspect of the subject matter described in this specification can be embodied in methods that include the actions of identifying a data item to be chunked; determining the type of the data item; determining whether the type of the data item is one of a specified one or more types; if it is determined that the type of the data item is not one of the specified one or more types, performing a first chunking of the data item; and if it is determined that the type of the data item is one of the specified one or more types, performing a second chunking of the data item that is based on the particular content portions of the data item.
34 Citations
26 Claims
-
1. A method performed by data processing apparatus, the method comprising:
-
identifying a data item to be chunked; determining the type of the data item; determining whether the type of the data item is one of a specified one or more types; if it is determined that the type of the data item is not one of the specified one or more types, performing a first chunking of the data item; and if it is determined that the type of the data item is one of the specified one or more types, performing a second chunking of the data item that is based on the particular content portions of the data item. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 11, 12)
-
-
10. A method performed by data processing apparatus, the method comprising:
-
receiving a data item to be chunked; identifying a type associated with the data item; using the identified type to introspect the data of the data item and build a content based map of the data item; using the content based map to identify a separate chunking to be performed for different content portions in the data item; and chunking the data item based on the content in the data item.
-
-
13. A computer storage medium encoded with a computer program, the program comprising instructions that when executed by data processing apparatus cause the data processing apparatus to perform operations comprising:
-
identifying a data item to be chunked; determining the type of the data item; determining whether the type of the data item is one of a specified one or more types; if it is determined that the type of the data item is not one of the specified one or more types, performing a first chunking of the data item; and if it is determined that the type of the data item is one of the specified one or more types, performing a second chunking of the data item that is based on the particular content portions of the data item.
-
-
14. A computer storage medium encoded with a computer program, the program comprising instructions that when executed by data processing apparatus cause the data processing apparatus to perform operations comprising:
-
receiving a data item to be chunked; identifying a type associated with the data item; using the identified type to introspect the data of the data item and build a content based map of the data item; using the content based map to identify a separate chunking to be performed for different content portions in the data item; and chunking the data item based on the content in the data item.
-
-
15. A system comprising:
-
one or more computing devices operable to perform operations comprising; identifying a data item to be chunked; determining the type of the data item; determining whether the type of the data item is one of a specified one or more types; if it is determined that the type of the data item is not one of the specified one or more types, performing a first chunking of the data item; and if it is determined that the type of the data item is one of the specified one or more types, performing a second chunking of the data item that is based on the particular content portions of the data item. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22, 23)
-
-
24. A system comprising:
one or more computing devices operable to perform operations comprising; receiving a data item to be chunked; identifying a type associated with the data item; using the identified type to introspect the data of the data item and build a content based map of the data item; using the content based map to identify a separate chunking to be performed for different content portions in the data item; and chunking the data item based on the content in the data item. - View Dependent Claims (25, 26)
Specification