Auto summarization of content
First Claim
Patent Images
1. A method of summarizing electronic data files during a storage event in a storage operation cell, the method comprising:
- in the storage operation cell, which is managed by a storage manager component,implementing a storage event for a primary copy of an electronic data file that originated on a client computer, wherein the storage event comprises a migration operation of the electronic data file, wherein the migration operation generates a duplicate copy of the electronic data file, which is designated a duplicate data file, wherein the duplicate data file is stored, by a media agent component of the storage operation cell, to a secondary storage device in the storage operation cell;
during the storage event, analyzing the duplicate data file and summarizing the contents of the duplicate data file into a summary, wherein a summary agent component of the storage operation cell performs the analyzing and the summarizing, based on applying fuzzy logic, to synthesize the contents of the duplicate data file into the summary, wherein the summary is substantially smaller than the primary copy of the electronic data file, and further wherein the summary comprises at least one link to the duplicate data file;
storing the summary to at least one component of the storage operation cell; and
transmitting, in response to a keyword search directed at least in part to the duplicate data file in the secondary storage device, at least a portion of the summary of the contents of the duplicate data file.
2 Assignments
0 Petitions
Accused Products
Abstract
A method of summarizing data files includes implementing, at a server, a storage event for a data file, analyzing the data file and creating a summary of the data file, and storing the summary linked to the data file.
461 Citations
20 Claims
-
1. A method of summarizing electronic data files during a storage event in a storage operation cell, the method comprising:
-
in the storage operation cell, which is managed by a storage manager component, implementing a storage event for a primary copy of an electronic data file that originated on a client computer, wherein the storage event comprises a migration operation of the electronic data file, wherein the migration operation generates a duplicate copy of the electronic data file, which is designated a duplicate data file, wherein the duplicate data file is stored, by a media agent component of the storage operation cell, to a secondary storage device in the storage operation cell; during the storage event, analyzing the duplicate data file and summarizing the contents of the duplicate data file into a summary, wherein a summary agent component of the storage operation cell performs the analyzing and the summarizing, based on applying fuzzy logic, to synthesize the contents of the duplicate data file into the summary, wherein the summary is substantially smaller than the primary copy of the electronic data file, and further wherein the summary comprises at least one link to the duplicate data file; storing the summary to at least one component of the storage operation cell; and transmitting, in response to a keyword search directed at least in part to the duplicate data file in the secondary storage device, at least a portion of the summary of the contents of the duplicate data file. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A method of retrieving summaries of electronic data files in a storage operation cell, the method comprising:
-
in the storage operation cell, which is managed by a storage manager component, implementing a storage event for a primary copy of an electronic data file that originated on a client computer, wherein the storage event comprises a backup operation of the electronic data file, wherein the backup operation generates a duplicate copy of the electronic data file, which is designated a duplicate data file, wherein the duplicate data file is stored, by a media agent component of the storage operation cell, to a secondary storage device in the storage operation cell; during the storage event, analyzing the duplicate data file and summarizing the contents of the duplicate data file into a summary, wherein a summary agent component of the storage operation cell performs the analyzing and the summarizing, based on applying fuzzy logic to synthesize the contents of the duplicate data file into the summary, wherein the summary is substantially smaller than the primary copy of the electronic data file, and further wherein the summary comprises at least one link to the duplicate data file; storing the summary to a summary store component of the storage operation cell, wherein the summary store component is distinct from the secondary storage device storing the duplicate data file; searching the summary store based on a keyword search; and transmitting, in response to the keyword search, at least a portion of the summary of the contents of the duplicate data file as extracted from the summary store. - View Dependent Claims (15, 16, 17, 18)
-
-
19. A storage operation cell for summarizing electronic data files during a storage event, the storage operation cell comprising:
-
a storage manager, configured to implement a storage event for a primary copy of an electronic data file that originated on a client computer, wherein the storage event comprises a backup operation of the electronic data file, and wherein the backup operation generates a duplicate copy of the electronic data file, which is designated a duplicate data file; a media agent configured to store the duplicate data file to a secondary storage device in the storage operation cell; a summary agent configured to; (i) during the storage event, based on fuzzy logic, synthesize the contents of the duplicate data file into a summary, wherein the summary is substantially smaller than the primary copy of the electronic data file, and further wherein the summary comprises at least one link to the duplicate data file, and (ii) store the summary to a summary store component of the storage operation cell; and wherein the storage manager is also configured to; (a) direct a keyword search to at least one of the summary store and the secondary storage device, and (b) transmit in response to the keyword search, at least a portion of the summary of the contents of the duplicate data file. - View Dependent Claims (20)
-
Specification