System and method for data management
First Claim
1. A data management system, comprising:
a first server processor for receiving a plurality of received data files, the data files being capable of being different file types;
a file organizing/categorizing processor for organizing the received data files, based on a predetermined list, into a source directory structure including at least one source directory, and a corresponding destination directory structure including at least one destination directory;
a file logging processor for logging the received data files into a database formed by the source directory structure and identifying a file type of the received data files;
a de-duplicate processor for calculating a value of the received data files to determine whether the received data files have duplicates and flagging duplicated data files in the database;
a plurality of image conversion processors for converting the remaining, de-duplicated, data files into image files, respectively; and
a second server processor for exporting the image files to the destination directory structure;
wherein the file logging processor, the image conversion processors, and the second server processor are parallel processors such that the data files are parallel-processed in a data file logging stage, an image conversion stage, and an image file output stage; and
wherein each of the image conversion processors is capable of converting the data files having the same file type into the corresponding image files.
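The final clause above assigns data files of one file type to one image conversion processor. A minimal sketch of that routing step follows, assuming the file type can be approximated by the lowercase file extension. The claim does not say how the type is identified, and the function name `group_by_file_type` is hypothetical.

```python
from collections import defaultdict
from pathlib import PurePath


def group_by_file_type(file_names):
    """Group file names into one batch per file type.

    Assumption: the lowercase extension stands in for the identified
    file type; the claim does not specify the identification method.
    """
    groups = defaultdict(list)
    for name in file_names:
        # Files without an extension go into a separate "(none)" batch.
        file_type = PurePath(name).suffix.lower() or "(none)"
        groups[file_type].append(name)
    return dict(groups)


# Each batch could then be handed to the converter for that file type.
batches = group_by_file_type(["a.doc", "b.XLS", "c.doc", "README"])
```

Here `batches` holds one list per file type, so a converter dedicated to `.doc` files would receive only `a.doc` and `c.doc`.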
Abstract
An automated data management system and method for logging, processing, and reporting a large volume of data having different file types, stored on different media, and/or run by different operating systems, includes a first server processor for restoring a plurality of received data files, the data files being capable of being different file types; a file organizing/categorizing processor for organizing the received data files, based on a predetermined user list, into a source directory structure and a destination directory structure; a file logging processor for logging the received data files into a database formed by the source and destination directory structures and identifying a file type of the received data files; a de-duplicate processor for calculating a SHA value of the received data files to determine whether the received data files have duplicates and flagging duplicated data files in the database; an image conversion processor for converting the remaining data files into image files, respectively; and a second server processor for exporting the image files.
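The de-duplication step described in the abstract can be sketched as hashing each file and flagging any file whose hash has already been seen. This is only an illustration under stated assumptions: the abstract says "SHA value" without naming a variant, so SHA-256 is assumed here, and an in-memory dictionary stands in for the database. The function name `flag_duplicates` and the record fields are hypothetical.

```python
import hashlib


def flag_duplicates(files):
    """Return a log record per file with its hash and a duplicate flag.

    files: dict mapping a file name to its content as bytes.
    Assumptions: SHA-256 is the "SHA value", and a plain dict stands in
    for the database that the abstract mentions.
    """
    first_seen = {}  # hash -> name of the first file with that content
    log = {}
    for name, content in files.items():
        digest = hashlib.sha256(content).hexdigest()
        original = first_seen.setdefault(digest, name)
        log[name] = {
            "sha": digest,
            # None marks a file that is kept; otherwise the earlier copy.
            "duplicate_of": None if original == name else original,
        }
    return log


log = flag_duplicates({"a.txt": b"hello", "b.txt": b"world", "c.txt": b"hello"})
remaining = [name for name, rec in log.items() if rec["duplicate_of"] is None]
```

Only the files in `remaining` would move on to image conversion; flagged files stay in the log so the duplicate relationship can still be reported.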
16 Claims
10. A data management method, comprising the steps of:
receiving a plurality of received data files, the data files being capable of being different file types;
organizing/categorizing the received data files, based on a predetermined list, into a source directory structure including at least one source directory, and a corresponding destination directory structure including at least one destination directory;
logging the received data files into a database formed by the source directory structure and identifying a file type of the received data files;
de-duplicating duplicates in the received data files by calculating a value of the received data files to determine whether the received data files have duplicates and flagging the duplicated data files in the database;
converting the remaining data files into image files, respectively, using a plurality of image conversion processors, each of the image conversion processors being capable of converting the data files having the same file type into the corresponding image files;
exporting the image files to the destination directory structure; and
parallel processing the steps of logging, converting, and exporting such that the data files are parallel-processed in a data file logging stage, an image conversion stage, and an image file output stage.
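The parallel processing of the logging, converting, and exporting steps can be pictured as a three-stage pipeline in which each stage works on a different file at the same time. The sketch below is one possible arrangement, not the patented implementation: it assumes one thread per stage connected by queues, whereas the claim recites a plurality of image conversion processors. The stage bodies are placeholders, and the `.tif` output name is an assumption, since the claim does not name an image format.

```python
import queue
import threading

SENTINEL = None  # marks the end of the input stream


def stage(worker, inbox, outbox):
    """Apply worker to each item from inbox and forward results to outbox."""
    while True:
        item = inbox.get()
        if item is SENTINEL:
            if outbox is not None:
                outbox.put(SENTINEL)  # tell the next stage to stop too
            break
        result = worker(item)
        if outbox is not None:
            outbox.put(result)


def run_pipeline(names):
    """Run logging, conversion, and export as three concurrent stages."""
    to_log, to_convert, to_export = queue.Queue(), queue.Queue(), queue.Queue()
    exported = []

    def log_file(name):   # placeholder for the data file logging stage
        return name

    def convert(name):    # placeholder for the image conversion stage
        return name.rsplit(".", 1)[0] + ".tif"

    def export(image):    # placeholder for the image file output stage
        exported.append(image)
        return image

    threads = [
        threading.Thread(target=stage, args=(log_file, to_log, to_convert)),
        threading.Thread(target=stage, args=(convert, to_convert, to_export)),
        threading.Thread(target=stage, args=(export, to_export, None)),
    ]
    for t in threads:
        t.start()
    for name in names:
        to_log.put(name)
    to_log.put(SENTINEL)
    for t in threads:
        t.join()
    return exported


images = run_pipeline(["a.doc", "b.xls"])
```

With one worker per stage and FIFO queues, files leave the pipeline in the order they entered. Adding more conversion workers, for example one per file type, would need one sentinel per worker and would no longer guarantee output order.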
Specification