Systems and methods for expedited large file processing
First Claim
1. A system for processing a large file, comprising:
one or more processors; and
a memory in communication with the one or more processors and storing instructions that, when executed by the one or more processors, are configured to cause the system to:
receive record data comprising a plurality of records, each of the plurality of records comprising an identification value in a common field, the common field having a data format comprising a sequence of characters and each of the identification values corresponding to a name;
determine, based on a comparison of a size of the record data to a predetermined size threshold, a focus value order of magnitude;
determine, based on the data format and the focus value order of magnitude, a plurality of unique focus values, each of the plurality of focus values corresponding to a sub-group of the plurality of records, wherein the plurality of focus values comprise a set of letters from A to Z with each of the plurality of focus values corresponding to a specified portion of the sequence of characters in the data format;
create a plurality of virtual processing units, each associated with a unique one of the plurality of focus values; and
responsive to searching the record data, process, by each of the plurality of virtual processing units, the respective sub-group of the plurality of records that corresponds to the focus value associated with the respective virtual processing unit.
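The steps recited in claim 1 can be sketched in Python as follows. This is only an illustrative reading of the claim language, not the patented implementation: the threshold value, the rule mapping record count to an order of magnitude, the `name` field, and the use of threads as stand-ins for virtual processing units are all assumptions introduced here.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import product
from string import ascii_uppercase

SIZE_THRESHOLD = 1_000  # hypothetical threshold, measured in records


def focus_order_of_magnitude(record_count, threshold=SIZE_THRESHOLD):
    """Assumed rule: one leading letter up to the threshold, two above it."""
    return 1 if record_count <= threshold else 2


def focus_values(order):
    """All unique letter sequences of the given length (A..Z for order 1)."""
    return ["".join(p) for p in product(ascii_uppercase, repeat=order)]


def process_subgroup(focus, records):
    """Stand-in for one virtual processing unit.

    It searches the record data and handles only the records whose
    identification value starts with its own focus value.
    """
    matched = [r for r in records if r["name"].upper().startswith(focus)]
    return focus, len(matched)


def process_file(records):
    order = focus_order_of_magnitude(len(records))
    values = focus_values(order)
    # One task per focus value; the pool size is an arbitrary choice.
    with ThreadPoolExecutor(max_workers=8) as pool:
        results = pool.map(lambda f: process_subgroup(f, records), values)
        # Keep only the focus values that matched at least one record.
        return {focus: count for focus, count in results if count}


records = [{"name": n} for n in ["Adams", "Baker", "Brown", "Clark"]]
summary = process_file(records)
print(summary)  # {'A': 1, 'B': 2, 'C': 1}
```

With four records the size is below the assumed threshold, so the sketch uses the 26 single-letter focus values; a larger input would switch to longer prefixes and therefore more, smaller sub-groups.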
Abstract
A system includes one or more memory devices storing instructions, and one or more processors configured to execute the instructions to perform steps of a method for processing a large file. The system may receive record data comprising a plurality of records having an identification value in a common field having a data format. The system may determine a plurality of focus values based on the data format and create a plurality of virtual processing units based on the plurality of focus values. Each of the plurality of virtual processing units may process a sub-group of the plurality of records that corresponds to the focus value associated with the respective virtual processing unit.
134 Citations
20 Claims
1. A system for processing a large file, as recited in full under First Claim above.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A system for processing a large file, comprising:
one or more processors; and
a memory in communication with the one or more processors and storing instructions that, when executed by the one or more processors, are configured to cause the system to:
receive record data comprising a plurality of records, each of the plurality of records comprising an identification value in a common field, the common field having a data format comprising a sequence of characters and each of the identification values corresponding to a name;
determine, based on a comparison of a size of the record data to a predetermined size threshold, a focus value order of magnitude;
determine, based on the data format and the focus value order of magnitude, a plurality of focus values comprising at least a first focus value and a second focus value, each of the plurality of focus values corresponding to a sub-group of the plurality of records such that the first focus value corresponds to a first sub-group of the plurality of records and the second focus value corresponds to a second sub-group of the plurality of records, wherein the plurality of focus values comprise a set of unique two-character sequences where each unique two-character sequence comprises a first letter followed by a second letter with each of the plurality of focus values corresponding to a specified portion of the sequence of characters in the data format;
create a first virtual processing unit for processing the first sub-group of the plurality of records corresponding to the first focus value;
create a second virtual processing unit for processing the second sub-group of the plurality of records corresponding to the second focus value; and
process the first and second sub-groups of the plurality of records via the first and second virtual processing units, respectively.
Dependent claims: 11, 12, 13, 14, 15
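As a rough illustration of claim 10, the sketch below builds the two-letter focus values and starts one worker for a first focus value and one for a second. The sample names, the choice of the first two characters as the "specified portion", and the use of `threading.Thread` as a stand-in for a virtual processing unit are assumptions made for the example.

```python
import threading
from itertools import product
from string import ascii_uppercase

# 26 * 26 = 676 unique two-letter focus values: "AA", "AB", ..., "ZZ".
TWO_LETTER_FOCUS_VALUES = ["".join(pair) for pair in product(ascii_uppercase, repeat=2)]


def focus_of(name):
    """Specified portion of the field; assumed here to be the first two characters."""
    return name[:2].upper()


def run_unit(focus, records, results):
    """Stand-in for one virtual processing unit handling a single sub-group."""
    results[focus] = sorted(r for r in records if focus_of(r) == focus)


names = ["Smith", "Smythe", "Snow", "Snyder", "Small"]
first_focus, second_focus = "SM", "SN"
results = {}

# One unit per focus value, each working on its own sub-group.
units = [
    threading.Thread(target=run_unit, args=(focus, names, results))
    for focus in (first_focus, second_focus)
]
for unit in units:
    unit.start()
for unit in units:
    unit.join()

print(results["SM"])  # ['Small', 'Smith', 'Smythe']
print(results["SN"])  # ['Snow', 'Snyder']
```

Because the two sub-groups are disjoint, the units never touch the same record, which is what allows them to run side by side without coordination.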
16. A method of processing a large file comprising:
receiving record data comprising a plurality of records, each of the plurality of records comprising an identification value in a common field, the common field having a data format comprising a multi-digit sequence;
determining, based on a comparison of a size of the record data to a predetermined size threshold, a focus value order of magnitude;
determining, based on the data format and the focus value order of magnitude, a plurality of unique focus values, each of the plurality of focus values corresponding to a sub-group of the plurality of records, wherein the plurality of unique focus values comprise a set of unique two-digit numbers ranging from 00 to 99 with each of the plurality of focus values corresponding to a specified portion of the multi-digit sequence in the data format;
creating a plurality of virtual processing units, each associated with a unique one of the plurality of focus values; and
responsive to searching the record data, processing, by each of the plurality of virtual processing units, the respective sub-group of the plurality of records that corresponds to the focus value associated with the respective virtual processing unit.
Dependent claims: 17, 18, 19, 20
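For the numeric case in claim 16, the partitioning step might look like the sketch below. The identifiers are made-up sample data, and taking the last two digits as the "specified portion of the multi-digit sequence" is an assumption; the claim does not say which digits are used.

```python
from collections import defaultdict

# The 100 unique two-digit focus values "00" through "99".
FOCUS_VALUES = [f"{n:02d}" for n in range(100)]


def focus_of(identifier):
    """Specified portion of the multi-digit sequence; assumed to be the last two digits."""
    return identifier[-2:]


def partition(identifiers):
    """Group identifiers into sub-groups keyed by their two-digit focus value."""
    groups = defaultdict(list)
    for identifier in identifiers:
        groups[focus_of(identifier)].append(identifier)
    return dict(groups)


ids = ["0012345610", "0099990010", "0011223307"]
groups = partition(ids)
print(groups["10"])  # ['0012345610', '0099990010']
print(groups["07"])  # ['0011223307']
```

Each resulting sub-group would then be handed to the processing unit associated with its focus value (not shown here).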
Specification