Techniques for data extraction
First Claim
1. A method, comprising:
- retrieving an extraction job specification from an extraction job specification repository implemented on a first computing device, wherein the extraction job specification is defined in one or more configuration files, and wherein the extraction job specification comprises:
a source repository identifier that identifies a source repository comprising a plurality of data records;
a data recipient identifier that identifies a data recipient;
a schedule that indicates a timing of when to retrieve the plurality of data records;
wherein the one or more configuration files are implemented in one or more of:
extensible markup language (XML), YAML Ain't Markup Language (YAML), JavaScript Object Notation (JSON), and/or a markup language;
using the extraction job specification, retrieving, by a second computing device, the plurality of data records from the source repository based on the schedule;
using the extraction job specification, creating, by the second computing device, an extraction transaction from the plurality of data records, wherein the extraction transaction comprises a subset of the plurality of data records and metadata;
sending, by the second computing device, the extraction transaction to the data recipient; and
wherein the method is performed using one or more processors.
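As an illustration of the specification element of the claim, the following is a minimal sketch of a job specification written in JSON and loaded into a typed object. The field names (`source_repository`, `data_recipient`, `schedule`, `every_minutes`) and the example values are assumptions made for this sketch; the claim only requires that the specification carry a source repository identifier, a data recipient identifier, and a schedule, and it does not prescribe any particular layout.

```python
import json
from dataclasses import dataclass

# Hypothetical configuration file contents. The key names and values are
# illustrative assumptions, not taken from the patent.
SPEC_JSON = """
{
  "source_repository": "orders-db",
  "data_recipient": "analytics-server",
  "schedule": {"every_minutes": 30}
}
"""

REQUIRED_KEYS = {"source_repository", "data_recipient", "schedule"}


@dataclass(frozen=True)
class ExtractionJobSpec:
    source_repository: str  # identifies where the data records live
    data_recipient: str     # identifies who receives the extraction
    every_minutes: int      # simplified stand-in for the schedule


def load_spec(text: str) -> ExtractionJobSpec:
    """Parse a JSON job specification and check the three claimed elements."""
    raw = json.loads(text)
    missing = REQUIRED_KEYS - raw.keys()
    if missing:
        raise ValueError(f"job specification is missing: {sorted(missing)}")
    return ExtractionJobSpec(
        source_repository=raw["source_repository"],
        data_recipient=raw["data_recipient"],
        every_minutes=int(raw["schedule"]["every_minutes"]),
    )


spec = load_spec(SPEC_JSON)
```

The same three elements could equally be expressed in XML or YAML, which the claim also names; JSON is used here only because it can be parsed with the Python standard library.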
Abstract
Computer-implemented techniques for data extraction are described. The techniques include a method and system for retrieving an extraction job specification, wherein the extraction job specification comprises a source repository identifier that identifies a source repository comprising a plurality of data records; a data recipient identifier that identifies a data recipient; and a schedule that indicates a timing of when to retrieve the plurality of data records. The method and system further include retrieving the plurality of data records from the source repository based on the schedule, creating an extraction transaction from the plurality of data records, wherein the extraction transaction comprises a subset of the plurality of data records and metadata, and sending the extraction transaction to the data recipient.
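The flow described in the abstract (retrieve records, create an extraction transaction holding a subset of the records plus metadata, send it to the recipient) can be sketched as follows. This is a simplified illustration under stated assumptions, not the patented implementation: the class and function names, the metadata fields, and the choice of "first `batch_size` records" as the subset are all hypothetical, the source and recipient are in-memory stand-ins, and scheduling is left out.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Callable


@dataclass
class ExtractionTransaction:
    records: list[dict]                            # subset of the source records
    metadata: dict = field(default_factory=dict)   # descriptive data about the subset


def create_transaction(records: list[dict], source_id: str,
                       batch_size: int) -> ExtractionTransaction:
    """Bundle a subset of the records with metadata (hypothetical fields)."""
    subset = records[:batch_size]  # assumption: subset = first batch_size records
    return ExtractionTransaction(
        records=subset,
        metadata={
            "source_repository": source_id,
            "record_count": len(subset),
            "created_at": datetime.now(timezone.utc).isoformat(),
        },
    )


def run_extraction(fetch: Callable[[], list[dict]],
                   send: Callable[[ExtractionTransaction], None],
                   source_id: str, batch_size: int = 2) -> ExtractionTransaction:
    """Retrieve records, create one transaction, and send it to the recipient."""
    records = fetch()
    txn = create_transaction(records, source_id, batch_size)
    send(txn)
    return txn


# In-memory stand-ins for the source repository and the data recipient.
source = [{"id": 1}, {"id": 2}, {"id": 3}]
delivered: list[ExtractionTransaction] = []
txn = run_extraction(lambda: list(source), delivered.append, "orders-db")
```

Passing `fetch` and `send` as callables keeps the sketch independent of any particular database or transport; a real system would replace them with whatever the job specification's identifiers resolve to.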
18 Claims
1. A method, comprising:
retrieving an extraction job specification from an extraction job specification repository implemented on a first computing device, wherein the extraction job specification is defined in one or more configuration files, and wherein the extraction job specification comprises:
a source repository identifier that identifies a source repository comprising a plurality of data records;
a data recipient identifier that identifies a data recipient;
a schedule that indicates a timing of when to retrieve the plurality of data records;
wherein the one or more configuration files are implemented in one or more of:
extensible markup language (XML), YAML Ain't Markup Language (YAML), JavaScript Object Notation (JSON), and/or a markup language;
using the extraction job specification, retrieving, by a second computing device, the plurality of data records from the source repository based on the schedule;
using the extraction job specification, creating, by the second computing device, an extraction transaction from the plurality of data records, wherein the extraction transaction comprises a subset of the plurality of data records and metadata;
sending, by the second computing device, the extraction transaction to the data recipient; and
wherein the method is performed using one or more processors.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A computer system, comprising:
one or more digital data storage media;
one or more processors that are communicatively coupled to the storage media;
one or more programs stored in the storage media and configured for execution by the one or more processors, the one or more programs comprising instructions which when executed using the one or more processors cause the one or more processors to perform:
retrieving an extraction job specification from an extraction job specification repository implemented on a first computing device, wherein the extraction job specification is defined in one or more configuration files, and wherein the extraction job specification comprises:
a source repository identifier that identifies a source repository comprising a plurality of data records;
a data recipient identifier that identifies a data recipient;
a schedule that indicates a timing of when to retrieve the plurality of data records;
wherein the one or more configuration files are implemented in one or more of:
extensible markup language (XML), YAML Ain't Markup Language (YAML), JavaScript Object Notation (JSON), and/or a markup language;
using the extraction job specification, retrieving, by a second computing device, the plurality of data records from the source repository based on the schedule;
using the extraction job specification, creating, by the second computing device, an extraction transaction from the plurality of data records, wherein the extraction transaction comprises a subset of the plurality of data records and metadata; and
sending, by the second computing device, the extraction transaction to the data recipient.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
Specification