Converting data into natural language form
First Claim
Patent Images
1. A method implemented in a computer infrastructure, comprising:
- obtaining document data from an input document, wherein the document data is in a non-natural language form;
determining, from one of a plurality of data types, a data type of the document data via communication with a detection and conversion database;
translating, based on the determined data type, the document data to a natural language form; and
outputting, in natural language form, the translated document data to an output data stream;
wherein the plurality of data types comprises;
document header data;
document field data;
table header data;
table detail data; and
signature data.
2 Assignments
0 Petitions
Accused Products
Abstract
Converting technical data from field oriented electronic data sources into natural language form is disclosed. An approach includes obtaining document data from an input document, wherein the document data is in a non-natural language form. The approach includes determining a data type of the document data from one of a plurality of data types defined in a detection and conversion database. The approach includes translating the document data to a natural language form based on the determined data type. The approach additionally includes outputting the translated document data in natural language form to an output data stream.
-
Citations
24 Claims
-
1. A method implemented in a computer infrastructure, comprising:
-
obtaining document data from an input document, wherein the document data is in a non-natural language form; determining, from one of a plurality of data types, a data type of the document data via communication with a detection and conversion database; translating, based on the determined data type, the document data to a natural language form; and outputting, in natural language form, the translated document data to an output data stream; wherein the plurality of data types comprises;
document header data;
document field data;
table header data;
table detail data; and
signature data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system implemented in hardware, comprising:
-
a processor executing a translation engine that is configured to; obtain document data from an input document, wherein the document data is in a form other than a natural language form; determine, via communication with a detection and conversion database comprising records defining a plurality of data types, a data type of the document data; translate, based on the determined data type, the document data to natural language form; and output, in natural language form, the translated document data to an output data stream; wherein the plurality of data types comprises;
document header data;
document field data;
table header data;
table detail data; and
signature data. - View Dependent Claims (16, 17, 18)
-
-
19. A computer program product comprising a non-transitory computer usable tangible storage medium having readable program code embodied in the tangible storage medium, the computer program product includes at least one component configured to:
-
obtain plural portions of document data from an input document, wherein the plural portions of document data are in a non-natural language form; and for each one of the plural portions of document data, perform the steps of; determine, via communication with a detection and conversion database comprising records defining a plurality of data types, a data type of the one of the plural portions of document data; translate, based on the determined data type, the one of the plural portions of document data to a natural language form; and output, in natural language form, the translated one of the plural portions of document data to an output data stream; wherein the plurality of data types comprises;
document header data;
document field data;
table header data;
table detail data; and
signature data. - View Dependent Claims (20)
-
-
21. A computer system for translating an input document to a natural language form, the system comprising:
-
a CPU, a computer readable memory, and a computer readable storage media; first program instructions configured to obtain document data from an input document, wherein the document data is in a form other than a natural language form; second program instructions configured to determine, via communication with a detection and conversion database comprising records defining a plurality of data types, a data type of the document data; third program instructions configured to translate, based on the determined data type, the document data to natural language form; fourth program instructions configured to output, in natural language form, the translated document data to an output data stream; and fifth program instructions configured to apply a keyword translation to the document data before the translating the document data to natural language form, and apply a keyword translation to the translated document data after the translating the document data to natural language form; wherein the first, second, third, fourth, and fifth program instructions are stored on the computer readable storage media for execution by the CPU via the computer readable memory; each one of the records in the detection and conversion database is associated with a respective one of the plurality of data types; each one of the records comprises data used in comparisons against the document data during the determining; each one of the records further comprises at least one conversion rule applied to the document data during the translating; and the plurality of data types comprises;
document header data;
document field data;
table header data;
table detail data; and
signature data.
-
-
22. A method of deploying a system for translating documents to natural language form, comprising:
-
providing a computer infrastructure configured to; obtain, from an input document, document data wherein the document data is in a non-natural language form; determine, via communication with a detection and conversion database comprising records defining a plurality of data types, a data type of the document data; translate, based on the determine data type, the document data to a natural language form; and output, in natural language form, the translated document data in natural language form to an output data stream; wherein the plurality of data types comprises;
document header data;document field data;
table header data;
table detail data; and
signature data. - View Dependent Claims (23, 24)
-
Specification