Converting data into natural language form
First Claim
1. A method implemented in a computer infrastructure, comprising:
obtaining document data from a document;
applying a first keyword translation to the document data;
translating, using a translation engine, a product of the first keyword translation to a natural language form;
applying a second keyword translation to a product of the natural language form translation, wherein the translation engine is provided with programming for determining how much document data is required for making a successful determination of data types prior to a translation to the natural language form; and
training a natural language engine by consuming the product of the second keyword translation.
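The sequence of steps in claim 1 can be pictured as a small pipeline. The following is a minimal sketch only, not the patented implementation: the keyword tables, the `KEY: value; KEY: value` input format, the sentence template, and the word-counting `ToyLanguageEngine` are all hypothetical stand-ins chosen for illustration, since the claim does not specify any of them.

```python
import re

# Hypothetical keyword tables; the claim does not say what they contain.
FIRST_KEYWORDS = {"QTY": "quantity", "AMT": "amount", "INV": "invoice"}
SECOND_KEYWORDS = {"quantity": "number of units"}


def keyword_translate(text, table):
    """Replace whole-word keywords in text using the given table."""
    if not table:
        return text
    pattern = re.compile(r"\b(" + "|".join(map(re.escape, table)) + r")\b")
    return pattern.sub(lambda m: table[m.group(1)], text)


def to_natural_language(record):
    """Stand-in translation engine: turn 'key: value' pairs into a sentence."""
    parts = []
    for field in record.split(";"):
        key, _, value = field.partition(":")
        if key.strip() and value.strip():
            parts.append(f"the {key.strip()} is {value.strip()}")
    return ("In this document, " + ", and ".join(parts) + ".") if parts else ""


class ToyLanguageEngine:
    """Stand-in for the natural language engine: it only counts words."""

    def __init__(self):
        self.vocabulary = {}

    def train(self, sentence):
        for word in re.findall(r"[a-z]+", sentence.lower()):
            self.vocabulary[word] = self.vocabulary.get(word, 0) + 1


def run_pipeline(document_data, engine):
    step1 = keyword_translate(document_data, FIRST_KEYWORDS)  # first keyword translation
    step2 = to_natural_language(step1)                        # natural language form
    step3 = keyword_translate(step2, SECOND_KEYWORDS)         # second keyword translation
    engine.train(step3)                                       # engine consumes the product
    return step3
```

Calling `run_pipeline("INV: 1042; QTY: 3", ToyLanguageEngine())` passes the record through each stage in the order the claim recites. The point of the sketch is the ordering: keyword substitution happens both before and after the natural language translation, and only the final product is used for training.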
Abstract
Converting technical data from field oriented electronic data sources into natural language form is disclosed. An approach includes obtaining document data from an input document, wherein the document data is in a non-natural language form. The approach includes determining a data type of the document data from one of a plurality of data types defined in a detection and conversion database. The approach includes translating the document data to a natural language form based on the determined data type. The approach additionally includes outputting the translated document data in natural language form to an output data stream.
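The abstract's detect-then-translate flow might look roughly like the sketch below. It assumes a pipe-delimited record format (`HDR|...`, `FLD|...`, `SIG|...`) and models the "detection and conversion database" as an in-memory list of patterns and sentence templates. Both are assumptions made for illustration; the abstract does not describe the database layout or the input format.

```python
import io
import re

# Hypothetical detection and conversion database: each entry pairs a data
# type with a detection pattern and a natural language template.
DETECTION_DB = [
    ("document_header",
     re.compile(r"^HDR\|(?P<title>[^|]+)\|(?P<date>[^|]+)$"),
     "This document is titled {title} and is dated {date}."),
    ("document_field",
     re.compile(r"^FLD\|(?P<name>[^|]+)\|(?P<value>[^|]+)$"),
     "The {name} is {value}."),
    ("signature",
     re.compile(r"^SIG\|(?P<signer>[^|]+)$"),
     "The document was signed by {signer}."),
]


def determine_data_type(line):
    """Return (data_type, match, template) for the first matching entry, else None."""
    for data_type, pattern, template in DETECTION_DB:
        match = pattern.match(line)
        if match:
            return data_type, match, template
    return None


def convert(lines, output):
    """Translate each recognized line and write it to the output stream."""
    for line in lines:
        found = determine_data_type(line.strip())
        if found is None:
            continue  # unrecognized data is skipped in this sketch
        _, match, template = found
        output.write(template.format(**match.groupdict()) + "\n")
```

Here `output` can be any writable text stream, such as an `io.StringIO` buffer or an open file. Keeping detection rules and templates in one table means a new data type is added by appending an entry rather than changing the conversion loop.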
20 Claims
1. A method implemented in a computer infrastructure, comprising:
obtaining document data from a document;
applying a first keyword translation to the document data;
translating, using a translation engine, a product of the first keyword translation to a natural language form;
applying a second keyword translation to a product of the natural language form translation, wherein the translation engine is provided with programming for determining how much document data is required for making a successful determination of data types prior to a translation to the natural language form; and
training a natural language engine by consuming the product of the second keyword translation.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19
20. A computer program product comprising a non-transitory computer usable tangible storage medium having readable program code embodied in the tangible storage medium, the computer program product includes at least one component configured to:
obtain portions of document data that are in a non-natural language form from a document;
for the portions, perform the steps of:
applying a first keyword translation to the portion;
translate the product of the first keyword translation to a natural language form, wherein a plurality of data types comprises:
a document header data;
a document field data;
a table header data;
a table detail data; and
a signature data;
applying a second keyword translation to the product of the translation to the natural language form; and
determine whether an end of the document has been reached and concurrently place a product of the second keyword translation onto an output data stream; and
training a natural language engine by consuming the output data stream.
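The streaming aspect of claim 20, where translated portions are placed on an output data stream while the end-of-document check runs and a trainer consumes that stream, could be approximated with a producer thread and a queue. This is a rough sketch under assumptions: `translate_portion` is a hypothetical placeholder for the three per-portion translation steps, the queue stands in for the output data stream, and the "training" is reduced to collecting sentences in a list.

```python
import queue
import threading

END_OF_DOCUMENT = object()  # sentinel: the end of the document has been reached


def translate_portion(portion):
    """Hypothetical placeholder for the per-portion translation steps."""
    return f"The record states: {portion.strip().lower()}."


def produce(portions, stream):
    """Translate each portion and place the product on the output stream."""
    for portion in portions:
        stream.put(translate_portion(portion))
    stream.put(END_OF_DOCUMENT)  # signal that no more portions remain


def consume(stream, training_corpus):
    """Stand-in for training: consume the stream until the sentinel arrives."""
    while True:
        item = stream.get()
        if item is END_OF_DOCUMENT:
            break
        training_corpus.append(item)


def run(portions):
    stream = queue.Queue()
    corpus = []
    producer = threading.Thread(target=produce, args=(portions, stream))
    producer.start()
    consume(stream, corpus)  # runs while the producer is still translating
    producer.join()
    return corpus
```

A sentinel object is used here rather than a length check so that the consumer never needs to know the document size in advance; it simply reads until the producer reports the end of the document.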
Specification