Method and apparatus for generating a linguistic representation of raw input data
First Claim
1. A computer-implemented method for generating an output text by transforming received raw input data into a format that can be linguistically expressed in the output text comprising:
- selecting, using a processor, a schema from a schema store, wherein the schema is selected based on one or more messages in a message store, each of the one or more messages being a language independent data structure, each message corresponding to and selected in response to an existence of at least one fact about the received raw input data, the received raw input data expressed in a non-linguistic format;
generating, using the processor, a document plan that is instantiated with the one or more messages, wherein the document plan is instantiated according to the schema that comprises a query set, at least one query in the query set configured for selecting any number of the one or more messages in the message store;
altering, using the processor, the document plan according to an optimization specification, wherein the optimization specification comprises a set of one or more rules, the rules comprising at least one rule for arranging the one or more messages of the document plan based on an optimization function; and
generating, using the processor, an output text for display on a user interface based on the altered document plan, the output text being a linguistic representation of the raw input data.
4 Assignments
0 Petitions
Accused Products
Abstract
Methods, apparatuses, and computer program products are described herein that are configured to be embodied as and/or performed by a document planner. In some example embodiments, a method is provided that comprises selecting a schema based on one or more messages available in a message store and using the selected schema and one or more messages available in the message store to generate a document plan. The schema of this embodiment may be defined by a specification containing one or more queries for selecting one or more messages, one or more messages, and/or one or more predefined phrases to instantiate a document plan. The method of this embodiment may also include applying an optimization specification to the document plan to generate an optimized document plan.
339 Citations
20 Claims
-
1. A computer-implemented method for generating an output text by transforming received raw input data into a format that can be linguistically expressed in the output text comprising:
-
selecting, using a processor, a schema from a schema store, wherein the schema is selected based on one or more messages in a message store, each of the one or more messages being a language independent data structure, each message corresponding to and selected in response to an existence of at least one fact about the received raw input data, the received raw input data expressed in a non-linguistic format; generating, using the processor, a document plan that is instantiated with the one or more messages, wherein the document plan is instantiated according to the schema that comprises a query set, at least one query in the query set configured for selecting any number of the one or more messages in the message store; altering, using the processor, the document plan according to an optimization specification, wherein the optimization specification comprises a set of one or more rules, the rules comprising at least one rule for arranging the one or more messages of the document plan based on an optimization function; and generating, using the processor, an output text for display on a user interface based on the altered document plan, the output text being a linguistic representation of the raw input data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. An apparatus that is configured to generate an output text by transforming received raw input data into a format that can be linguistically expressed in the output text, the apparatus comprising:
-
at least one processor; and at least one memory including computer program code, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus to at least; select, using the at least one processor, a schema from a schema store, wherein the schema is selected based on one or more messages in a message store, each of the one or more messages being a language independent data structure, each message corresponding to and selected in response to an existence of at least one fact about the received raw input data, the received raw input data expressed in a non-linguistic format; generate, using the at least one processor, a document plan that is instantiated with the one or more messages, wherein the document plan is instantiated according to the schema that comprises a query set, at least one query in the query set configured for selecting any number of the one or more messages in the message store; alter, using the at least one processor, the document plan in accordance with an optimization specification, wherein the optimization specification comprises a set of one or more rules, the rules comprising at least one rule for arranging the one or more messages of the document plan based on an optimization function; and generate, using the at least one processor, an output text for display on a user interface based on the altered document plan, the output text being a linguistic representation of the raw input data. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A computer program product that is configured to generate an output text by transforming received raw input data into a format that can be linguistically expressed in the output text, the computer program product comprising:
-
at least one computer readable non-transitory memory medium having program code instructions stored thereon, the program code instructions which when executed by an apparatus having at least one processor cause the apparatus at least to; select, using the at least one processor, a schema from a schema store, wherein the schema is selected based on one or more messages in a message store, each of the one or more messages being a language independent data structure, each message corresponding to and selected in response to an existence of at least one fact about the received raw input data, the received raw input data expressed in a non-linguistic format; generate, using the at least one processor, a document plan that is instantiated with the one or more messages, wherein the document plan is instantiated according to the schema that comprises a query set, at least one query in the query set configured for selecting any number of the one or more messages in the message store; alter, using the at least one processor, the document plan in accordance with an optimization specification, wherein the optimization specification comprises a set of one or more rules, the rules comprising at least one rule for arranging the one or more messages of the document plan based on an optimization function; and generate, using the at least one processor, an output text for display on a user interface based on the altered document plan the output text being a linguistic representation of the raw input data.
-
Specification