Creation of structured data from plain text
First Claim
Patent Images
9. A method for creating structured data representation from a plain text description for an application domain, the method comprising:
- constructing a model which defines objects of the application domain and relationships between the objects, and identifies the objects with a vocabulary;
parsing the plain text using a grammar and the vocabulary to generate a parse tree;
mapping the parse tree onto an instance tree of objects of the application domain using the model; and
generating structured data based on the mapping.
3 Assignments
0 Petitions
Accused Products
Abstract
A method and system for converting plain text into structured data. Parse trees for the plain text are generated based on the grammar of a natural language, the parse trees are mapped on to instance trees generated based on an application-specific model. The best map is chosen, and the instance tree is passing to an application for execution. The method and system can be used both for populating a database and/or for retrieving data from a database based on a query.
165 Citations
22 Claims
-
9. A method for creating structured data representation from a plain text description for an application domain, the method comprising:
-
constructing a model which defines objects of the application domain and relationships between the objects, and identifies the objects with a vocabulary;
parsing the plain text using a grammar and the vocabulary to generate a parse tree;
mapping the parse tree onto an instance tree of objects of the application domain using the model; and
generating structured data based on the mapping.
-
-
10. A method for creating structured data representation from a plain text description for an application domain, the method comprising:
-
constructing a model which defines objects of the application domain and relationships between the objects, and identifies the objects with a vocabulary;
parsing the plain text using a grammar and the vocabulary to generate a plurality of parse trees;
generating, based on the model, a plurality of instance trees comprising an instance tree generated corresponding to each parse tree;
choosing one instance tree from the plurality of instance trees; and
generating structured data based on the chosen instance tree.
-
-
11. A method for creating structured data representation from a plain text description for an application, the method comprising:
-
constructing a model which defines objects of the application domain and relationships between the objects, and identifies the objects with a vocabulary;
parsing the plain text using a grammar and the vocabulary to generate a plurality of parse trees;
generating, based on the model, a plurality of instance trees comprising an instance tree generated corresponding to each parse tree;
pruning the plurality of instance trees to create a second plurality of instance trees;
choosing one instance tree from the second plurality of instance trees; and
generating structured data based on the chosen instance tree.
-
-
12. A system for creating structured data representation from a plain text description for an application domain, the system comprising:
-
a parser for parsing the plain text using a grammar and a vocabulary to generate a plurality of parse trees;
a mapper communicatively coupled to the parser for mapping the plurality of parse trees onto a plurality of instance trees based on a model which defines objects of the application domain and relationships between the objects, and identifies the objects with the vocabulary; and
an output simplifier communicatively coupled to the mapper, wherein instructions to the output simplifier are included in the model. - View Dependent Claims (1, 2, 3, 4, 5, 6, 7, 8, 13, 14, 15, 16)
-
-
14-1. The method of claim 12, further comprising:
a vocabulary storage for storing vocabulary specific to the application.
-
17. A system for creating structured data representation from a plain text description for an application domain, the system comprising:
a content engine comprising;
a parser for parsing the plain text using a grammar and a vocabulary to generate a plurality of parse trees;
a mapper communicatively coupled to the parser, for mapping the plurality of parse trees onto a plurality of instance trees based on a model; and
an output simplifier communicatively coupled to the mapper, wherein instructions to the output simplifier are included in the model;
a grammar storage communicatively coupled to the content engine for providing the content engine with the grammar;
a model storage communicatively coupled to the content engine for providing the content engine with the model which defines objects of the application domain and relationships between the objects, and identifies the objects with a vocabulary; and
a vocabulary storage for providing the content engine with the vocabulary.
-
18. A system for creating structured data representation from a plain text description for an application domain, the system comprising:
-
a parser for parsing the plain text using a grammar and a vocabulary to generate a parse tree;
a mapper communicatively coupled to the parser for mapping the parse tree onto an instance tree based on a model which defines objects of the application domain and relationships between the objects, and identifies the objects with the vocabulary; and
an output simplifier communicatively coupled to the mapper, wherein instructions to the output simplifier are included in the model.
-
-
19. A computer program product for storing a program for permitting a computer to perform a method for creating structured data representation from a plain text description for an application domain, the method comprising:
-
constructing a model which defines objects of the application domain and relationships between the objects, and identifies the objects with a vocabulary;
parsing the plain text using a grammar and the vocabulary to generate a plurality of parse trees;
mapping the plurality of parse trees onto a plurality of instance trees created using the model; and
generating structured data based on the mapping.
-
-
20. A computer program product for storing a program for permitting a computer to perform a method for creating structured data representation from a plain text description for an application domain, the method comprising:
-
constructing a model which defines objects of the application domain and relationships between the objects, and identifies the objects with a vocabulary;
parsing the plain text using a grammar and the vocabulary to generate a parse tree;
mapping the parse tree onto an instance tree of objects of the application domain using the model; and
generating structured data based on the mapping.
-
-
21. A computer program product for storing a program for permitting a computer to perform a method for creating structured data representation from a plain text description for an application domain, the method comprising:
-
constructing a model which defines objects of the application domain and relationships between the objects, and identifies the objects with a vocabulary;
parsing the plain text using a grammar and the vocabulary to generate a plurality of parse trees;
generating, based on the model, a plurality of instance trees comprising an instance tree generated corresponding to each parse tree;
choosing one instance tree from the plurality of instance trees; and
generating structured data based on the chosen instance tree.
-
-
22. A computer program product for storing a program for permitting a computer to perform a method for creating structured data representation from a plain text description for an application, the method comprising:
-
constructing a model which defines objects of the application domain and relationships between the objects, and identifies the objects with a vocabulary;
parsing the plain text using a grammar and the vocabulary to generate a plurality of parse trees;
generating, based on the model, a plurality of instance trees comprising an instance tree generated corresponding to each parse tree;
pruning the plurality of instance trees to create a second plurality of instance trees;
choosing one instance tree from the second plurality of instance trees; and
generating structured data based on the chosen instance tree.
-
Specification