Architecture and processes for computer learning and understanding
First Claim
1. A computer-implemented method, comprising:
forming, by a computing system, a semantic representation of a natural language story, the semantic representation using knowledge stored in a current world model that expresses beliefs about how natural language is understood, wherein the semantic representation includes multiple generative semantic primitive (GSP) structures, the multiple GSP structures include one or more predicates and one or more roles that indicate one or more beliefs regarding an understanding of the natural language story, wherein the current world model is developed over time through processing of multiple natural language stories;
generating, by the computing system, one or more questions to evaluate the semantic representation;
maintaining a dependency structure for linear dialogs to identify questions that are independent from one another;
submitting, from the computing system to a separate user computing device, the one or more questions for presentation to a human user;
receiving, by the computing system from the separate user computing device, one or more responses indicative of input from the human user when answering the one or more questions;
iterating through the dependency structure based at least in part on the one or more responses to identify a set of next questions and branch conditions indicating that at least two next questions of the set of next questions are independent of one another;
learning, by the computing system, at least one new GSP structure that indicates a new belief regarding the understanding of the natural language story, the new belief having an above threshold probability of being included in the multiple GSP structures that is associated with a theme or a context of the natural language story based at least in part on the one or more responses received from the separate user computing device; and
revising, by the computing system, the semantic representation of the natural language story based at least in part on the one or more responses received from the separate user computing device and the at least one new GSP structure.
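
The dependency-structure steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Question` class, its `requires` field (a branch condition mapping a prerequisite question to the answer that unlocks this one), and the function `next_questions` are hypothetical names chosen for illustration, and the sample questions are invented.

```python
from dataclasses import dataclass, field


@dataclass
class Question:
    """One node in a hypothetical dependency structure of dialog questions."""
    qid: str
    text: str
    # Branch conditions: prerequisite question id -> answer that unlocks this question.
    requires: dict = field(default_factory=dict)


def next_questions(questions, responses):
    """Return unanswered questions whose branch conditions are all satisfied.

    `responses` maps question id -> answer received so far. Every question
    returned together is still unanswered, so none of them can be waiting on
    another one in the same set; they may be asked in any order or in parallel.
    """
    ready = []
    for q in questions:
        if q.qid in responses:
            continue  # already answered
        if all(responses.get(dep) == ans for dep, ans in q.requires.items()):
            ready.append(q)
    return ready


# Invented example questions about a short restaurant story.
questions = [
    Question("q1", "Who brought the food?"),
    Question("q2", "Where does the story take place?"),
    Question("q3", "Is the food now at the table?", requires={"q1": "the waiter"}),
]

first_round = [q.qid for q in next_questions(questions, {})]
second_round = [q.qid for q in next_questions(questions, {"q1": "the waiter"})]
print(first_round, second_round)
```

In this sketch, `q1` and `q2` come back together in the first round because neither depends on the other, while `q3` appears only after `q1` receives the answer named in its branch condition.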
Abstract
An architecture and processes enable computer learning and developing an understanding of arbitrary natural language text through collaboration with humans in the context of joint problem solving. The architecture ingests the text and then syntactically and semantically processes the text to infer an initial understanding of the text. The initial understanding is captured in a story model of semantic and frame structures. The story model is then tested through computer generated questions that are posed to humans through interactive dialog sessions. The knowledge gleaned from the humans is used to update the story model as well as the computing system's current world model of understanding. The process is repeated for multiple stories over time, enabling the computing system to grow in knowledge and thereby understand stories of increasingly higher reading comprehension levels.
21 Claims
1. A computer-implemented method, comprising:
forming, by a computing system, a semantic representation of a natural language story, the semantic representation using knowledge stored in a current world model that expresses beliefs about how natural language is understood, wherein the semantic representation includes multiple generative semantic primitive (GSP) structures, the multiple GSP structures include one or more predicates and one or more roles that indicate one or more beliefs regarding an understanding of the natural language story, wherein the current world model is developed over time through processing of multiple natural language stories;
generating, by the computing system, one or more questions to evaluate the semantic representation;
maintaining a dependency structure for linear dialogs to identify questions that are independent from one another;
submitting, from the computing system to a separate user computing device, the one or more questions for presentation to a human user;
receiving, by the computing system from the separate user computing device, one or more responses indicative of input from the human user when answering the one or more questions;
iterating through the dependency structure based at least in part on the one or more responses to identify a set of next questions and branch conditions indicating that at least two next questions of the set of next questions are independent of one another;
learning, by the computing system, at least one new GSP structure that indicates a new belief regarding the understanding of the natural language story, the new belief having an above threshold probability of being included in the multiple GSP structures that is associated with a theme or a context of the natural language story based at least in part on the one or more responses received from the separate user computing device; and
revising, by the computing system, the semantic representation of the natural language story based at least in part on the one or more responses received from the separate user computing device and the at least one new GSP structure.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9

10. A computer-implemented method, comprising:
processing, over time by a computing system, multiple natural language stories to produce a current world model that includes a collection of generative semantic primitive (GSP) structures to express beliefs about how natural language is understood, wherein a GSP structure of the collection of GSP structures includes a predicate and one or more roles, wherein the collection of GSP structures is associated with a theme or a context of the multiple natural language stories;
maintaining the current world model in a datastore of the computing system;
receiving, by the computing system, a natural language story;
maintaining a dependency structure for linear dialogs to identify questions that are independent from one another;
engaging in a dialog session with a human user to receive input from the human user for use by the computing system to evaluate an extent to which the beliefs of the current world model produce an understanding of the natural language story, wherein engaging in the dialog session includes iterating through the dependency structure based at least in part on the input to identify a set of next questions; and
revising, by the computing system, the current world model by adding at least one new GSP structure with at least one of a new predicate or one or more new roles based at least in part on the input from the human user and the at least one new GSP structure having an above threshold probability of being included with a particular collection of GSP structures that is associated with a particular theme or a particular context of the natural language story.
Dependent claims: 11, 12, 13
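
The threshold-gated revision in claim 10 can be sketched as follows. This is an illustrative assumption, not the method disclosed in the specification: the claim does not say how the probability is computed, so the sketch substitutes a simple smoothed confirmation rate from human answers. The names `GSP`, `WorldModel`, `record_response`, `probability`, and `revise` are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GSP:
    """A generative semantic primitive: a predicate plus its named roles."""
    predicate: str
    roles: tuple  # e.g. ("agent", "object", "destination")


class WorldModel:
    """Hypothetical world model that groups accepted GSP structures by theme."""

    def __init__(self, threshold=0.5):
        self.threshold = threshold
        self.by_theme = {}       # theme -> set of accepted GSP structures
        self.confirmations = {}  # (theme, GSP) -> [times confirmed, times asked]

    def record_response(self, theme, gsp, confirmed):
        """Record one human answer about whether `gsp` fits stories of `theme`."""
        counts = self.confirmations.setdefault((theme, gsp), [0, 0])
        counts[0] += int(confirmed)
        counts[1] += 1

    def probability(self, theme, gsp):
        """Assumed estimate: confirmation rate with add-one (Laplace) smoothing."""
        confirmed, asked = self.confirmations.get((theme, gsp), (0, 0))
        return (confirmed + 1) / (asked + 2)

    def revise(self, theme, gsp):
        """Add `gsp` to the theme's collection only if it clears the threshold."""
        if self.probability(theme, gsp) > self.threshold:
            self.by_theme.setdefault(theme, set()).add(gsp)
            return True
        return False


model = WorldModel(threshold=0.6)
bring = GSP("bring", ("agent", "object", "destination"))
for _ in range(3):
    model.record_response("restaurant", bring, confirmed=True)
print(model.probability("restaurant", bring), model.revise("restaurant", bring))
```

Keying the counts by `(theme, GSP)` mirrors the claim language that ties a new structure to a particular theme or context; any other probability model could replace `probability` without changing the gate in `revise`.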

14. A computing system, comprising:
a datastore containing a current world model that expresses beliefs about how natural language is understood;
one or more processors to access the current world model maintained in the datastore; and
memory coupled to the one or more processors, the memory storing computer executable instructions that, when executed by the one or more processors, perform acts comprising:
  processing multiple natural language stories over time, the processing including performing syntactic analysis on the multiple natural language stories and subsequently inferring semantic structures as representations of the multiple natural language stories, in part by using information maintained in the current world model and results from the syntactic analysis, wherein inferring the semantic structures includes creating multiple generative semantic primitive (GSP) structures by defining a predicate and one or more roles for at least one GSP structure of the multiple GSP structures, wherein the processing includes:
    processing, over a first time period, a first story of the multiple natural language stories having a first reading comprehension level;
    inferring one or more first semantic structures as representations of the first story;
    processing, over a second time period after the first time period, a second story of the multiple natural language stories having a second reading comprehension level that is more difficult than the first reading comprehension level; and
    inferring at least one second semantic structure as a representation of the second story based at least in part on expanding at least one of the one or more first semantic structures to include a new semantic structure;
  maintaining a dependency structure for linear dialogs to identify questions that are independent from one another;
  conducting dialog sessions with multiple different human users to aggregate multiple responses from the multiple different human users to evaluate the semantic structures as understandings of the multiple natural language stories, wherein evaluating the semantic structures includes at least one of adding a new GSP structure or redefining at least one of the predicate or the one or more roles for an existing GSP structure; and
  modifying the current world model in the datastore over time to include the semantic structures inferred from the multiple natural language stories and evaluated by the multiple different human users, the semantic structures including the one or more first semantic structures inferred during or after the first time period and the at least one second semantic structure inferred during or after the second time period.
Dependent claims: 15, 16, 17
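
Claim 14 recites aggregating responses from multiple different human users but does not fix an aggregation rule. As one possible reading, the sketch below uses a simple majority vote per question; the function name `aggregate_responses`, the `min_agreement` parameter, and the sample data are assumptions made for illustration.

```python
from collections import Counter


def aggregate_responses(responses, min_agreement=0.5):
    """Combine answers from several users into one consensus answer per question.

    `responses` is a list of (user_id, question_id, answer) tuples. A question
    gets a consensus answer only when the most common answer is given by more
    than `min_agreement` of the users who answered it.
    """
    by_question = {}
    for user_id, qid, answer in responses:
        # Keep one answer per user per question; a later answer overwrites an earlier one.
        by_question.setdefault(qid, {})[user_id] = answer

    consensus = {}
    for qid, answers in by_question.items():
        top_answer, count = Counter(answers.values()).most_common(1)[0]
        if count / len(answers) > min_agreement:
            consensus[qid] = top_answer
    return consensus


sample = [
    ("u1", "q1", "the waiter"),
    ("u2", "q1", "the waiter"),
    ("u3", "q1", "the chef"),
    ("u1", "q2", "yes"),
    ("u2", "q2", "no"),
]
print(aggregate_responses(sample))
```

Questions without a clear majority (such as `q2` in the sample, where the two users disagree) are left out of the result, so a downstream step could ask them again rather than revise a GSP structure on a tie.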

18. A computing system, comprising:
one or more processors; and
memory coupled to the one or more processors, the memory storing computer-executable modules comprising:
  a knowledge integration engine to form semantic structures that provide a semantic representation of a natural language story, wherein the semantic structures include generative semantic primitive (GSP) structures defined with predicates and roles to express a first understanding of the natural language story; and
  a dialog engine to facilitate a dialog session with a human user to evaluate an extent to which the semantic structures represent the natural language story by generating and submitting one or more questions to a separate user computing device used by the human user and collecting one or more responses from the separate user computing device indicative of input from the human user, wherein the dialog engine includes a dialog dispatcher to break up the dialog session into sub-dialog sessions and maintains a dependency structure for linear dialogs to identify questions that are independent from one another, wherein the dialog dispatcher includes a structure crawler to iterate through the dependency structure based on the one or more responses to identify a set of next questions; and
  wherein the knowledge integration engine updates the semantic structures to include at least one of adding a new GSP structure or redefining an existing GSP structure to express a second understanding of the natural language story based in part on the one or more responses, wherein adding the new GSP structure or redefining the existing GSP structure increases the extent to which the semantic structures represent the natural language story.
Dependent claims: 19, 20, 21
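
The dialog dispatcher of claim 18 breaks a dialog session into sub-dialog sessions. One way to picture this, offered only as an assumption about how such a split might work, is to group questions into connected components of the dependency graph, so that each group forms a self-contained linear dialog. The function `split_into_sub_dialogs` and the question ids are hypothetical.

```python
def split_into_sub_dialogs(dependencies):
    """Group questions so that questions in different groups never depend on each other.

    `dependencies` maps a question id to the set of question ids it depends on.
    Returns a list of sorted id lists, one per connected group.
    """
    # Build an undirected adjacency map: a dependency links both questions.
    adjacency = {qid: set() for qid in dependencies}
    for qid, prerequisites in dependencies.items():
        for pre in prerequisites:
            adjacency.setdefault(pre, set()).add(qid)
            adjacency[qid].add(pre)

    seen = set()
    groups = []
    for start in sorted(adjacency):
        if start in seen:
            continue
        # Depth-first walk collects every question reachable from `start`.
        stack, group = [start], []
        seen.add(start)
        while stack:
            node = stack.pop()
            group.append(node)
            for neighbor in adjacency[node]:
                if neighbor not in seen:
                    seen.add(neighbor)
                    stack.append(neighbor)
        groups.append(sorted(group))
    return groups


deps = {"q1": set(), "q2": {"q1"}, "q3": set(), "q4": {"q3"}, "q5": set()}
print(split_into_sub_dialogs(deps))
```

Each resulting group could be handed to a different user or session, since no question in one group waits on an answer from another group.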
Specification