Architecture and processes for computer learning and understanding
First Claim
1. A computer-implemented method, comprising:
- in a computing system for processing a natural language input, performing, by the computing system, a process for generating resources for use in processing the natural language input comprising;
receiving a query to analyze one or more words in the natural language input;
analyzing the one or more words for possible interpretations including, in part, inducing knowledge from language corpora for use in deriving the possible interpretations and performing word sense disambiguation on the one or more words to produce one or more sense vectors, wherein inducing the knowledge includes detecting a setting or a background of that is common to a context of the natural language input;
accumulating one or more knowledge resources containing the knowledge induced from the language corpora, wherein at least one knowledge resource of the one or more knowledge resources includes a three-dimensional tensor structure in which a relationship between two or more parameters has a probability;
storing the one or more knowledge resources, wherein storing the one or more knowledge resources includes storing the at least one knowledge resource as different parameters within the three-dimensional tensor structure; and
returning one or more possible interpretations of the possible interpretations in response to the query.
2 Assignments
0 Petitions
Accused Products
Abstract
An architecture and processes enable computer learning and developing an understanding of arbitrary natural language text through collaboration with humans in the context of joint problem solving. The architecture ingests the text and then syntactically and semantically processes the text to infer an initial understanding of the text. The initial understanding is captured in a story model of semantic and frame structures. The story model is then tested through computer generated questions that are posed to humans through interactive dialog sessions. The knowledge gleaned from the humans is used to update the story model as well as the computing system'"'"'s current world model of understanding. The process is repeated for multiple stories over time, enabling the computing system to grow in knowledge and thereby understand stories of increasingly higher reading comprehension levels.
176 Citations
20 Claims
-
1. A computer-implemented method, comprising:
in a computing system for processing a natural language input, performing, by the computing system, a process for generating resources for use in processing the natural language input comprising; receiving a query to analyze one or more words in the natural language input; analyzing the one or more words for possible interpretations including, in part, inducing knowledge from language corpora for use in deriving the possible interpretations and performing word sense disambiguation on the one or more words to produce one or more sense vectors, wherein inducing the knowledge includes detecting a setting or a background of that is common to a context of the natural language input; accumulating one or more knowledge resources containing the knowledge induced from the language corpora, wherein at least one knowledge resource of the one or more knowledge resources includes a three-dimensional tensor structure in which a relationship between two or more parameters has a probability; storing the one or more knowledge resources, wherein storing the one or more knowledge resources includes storing the at least one knowledge resource as different parameters within the three-dimensional tensor structure; and returning one or more possible interpretations of the possible interpretations in response to the query. - View Dependent Claims (2, 3, 4, 5, 6, 18)
-
7. A computer-implemented method, comprising:
for a natural language input, performing, by a computing system, a process comprising; receiving the natural language input; performing a syntactic analysis of the natural language input to produce one or more linguistic analysis results; inducing knowledge from language corpora including detecting a setting or a background that are common to a context of the natural language input; creating multiple semantic structures to represent the natural language input in part by using the one or more linguistic analysis results and the knowledge induced from the language corpora, wherein the multiple semantic structures include one or more generative semantic primitive (GSP) structures defining one or more roles that are common to the context of the setting or the background of the natural language input, wherein defining the one or more roles includes mapping one or more entities in the natural language input to the one or more roles; engaging in a dialog session with a human user to receive input from the human user to use by the computing system to evaluate the multiple semantic structures as an understanding of the natural language input; revising the multiple semantic structures based on one or more responses from the human user to improve the understanding of the natural language input, wherein revising the multiple semantic structures includes defining at least one new GSP structure with at least one or more new roles based on the one or more responses and based at least in part on the at least one new GSP structure having at least a threshold probability of being included in the multiple semantic structures that is associated with the context of the setting or the background of the natural language input; and storing the knowledge as different parameters within a three-dimensional tensor structure in which relations among the different parameters have associated probabilities. - View Dependent Claims (8, 9, 10, 11, 12, 20)
-
13. A computing system, comprising:
-
one or more processors; and memory coupled to the one or more processors, the memory storing; a knowledge induction engine to analyze language corpora and to generate a resource repository including one or more knowledge resources for use in deriving possible interpretations of natural language, wherein the knowledge induction engine stores the one or more knowledge resources as different parameters within a three-dimensional tensor structure in which relations among the different parameters have associated probabilities; and a knowledge integration engine to form one or more semantic structures that provide a semantic representation of a natural language story, the knowledge integration engine forming the one or more semantic structures through use of one or more linguistic analysis results generated from parsing the natural language story and the one or more knowledge resources stored in the resource repository, wherein forming the one or more semantic structures includes forming at least one frame structure that relates a set of generative primitive (GSP) structures under a particular context of a particular setting of the natural language story, an individual GSP structure of the set of GSP structures having at least a threshold probability of being included in the at least one frame structure that is associated with the particular context of the particular setting of the natural language story. - View Dependent Claims (14, 15, 16, 17, 19)
-
Specification