Graph long short term memory for syntactic relationship discovery
First Claim
Patent Images
1. A method for analyzing natural language text based on linguistic relationships, comprising:
- receiving a graph long short term memory (LSTM) relation extractor;
receiving a selection of documents to query for syntactic relationships;
receiving a keyword tuple, the keyword tuple including multiple keywords and specifying a relationship between the keywords;
parsing the selection of documents to discover natural language segments that include the keywords;
in response to locating a given natural language segment, processing the segment according to the graph LSTM relation extractor to produce a relational score;
determining whether the relational score satisfies a relationship threshold;
in response to the relational score satisfying the relationship threshold, automatically returning the given natural language segment as responsive to the keyword tuple; and
without user input, adding the natural language segment to a knowledge base configured for searching according to at least two of one or more of the keywords and the relationship.
1 Assignment
0 Petitions
Accused Products
Abstract
Long short term memory units that accept a non-predefined number of inputs are used to provide natural language relation extraction over a user-specified range on content. Content written for human consumption is parsed with distant supervision in segments (e.g., sentences, paragraphs, chapters) to determine relationships between various words within and between those segments.
35 Citations
21 Claims
-
1. A method for analyzing natural language text based on linguistic relationships, comprising:
-
receiving a graph long short term memory (LSTM) relation extractor; receiving a selection of documents to query for syntactic relationships; receiving a keyword tuple, the keyword tuple including multiple keywords and specifying a relationship between the keywords; parsing the selection of documents to discover natural language segments that include the keywords; in response to locating a given natural language segment, processing the segment according to the graph LSTM relation extractor to produce a relational score; determining whether the relational score satisfies a relationship threshold; in response to the relational score satisfying the relationship threshold, automatically returning the given natural language segment as responsive to the keyword tuple; and without user input, adding the natural language segment to a knowledge base configured for searching according to at least two of one or more of the keywords and the relationship. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for analyzing linguistic relationships in natural language text, comprising:
-
a processor; and a memory storage device including instructions that when executed by the processor are operable to provide a plurality of graph long short term memory (LSTM) units arranged in an LSTM neural network to provide entity relationships in natural language text, wherein each graph LSTM unit of the plurality of graph LSTM units is associated with a word vector from the natural language text and includes; an input gate, operable to produce an input state by applying an input weight to the word vector and adding an input weighted sum of predecessor hidden vectors, wherein the predecessor hidden vectors are received from graph LSTM units preceding the graph LSTM unit in the LSTM neural network; an output gate, operable to produce an output state by applying an output weight to the word vector and adding an output weighted sum of the predecessor hidden vectors; a plurality of forget gates, wherein a number of forget gates equals a number of related words in a document segment associated with the word vector, wherein each forget gate of the plurality of forget gates is configured to receive an associated predecessor hidden vector from an associated predecessor graph LSTM unit in the LSTM neural network and to produce a forget state by applying a forget weight to the word vector and a weighting of the associated predecessor hidden vector; a memory cell, operable to produce a memory state by multiplexing the input state with the word vector to which a memory weight is applied and a memory weighted sum of the predecessor hidden vectors is added, to which is added a sum of the forget weights produced by the plurality of forget gates multiplexed with an associated memory cell state of the associated predecessor graph LSTM units; and wherein the graph LSTM unit transmits a hidden vector to successor graph LSTM units in the LSTM neural network, wherein the hidden vector is produced by multiplexing the output state with the memory state. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A computer readable storage device including instructions for analyzing natural language text based on linguistic relationships, wherein the instructions comprise:
-
receiving a graph long short term memory (LSTM) unit; receiving a training knowledge base, the training knowledge base including keywords that are associated according to a known relationship; receiving a selection of documents; parsing the selection of documents to discover natural language segments that include the keywords, wherein a first portion of the natural language segments exhibit the known relationship and a second portion of the natural language segments do not exhibit the known relationship; and training the graph LSTM unit over a series of epochs, in which training comprises; for each of the natural language segments discovered; identifying a linguistic structure of a given natural language segment; forming a neural network of instances of the graph LSTM unit having a structure based on the linguistic structure; processing the keywords according to the neural network to produce a relational score; comparing the relational score to a relational threshold to determine whether the neural network indicates the given segment exhibits the known relationship; and for each epoch of the series of epochs, automatically adjusting at least one weighting of the graph LSTM unit for use in a next epoch of the series of epochs based on a number of the first portion determined to exhibit the known relationship and a number of the second portion determined to not exhibit the known relationship. - View Dependent Claims (18, 19, 20, 21)
-
Specification