Information relation generation
First Claim
Patent Images
1. A method for mining a relationship of at least a first and a second named entity comprising:
- identifying a sentence with at least the first and the second named entity in a document;
defining, by a processor, a first instance comprising the first and the second named entity, a type of named entity for each of the first and the second named entity, and text in the sentence between the first and the second named entity;
applying, by a processor, latent Dirichlet allocation (LDA) to the document, the LDA including an input of the first instance, and then determining a distribution of types of relationship as an output, the types of relationship comprising labels of how the first named entity relates to the second named entity; and
selecting one of the types of the relationship as the relationship for the first and the second named entity,wherein applying the LDA comprises applying a supervised maximum entropy discrimination LDA with the characteristic types of relationships as observed response variables of an output for supervision of the supervised maximum entropy discrimination LDA.
2 Assignments
0 Petitions
Accused Products
Abstract
For generating a word space, manual thresholding of word scores is used. Rather than requiring the user to select the threshold arbitrarily or review each word, the user is iteratively requested to indicate the relevance of a given word. Words with greater or lesser scores are labeled in the same way depending upon the response. For determining the relationship between named entities, Latent Dirichlet Allocation (LDA) is performed on text associated with the name entities rather than on an entire document. LDA for relationship mining may include context information and/or supervised learning.
-
Citations
9 Claims
-
1. A method for mining a relationship of at least a first and a second named entity comprising:
-
identifying a sentence with at least the first and the second named entity in a document; defining, by a processor, a first instance comprising the first and the second named entity, a type of named entity for each of the first and the second named entity, and text in the sentence between the first and the second named entity; applying, by a processor, latent Dirichlet allocation (LDA) to the document, the LDA including an input of the first instance, and then determining a distribution of types of relationship as an output, the types of relationship comprising labels of how the first named entity relates to the second named entity; and selecting one of the types of the relationship as the relationship for the first and the second named entity, wherein applying the LDA comprises applying a supervised maximum entropy discrimination LDA with the characteristic types of relationships as observed response variables of an output for supervision of the supervised maximum entropy discrimination LDA. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method for mining a relationship of at least a first and a second named entity comprising:
-
identifying a sentence with at least the first and the second named entity in a document; defining, by a processor, a first instance comprising the first and the second named entity, a type of named entity for each of the first and the second named entity, and text in the sentence between the first and the second named entity; applying, by a processor, latent Dirichlet allocation (LDA) to the document, the LDA including an input of the first instance, and then determining a distribution of types of relationship as an output, the types of relationship comprising labels of how the first named entity relates to the second named entity; and selecting one of the types of the relationship as the relationship for the first and the second named entity, wherein applying the LDA comprises applying a labeled LDA without a labeling prior probability.
-
-
9. A method for mining a relationship of at least a first named entity and a second named entity on a non-transitory computer readable storage media having stored therein data representing instructions executable by a programmed processor, the method comprising:
-
applying latent Dirichlet allocation (LDA) to a document, which is stored on the non-transitory computer readable storage media, with identified sentences with the first and the second named entity, the LDA including an input of the first instance, the first instance comprising the first and the second named entity, a type of named entity for the first and the second named entity and text in a sentence between the first and the second named entity, and then determining a distribution of types of relationship as an output of the LDA, the types of relationships comprising labels of how the first named entity relates to the second named entity; and selecting one of the types of relationship as the relationship for the first and the second named entity, wherein applying the LDA comprises applying a supervised maximum entropy discrimination LDA with the characteristic types of relationships as observed response variables of an output for supervision of the supervised maximum entropy discrimination LDA.
-
Specification