Systems and methods for semantic inference and reasoning
First Claim
1. A method of analyzing a corpus of data artifacts, the method comprising:
- obtaining, by a computer, a semantic representation of the data artifacts, wherein the semantic representation indicates;
entities identified in the data artifacts; and
semantic relationships among the entities, as indicated by the data artifacts;
clustering the data artifacts, by a computer, into clusters of semantically related data artifacts based on the semantic representation; and
inferring, by a computer, additional semantic relationships between pairs of the entities, wherein the inferring comprises;
on a cluster-by-cluster basis, applying a multi-tiered network of inference engines to a portion of the semantic representation corresponding to the cluster, wherein the multi-tiered network of inference engines comprises a domain-independent inference tier and a domain-specific inference tier,wherein applying the multi-tiered network of inference engines comprises;
applying two or more inference engines sequentially, in parallel, or iteratively according to a static or dynamic schedule.
3 Assignments
0 Petitions
Accused Products
Abstract
A method and system for analyzing a corpus of data artifacts is disclosed. The method comprises obtaining, by a computer, a semantic representation of the data artifacts, where the semantic representation indicates (1) entities identified in the data artifacts, and (2) semantic relationships among the entities as indicated by the data artifacts. The method further comprises clustering the data artifacts into clusters of semantically related data artifacts based on the semantic representation and inferring additional semantic relationships between pairs of the entities. The inferring comprises applying, on a cluster-by-cluster basis, a multi-tiered network of inference engines to a portion of the semantic representation corresponding to the cluster, where the multi-tiered network of inference engines includes a domain-independent inference tier and a domain-specific inference tier.
80 Citations
21 Claims
-
1. A method of analyzing a corpus of data artifacts, the method comprising:
-
obtaining, by a computer, a semantic representation of the data artifacts, wherein the semantic representation indicates; entities identified in the data artifacts; and semantic relationships among the entities, as indicated by the data artifacts; clustering the data artifacts, by a computer, into clusters of semantically related data artifacts based on the semantic representation; and inferring, by a computer, additional semantic relationships between pairs of the entities, wherein the inferring comprises; on a cluster-by-cluster basis, applying a multi-tiered network of inference engines to a portion of the semantic representation corresponding to the cluster, wherein the multi-tiered network of inference engines comprises a domain-independent inference tier and a domain-specific inference tier, wherein applying the multi-tiered network of inference engines comprises; applying two or more inference engines sequentially, in parallel, or iteratively according to a static or dynamic schedule. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system comprising:
-
a parallel processing facility comprising; a plurality of computer processing cores; and one or more memories coupled to the computer processing cores and storing program instructions executable by the processing cores to implement a semantic inference and reasoning engine configured to analyze a corpus of data artifacts by; obtaining a semantic representation of the data artifacts, wherein the semantic representation indicates; entities identified in the data artifacts; and semantic relationships among the entities, as indicated by the data artifacts; clustering the data artifacts into clusters of semantically related data artifacts based on the semantic representation; and inferring additional semantic relationships between pairs of the entities, wherein the inferring comprises; on a cluster-by-cluster basis, applying a multi-tiered network of inference engines to a portion of the semantic representation corresponding to the cluster, wherein the multi-tiered network of inference engines comprises a domain-independent inference tier and a domain-specific inference tier, wherein applying the multi-tiered network of inference engines comprises applying two or more inference engines sequentially, in parallel, or iteratively according to a static or dynamic schedule. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18)
-
-
19. A method of analyzing a corpus of data artifacts, the method comprising:
-
obtaining, by a computer, a semantic representation of the data artifacts, wherein the semantic representation indicates; entities identified in the data artifacts; and semantic relationships among the entities, as indicated by the data artifacts; wherein obtaining the semantic representation comprises; determining that the same entity is identified in the artifacts using different identifiers; and disambiguating the entity by replacing one or more of the different identifiers with a common identifier for the entity; clustering the data artifacts, by a computer, into clusters of semantically related data artifacts based on the semantic representation; and inferring, by a computer, additional semantic relationships between pairs of the entities, wherein the inferring comprises; on a cluster-by-cluster basis, applying a multi-tiered network of inference engines to a portion of the semantic representation corresponding to the cluster, wherein the multi-tiered network of inference engines comprises a domain-independent inference tier and a domain-specific inference tier.
-
-
20. A method of analyzing a corpus of data artifacts, the method comprising:
-
obtaining, by a computer, a semantic representation of the data artifacts, wherein the semantic representation indicates; entities identified in the data artifacts; and semantic relationships among the entities, as indicated by the data artifacts; clustering the data artifacts, by a computer, into clusters of semantically related data artifacts based on the semantic representation, wherein clustering the data artifacts comprises at least one of; performing a semantic analysis to determine semantic interrelatedness of the data artifacts based on respective ones of the entities and relationships in the data artifacts; and performing a syntactic analysis to determine syntactic interrelatedness of the data artifacts based on syntactic overlap of respective content of the data artifacts; and inferring, by a computer, additional semantic relationships between pairs of the entities, wherein the inferring comprises; on a cluster-by-cluster basis, applying a multi-tiered network of inference engines to a portion of the semantic representation corresponding to the cluster, wherein the multi-tiered network of inference engines comprises a domain-independent inference tier and a domain-specific inference tier.
-
-
21. A method of analyzing a corpus of data artifacts, the method comprising:
-
obtaining, by a computer, a semantic representation of the data artifacts, wherein the semantic representation indicates; entities identified in the data artifacts; and semantic relationships among the entities, as indicated by the data artifacts; clustering the data artifacts, by a computer, into clusters of semantically related data artifacts based on the semantic representation; and inferring, by a computer, additional semantic relationships between pairs of the entities, wherein the inferring comprises; on a cluster-by-cluster basis, applying a multi-tiered network of inference engines to a portion of the semantic representation corresponding to the cluster, wherein the multi-tiered network of inference engines comprises a domain-independent inference tier and a domain-specific inference tier, wherein applying the multi-tiered network of inference engines comprises; applying a plurality of domain-independent inference engines in the domain-independent tier and subsequently applying a plurality of inference engines in the domain-specific tier.
-
Specification