Information-theory based measure of similarity between instances in ontology
First Claim
1. A method of measuring similarity between instances in an ontology for use in an information retrieval system, the method comprising the steps of:
obtaining a set of instances from the ontology;
computing a first similarity metric that measures similarity between instances in the set of instances with respect to ontology concepts to which the instances belong; and
storing at least one taxonomy induced by the first similarity metric, wherein the at least one induced taxonomy is usable for responding to requests submitted to the information retrieval system;
wherein the first similarity metric measures similarity of instances i and j in the set of instances based on the similarity of C(i) and C(j), where the C(i) and the C(j) represent sets of concepts to which the instances belong; and
wherein the first similarity metric considers concept membership statements of the instances in the set of instances by defining a description of an individual and a commonality between the instances based on the ontology concepts to which the instances belong;
and further wherein the obtaining, computing and storing steps are performed by a processor and memory.
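As an illustration of the concept-based metric, the following is a minimal sketch that assumes a Lin-style information-theoretic ratio. The claim does not fix a formula, so the probability estimate, the function names `concept_information` and `concept_similarity`, and the toy data are all assumptions made for this example. The description of instance i is taken to be C(i), the commonality of i and j is taken to be the intersection of C(i) and C(j), and each concept membership statement is weighted by its information content, -log p(c), where p(c) is the fraction of instances that belong to concept c.

```python
import math
from collections import Counter

def concept_information(memberships):
    """Return -log p(c) per concept c, with p(c) = share of instances in c (assumed estimate)."""
    n = len(memberships)
    counts = Counter(c for concepts in memberships.values() for c in concepts)
    return {c: -math.log(k / n) for c, k in counts.items()}

def concept_similarity(i, j, memberships, info):
    """Lin-style ratio: information in the commonality over information in both descriptions."""
    ci, cj = memberships[i], memberships[j]
    common = sum(info[c] for c in ci & cj)
    total = sum(info[c] for c in ci) + sum(info[c] for c in cj)
    # If every concept involved is shared by all instances, it carries no information.
    return 2.0 * common / total if total > 0 else 0.0

# Hypothetical toy ontology: instance -> C(instance)
memberships = {
    "alice": {"Person", "Researcher"},
    "bob": {"Person", "Researcher", "Manager"},
    "carol": {"Person", "Manager"},
    "acme": {"Organization"},
}
info = concept_information(memberships)
sim_ab = concept_similarity("alice", "bob", memberships, info)
sim_ac = concept_similarity("alice", "carol", memberships, info)
sim_a_acme = concept_similarity("alice", "acme", memberships, info)
```

In this toy data, alice and bob share two concepts, one of them fairly rare, so they score higher than alice and carol, who share only the very common concept Person. Instances with no concept in common score zero.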
Abstract
Improved information processing techniques for measuring similarity between instances in an ontology are disclosed. For example, a method of measuring similarity between instances in an ontology for use in an information retrieval system includes the following steps. A set of instances from the ontology is obtained. At least one of the following similarity metrics for the set of instances is computed: (i) a first metric that measures similarity between instances in the set of instances with respect to ontology concepts to which the instances belong; (ii) a second metric which measures similarity between instances in the set of instances where the instances are subjects in statements involving a given ontology property; and (iii) a third metric which measures similarity between instances in the set of instances where the instances are objects in statements involving a given ontology property. At least one taxonomy induced by the at least one computed similarity metric is stored, wherein the at least one induced taxonomy is usable for responding to requests submitted to an information retrieval system. When two or more of the first metric, the second metric and the third metric are computed, and two or more induced taxonomies corresponding to the two or more computed similarity metrics are stored, the method may include merging the two or more induced taxonomies to form a combined taxonomy, wherein the combined taxonomy is usable for responding to requests submitted to an information retrieval system.
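The abstract does not say how a taxonomy is induced from a similarity metric. One common way to obtain a hierarchy from pairwise similarities is average-linkage agglomerative clustering, which is assumed here purely for illustration. The function names `leaves` and `induce_taxonomy`, the nested-tuple representation of the taxonomy, and the similarity values are hypothetical.

```python
from itertools import combinations

def leaves(node):
    """Return the instance names under a taxonomy node (a name or a pair of nodes)."""
    if isinstance(node, str):
        return [node]
    return leaves(node[0]) + leaves(node[1])

def induce_taxonomy(instances, sim):
    """Repeatedly merge the two most similar clusters (average linkage).

    Expects at least one instance with a unique name; returns nested pairs.
    """
    clusters = list(instances)
    while len(clusters) > 1:
        def linkage(pair):
            la, lb = leaves(clusters[pair[0]]), leaves(clusters[pair[1]])
            return sum(sim(x, y) for x in la for y in lb) / (len(la) * len(lb))
        p, q = max(combinations(range(len(clusters)), 2), key=linkage)
        merged = (clusters[p], clusters[q])
        clusters = [c for k, c in enumerate(clusters) if k not in (p, q)]
        clusters.append(merged)
    return clusters[0]

# Hypothetical pairwise similarities, as any of the three metrics might produce.
toy = {
    frozenset(("alice", "bob")): 0.8,
    frozenset(("alice", "carol")): 0.3,
    frozenset(("bob", "carol")): 0.4,
}

def toy_sim(x, y):
    return 1.0 if x == y else toy.get(frozenset((x, y)), 0.0)

taxonomy = induce_taxonomy(["alice", "bob", "carol", "acme"], toy_sim)
# -> ('acme', ('carol', ('alice', 'bob')))
```

The most similar pair is grouped first, and less similar instances join higher up, so the nesting depth reflects similarity. A stored structure of this kind could then be consulted when responding to retrieval requests.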
25 Claims
1. A method of measuring similarity between instances in an ontology for use in an information retrieval system, the method comprising the steps of:
obtaining a set of instances from the ontology;
computing a first similarity metric that measures similarity between instances in the set of instances with respect to ontology concepts to which the instances belong; and
storing at least one taxonomy induced by the first similarity metric, wherein the at least one induced taxonomy is usable for responding to requests submitted to the information retrieval system;
wherein the first similarity metric measures similarity of instances i and j in the set of instances based on the similarity of C(i) and C(j), where the C(i) and the C(j) represent sets of concepts to which the instances belong; and
wherein the first similarity metric considers concept membership statements of the instances in the set of instances by defining a description of an individual and a commonality between the instances based on the ontology concepts to which the instances belong;
and further wherein the obtaining, computing and storing steps are performed by a processor and memory.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A method of measuring similarity between instances in an ontology for use in an information retrieval system, the method comprising the steps of:
obtaining a set of instances from the ontology;
computing at least one of the following similarity metrics for the set of instances:
a first metric that measures similarity between instances in the set of instances with respect to ontology concepts to which the instances belong;
a second metric which measures similarity between instances in the set of instances where the instances are subjects in statements involving a given ontology property; and
a third metric which measures similarity between instances in the set of instances where the instances are objects in statements involving a given ontology property; and
storing at least one taxonomy induced by the at least one computed similarity metric, wherein the at least one induced taxonomy is usable for responding to requests submitted to an information retrieval system;
wherein the first metric, the second metric and the third metric comprise information theory-based measurements.
9. A method of measuring similarity between instances in an ontology for use in an information retrieval system, the method comprising the steps of:
obtaining a set of instances from the ontology;
computing a similarity metric which measures similarity between instances in the set of instances where the instances are subjects in statements involving at least one given ontology property; and
storing at least one taxonomy induced by the similarity metric, wherein the at least one induced taxonomy is usable for responding to requests submitted to the information retrieval system;
wherein the similarity metric measures similarity of instances i and j in the set of instances based on the similarity of sets of objects in statements where the instances are subjects in the statements;
and further wherein the obtaining, computing and storing steps are performed by a processor and memory.
Dependent claims: 10, 11, 12, 13, 14, 15
16. A method of measuring similarity between instances in an ontology for use in an information retrieval system, the method comprising the steps of:
obtaining a set of instances from the ontology;
computing a similarity metric which measures similarity between instances in the set of instances where the instances are objects in statements involving at least one given ontology property; and
storing at least one taxonomy induced by the similarity metric, wherein the at least one induced taxonomy is usable for responding to requests submitted to the information retrieval system;
wherein the similarity metric measures similarity of instances i and j in the set of instances based on the similarity of sets of subjects in statements where the instances are objects in the statements;
and further wherein the obtaining, computing and storing steps are performed by a processor and memory.
Dependent claims: 17, 18, 19, 20, 21, 22
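Claims 9 and 16 mirror each other: one compares instances that are subjects by the sets of objects they point to, the other compares instances that are objects by the sets of subjects pointing to them. The following is a minimal sketch of both, assuming statements are stored as (subject, property, object) triples and assuming a Lin-style information-weighted set overlap. Neither the formula nor the names `value_sets` and `set_similarity` nor the `worksOn` data come from the claims; they are illustrative assumptions.

```python
import math
from collections import defaultdict

def value_sets(triples, prop, role):
    """Map each instance to the set of values it is linked to through `prop`.

    role="subject": the instance is the subject, the values are the objects.
    role="object":  the instance is the object, the values are the subjects.
    """
    sets = defaultdict(set)
    for s, p, o in triples:
        if p == prop:
            if role == "subject":
                sets[s].add(o)
            else:
                sets[o].add(s)
    return dict(sets)

def set_similarity(i, j, sets):
    """Information-weighted overlap of the two value sets (Lin-style ratio, assumed)."""
    n = len(sets)

    def info(v):
        # Rare values carry more information than values linked to many instances.
        share = sum(1 for vals in sets.values() if v in vals) / n
        return -math.log(share)

    vi, vj = sets.get(i, set()), sets.get(j, set())
    common = sum(info(v) for v in vi & vj)
    total = sum(info(v) for v in vi) + sum(info(v) for v in vj)
    return 2.0 * common / total if total > 0 else 0.0

# Hypothetical statements involving a single property.
triples = [
    ("alice", "worksOn", "projectX"),
    ("alice", "worksOn", "projectY"),
    ("bob", "worksOn", "projectX"),
    ("carol", "worksOn", "projectZ"),
]
by_subject = value_sets(triples, "worksOn", "subject")
by_object = value_sets(triples, "worksOn", "object")
sim_subjects = set_similarity("alice", "bob", by_subject)         # subjects compared by objects
sim_objects = set_similarity("projectX", "projectY", by_object)   # objects compared by subjects
```

The same scoring function serves both directions; only the way the value sets are built changes, which is why the two claims can share most of an implementation.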
23. An article of manufacture for measuring similarity between instances in an ontology for use in an information retrieval system, comprising a computer readable storage medium containing one or more programs which when executed by a processor implement the steps of:
obtaining a set of instances from the ontology;
computing at least one of the following similarity metrics for the set of instances:
a first similarity metric that measures similarity between first instances in the set of instances with respect to ontology concepts to which the first instances belong, wherein the first similarity metric measures similarity of first instances i1 and j1 in the set of instances based on the similarity of C(i1) and C(j1), wherein the C(i1) and the C(j1) represent sets of concepts to which the first instances belong, and wherein the first similarity metric considers concept membership statements of the first instances in the set of instances by defining a description of an individual and a commonality between the first instances based on the ontology concepts to which the first instances belong;
a second similarity metric which measures similarity between second instances in the set of instances, wherein the second instances are subjects in first statements involving at least one given first ontology property, wherein the second similarity metric measures similarity of second instances i2 and j2 in the set of instances based on similarity of sets of objects in the first statements where the second instances are the subjects in the first statements; and
a third similarity metric which measures similarity between third instances in the set of instances, wherein the third instances are objects in second statements involving at least one given second ontology property, wherein the third similarity metric measures similarity of third instances i3 and j3 in the set of instances based on similarity of sets of subjects in the second statements where the third instances are objects in the second statements; and
storing at least one taxonomy induced by at least one of the first, the second and the third similarity metric, wherein the at least one induced taxonomy is usable for responding to requests submitted to the information retrieval system.
24. Apparatus for measuring similarity between instances in an ontology for use in an information retrieval system, the apparatus comprising:
a memory; and
a processor coupled to the memory and operative to:
(i) obtain a set of instances from the ontology;
(ii) compute at least one of the following similarity metrics for the set of instances:
a first similarity metric that measures similarity between first instances in the set of instances with respect to ontology concepts to which the first instances belong, wherein the first similarity metric measures similarity of first instances i1 and j1 in the set of instances based on the similarity of C(i1) and C(j1), wherein the C(i1) and the C(j1) represent sets of concepts to which the first instances belong, and wherein the first similarity metric considers concept membership statements of the first instances in the set of instances by defining a description of an individual and a commonality between the first instances based on the ontology concepts to which the first instances belong;
a second similarity metric which measures similarity between second instances in the set of instances where the second instances are subjects in first statements involving at least one given first ontology property, wherein the second similarity metric measures similarity of second instances i2 and j2 in the set of instances based on similarity of sets of objects in the first statements where the second instances are the subjects in the first statements; and
a third similarity metric which measures similarity between third instances in the set of instances where the third instances are objects in second statements involving at least one given second ontology property, wherein the third similarity metric measures similarity of third instances i3 and j3 in the set of instances based on similarity of sets of subjects in the second statements where the third instances are objects in the second statements; and
(iii) store at least one taxonomy induced by at least one of the first, the second and the third similarity metrics, wherein the at least one induced taxonomy is usable for responding to requests submitted to an information retrieval system.
25. An information retrieval system, comprising a similarity measurement system comprising a memory and a processor coupled to the memory, the information retrieval system configured to:
(i) obtain a set of instances from the ontology;
(ii) compute at least one of the following similarity metrics for the set of instances:
a first similarity metric that measures similarity between first instances in the set of instances with respect to ontology concepts to which the first instances belong, wherein the first similarity metric measures similarity of first instances i1 and j1 in the set of instances based on the similarity of C(i1) and C(j1), wherein the C(i1) and the C(j1) represent sets of concepts to which the first instances belong, and wherein the first similarity metric considers concept membership statements of the first instances in the set of instances by defining a description of an individual and a commonality between the first instances based on the ontology concepts to which the first instances belong;
a second similarity metric which measures similarity between second instances in the set of instances where the second instances are subjects in first statements involving at least one given first ontology property, wherein the second similarity metric measures similarity of second instances i2 and j2 in the set of instances based on similarity of sets of objects in the first statements where the second instances are the subjects in the first statements; and
a third similarity metric which measures similarity between third instances in the set of instances where the third instances are objects in second statements involving at least one given second ontology property, wherein the third similarity metric measures similarity of third instances i3 and j3 in the set of instances based on similarity of sets of subjects in the second statements where the third instances are objects in the second statements; and
(iii) store at least one taxonomy induced by at least one of the first, the second and the third similarity metrics, wherein the at least one induced taxonomy is usable for responding to requests submitted to an information retrieval system.
Specification