Methods for searching with semantic similarity scores in one or more ontologies
First Claim
1. A method of scoring items comprising obtaining a plurality of items having annotations that are semantically related to the terms in a query by using one or more ontologies and said annotations in order to determine the statistical significance of similarities between query terms and sets of terms used in the annotations of the items, assigning a P-value to the probability of obtaining the observed semantic similarity score for each obtained item, and scoring the items according to their individual P-values.
Abstract
A method assigns importance ranks to documents within repositories or databases, including databases of books or other printed material, electronic documentation, and pages within the World Wide Web. The method uses a corpus of indexed documents that has been annotated with the terms of one or more ontologies in order to assign a semantic similarity score to queries based on terms taken from the ontologies. A statistical model is used to test the significance of matches between query terms and documents or categories. The method results in an acceleration of over 10,000-fold for realistic queries and ontologies, and makes it practicable to calculate P-values dynamically or to keep database annotations and the related P-value distributions up to date by frequent recalculation.
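To make the ideas in the abstract concrete, the following is a minimal sketch of scoring a query against annotated items and attaching a P-value to each score. Everything in it is an assumption made for illustration: the toy ontology `PARENTS`, the items in `ANNOTATIONS`, the information-content (Resnik-style) term similarity, and the averaging of best matches are not taken from the patent, which does not name a specific measure in this section. The P-value here is computed by brute-force enumeration of all queries of the same size, that is, the slow baseline that the abstract says the method accelerates, not the accelerated method itself.

```python
import math
from itertools import combinations

# Hypothetical toy ontology: term -> set of parent terms (a DAG rooted at "root").
PARENTS = {
    "root": set(),
    "a": {"root"},
    "b": {"root"},
    "a1": {"a"},
    "a2": {"a"},
    "b1": {"b"},
}

# Hypothetical items with their direct annotations.
ANNOTATIONS = {
    "doc1": {"a1"},
    "doc2": {"a2", "b1"},
    "doc3": {"b1"},
    "doc4": {"a1", "a2"},
}


def ancestors(term):
    """Return the term together with all of its ancestors."""
    seen = {term}
    stack = [term]
    while stack:
        for parent in PARENTS[stack.pop()]:
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def information_content():
    """IC(t) = log(N / n_t), where n_t counts items annotated to t or a descendant."""
    counts = {t: 0 for t in PARENTS}
    for terms in ANNOTATIONS.values():
        implied = set().union(*(ancestors(t) for t in terms))
        for t in implied:
            counts[t] += 1
    n = len(ANNOTATIONS)
    # Terms with no annotated items get IC 0.0 here (a simplifying assumption).
    return {t: math.log(n / c) if c else 0.0 for t, c in counts.items()}


IC = information_content()


def term_similarity(t1, t2):
    """Assumed measure: IC of the most informative common ancestor."""
    return max(IC[t] for t in ancestors(t1) & ancestors(t2))


def score(query, item_terms):
    """Average, over query terms, of the best match among the item's terms."""
    best = (max(term_similarity(q, t) for t in item_terms) for q in query)
    return sum(best) / len(query)


def p_value(query, item_terms):
    """Fraction of all same-size queries scoring at least as high (brute force)."""
    observed = score(query, item_terms)
    all_scores = [score(q, item_terms)
                  for q in combinations(sorted(PARENTS), len(query))]
    hits = sum(s >= observed - 1e-12 for s in all_scores)
    return hits / len(all_scores)


def rank(query):
    """Items ordered by ascending P-value (most significant first)."""
    scored = {item: p_value(query, terms) for item, terms in ANNOTATIONS.items()}
    return sorted(scored.items(), key=lambda kv: kv[1])
```

Calling `rank(("a1",))` returns the items with their P-values, smallest first. The enumeration grows combinatorially with ontology and query size, which is why a faster way of obtaining the score distribution matters in practice.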
20 Claims
1. A method of scoring items comprising
obtaining a plurality of items having annotations that are semantically related to the terms in a query by using one or more ontologies and said annotations in order to determine the statistical significance of similarities between query terms and sets of terms used in the annotations of the items, assigning a P-value to the probability of obtaining the observed semantic similarity score for each obtained item, and scoring the items according to their individual P-values.
2. A method of scoring items comprising the steps of:
obtaining a plurality of items that are semantically related to the terms in a query by using one or more ontologies and annotations to them in order to determine the statistical significance of similarity scores between query terms and sets of terms used to annotate the items, collapsing portions of said ontologies that contribute equally to final score by identifying ontology terms, which contribute equally to the score, and by calculating their contribution to the score distribution, and using the calculated contribution of said collapsed portions to determine the statistical significance of similarities between said query terms and said sets of terms used to annotate the items.
3. A method of scoring items comprising the steps of:
obtaining a plurality of items that are semantically related to the terms in a query by using one or more attribute ontologies and annotations to them in order to determine the similarities between query terms and sets of terms used to annotate the items, and using ontological similarity score measures derived from attribute ontologies to rank said items according to one or more ontologies.
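The collapsing step recited in claim 2 can be illustrated with a small sketch. This is one possible reading of the claim language, not the implementation described in the specification. It assumes that, for a fixed item, each ontology term contributes a single value to the query score, that the query score is the sum of those contributions, and that contributions are integers (for example, similarities rounded to a fixed precision) so that they can be grouped exactly. The names `score_distribution`, `p_value`, and the example contributions are hypothetical. Terms with the same contribution are collapsed into one group, and the exact score distribution is built from the groups rather than from every individual term combination.

```python
from collections import Counter
from math import comb


def score_distribution(contributions, query_size):
    """Count how many size-`query_size` term subsets reach each total score.

    `contributions` maps each term to its (integer) contribution for one item.
    Terms with equal contributions are collapsed into (value, multiplicity) groups.
    """
    groups = Counter(contributions.values())  # value -> number of terms with it
    # dist[k][total] = number of ways to pick k terms whose contributions sum to total
    dist = [Counter() for _ in range(query_size + 1)]
    dist[0][0] = 1
    for value, multiplicity in groups.items():
        new = [Counter(d) for d in dist]  # case: pick nothing from this group
        for already in range(query_size + 1):
            for total, ways in dist[already].items():
                limit = min(multiplicity, query_size - already)
                for k in range(1, limit + 1):
                    # Pick k of the `multiplicity` equivalent terms in this group.
                    new[already + k][total + k * value] += ways * comb(multiplicity, k)
        dist = new
    return dist[query_size]


def p_value(contributions, query_size, observed_total):
    """Probability that a random same-size query scores at least `observed_total`."""
    dist = score_distribution(contributions, query_size)
    hits = sum(ways for total, ways in dist.items() if total >= observed_total)
    return hits / comb(len(contributions), query_size)


# Hypothetical contributions of six terms for one item: only three distinct values.
example = {"t1": 0, "t2": 0, "t3": 0, "t4": 3, "t5": 3, "t6": 7}
```

With `example`, `score_distribution(example, 2)` covers all 15 two-term queries while iterating over three groups instead of six terms. The saving grows when large parts of an ontology contribute the same value, which is the situation the claim's "portions ... that contribute equally" appears to describe.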
Specification