Process and system for determining relevance
First Claim
1. A process for determining relevance between two documents implemented using an electronic system, the process comprising:
- providing a first feature vector representing a first document;
providing a second feature vector representing a second document;
providing an indexing parameter;
providing a parametric family of sampling distributions for the first feature vector using the indexing parameter;
providing a parametric family of sampling distributions for the second feature vector using the indexing parameter;
providing a prior distribution of the indexing parameter;
assigning a distribution of the indexing parameter, given the second feature vector and an event that the first document is not relevant to the second document, the value of the prior distribution of the indexing parameter;
assigning a distribution of the indexing parameter, given the second feature vector and an event that the first document is relevant to the second document, the value of the posterior distribution of the indexing parameter given the second feature vector;
generating a log likelihood ratio that the first document is relevant to the second document using the two assigned distributions of the indexing parameter; and
storing the log likelihood ratio as representing relevance between the first document and the second document.
4 Assignments
0 Petitions
Accused Products
Abstract
A process is provided for determining relevance using an electronic system. The process includes providing a first feature vector, providing a second feature vector, and providing an indexing parameter. A parametric family of sampling distributions are provided for the first feature vector using the indexing parameter. A parametric family of sampling distributions are also provided for the second feature vector using the indexing parameter. The process further includes providing a prior distribution of the indexing parameter. A distribution of the indexing parameter, given the second feature vector and an event that the first feature vector is not relevant to the second feature vector, is assigned the value of the prior distribution of the indexing parameter. A distribution of the indexing parameter, given the second feature vector and an event that the first feature vector is relevant to the second feature vector, is assigned the value of the posterior distribution of the indexing parameter given the second feature vector. A log likelihood ratio that the first feature vector is relevant to the second feature vector is then generated using the two assigned distributions of the indexing parameter. The log likelihood ratio is stored as representing relevance between the first feature vector and the second feature vector.
-
Citations
22 Claims
-
1. A process for determining relevance between two documents implemented using an electronic system, the process comprising:
-
providing a first feature vector representing a first document; providing a second feature vector representing a second document; providing an indexing parameter; providing a parametric family of sampling distributions for the first feature vector using the indexing parameter; providing a parametric family of sampling distributions for the second feature vector using the indexing parameter; providing a prior distribution of the indexing parameter; assigning a distribution of the indexing parameter, given the second feature vector and an event that the first document is not relevant to the second document, the value of the prior distribution of the indexing parameter; assigning a distribution of the indexing parameter, given the second feature vector and an event that the first document is relevant to the second document, the value of the posterior distribution of the indexing parameter given the second feature vector; generating a log likelihood ratio that the first document is relevant to the second document using the two assigned distributions of the indexing parameter; and storing the log likelihood ratio as representing relevance between the first document and the second document. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer system operable to determine relevance between two documents, comprising:
-
a memory operable to store program instructions and data; a first feature vector representing a first document, the first feature vector stored in the memory; a second feature vector representing a second document, the second feature vector stored in the memory; an indexing parameter, the indexing parameter stored in the memory; a parametric family of sampling distributions for the first feature vector using the indexing parameter, the parametric family stored in the memory; a parametric family of sampling distributions for the second feature vector using the indexing parameter, the parametric family stored in the memory; a prior distribution for the indexing parameter, the prior distribution stored in memory; and a processor coupled to the memory and operable to access the program instructions and data, the processor operable to perform a process under control of the program instructions for; assigning a distribution of the indexing parameter, given the second feature vector and an event that the first document is not relevant to the second document, the value of the prior distribution for the indexing parameter; assigning a distribution of the indexing parameter, given the second feature vector and an event that the first document is relevant to the second document, the value of a distribution of the indexing parameter given the second document; determining a log likelihood ratio that the first document is relevant to the second document using the two assigned distributions of the indexing parameter; and storing the log likelihood ratio in the memory as representing a relevance between the first document and the second document. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. A relevance generation system operable to determine relevance between two documents, comprising:
-
a first feature vector representing a first document; a second feature vector representing a second document; an indexing parameter; a parametric family of sampling distributions for the first feature vector using the indexing parameter; a parametric family of sampling distributions for the second feature vector using the indexing parameter; a prior distribution for the indexing parameter; and a relevance generator operable to access the first feature vector, the second feature vector, the parametric families and the prior distribution, the relevance generator operable to; assign a distribution of the indexing parameter, given the second feature vector and an event that the first document is not relevant to the second document, the value of the prior distribution for the indexing parameter; assign a distribution of the indexing parameter, given the second feature vector and an event that the first document is relevant to the second document, the value of a distribution of the indexing parameter given the second document; generate a log likelihood ratio that the first document is relevant to the second document using the two assigned distributions of the indexing parameter; and store the log likelihood ratio as representing a relevance between the first document and the second document. - View Dependent Claims (16, 17, 18, 19, 20)
-
-
21. A process for determining relevance implemented using an electronic system, the process comprising:
-
providing a first feature vector; providing a second feature vector; providing an indexing parameter; providing a parametric family of sampling distributions for the first feature vector using the indexing parameter; providing a parametric family of sampling distributions for the second feature vector using the indexing parameter; providing a prior distribution of the indexing parameter; assigning a distribution of the indexing parameter, given the second feature vector and an event that the first feature vector is not relevant to the second feature vector, the value of the prior distribution of the indexing parameter; assigning a distribution of the indexing parameter, given the second feature vector and an event that the first feature vector is relevant to the second feature vector, the value of the posterior distribution of the indexing parameter given the second feature vector; generating a log likelihood ratio that the first feature vector is relevant to the second feature vector using the two assigned distributions of the indexing parameter; and storing the log likelihood ratio as representing relevance between the first feature vector and the second feature vector. - View Dependent Claims (22)
-
Specification