Apparatus and method for term context modeling for information retrieval
First Claim
1. A method for computing similarity between a first text object and a second text object, the method comprising:
a. Using the first text object to derive a context model associated with the first text object; and
b. Using the derived context model to compute similarity between the first text object and the second text object.
2 Assignments
0 Petitions
Abstract
A novel method for going beyond the observed properties of a keyword, to a model in which the presence of a term in a document is assessed not by looking at the actual occurrence of that term, but by a set of non-independent supporting terms, defining the context. In other words, similarity is determined not by properties of the keyword, but by properties of the keyword's context. This yields a scoring for documents which is useful for ad hoc retrieval and, by extension, any information retrieval task where keyword-based similarity is needed.
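The idea in the abstract can be illustrated with a minimal Python sketch. The abstract does not say what form the context model takes, so the logistic form below is an assumption, and the function name `context_score`, the `bias` parameter, and the hand-picked weights are all hypothetical. The sketch scores a document for a term using only supporting terms and deliberately ignores whether the term itself occurs.

```python
import math

def context_score(term, doc_tokens, weights, bias=0.0):
    """Estimate how strongly a document's context supports `term`.

    The term's own occurrences are ignored on purpose; only the
    supporting terms listed in `weights` contribute to the score.
    """
    present = set(doc_tokens) - {term}
    activation = bias + sum(w for t, w in weights.items() if t in present)
    # Logistic squashing (an assumption of this sketch) maps the sum to (0, 1).
    return 1.0 / (1.0 + math.exp(-activation))

# Hypothetical, hand-picked context weights for "jaguar" in the animal sense.
jaguar_context = {"rainforest": 1.2, "predator": 0.9, "cat": 0.7, "engine": -1.5}

doc_a = "the spotted cat is a top predator of the rainforest".split()
doc_b = "the jaguar has a new engine and a leather interior".split()

score_a = context_score("jaguar", doc_a, jaguar_context)
score_b = context_score("jaguar", doc_b, jaguar_context)
```

In this toy setup `doc_a` never mentions the keyword yet scores higher than `doc_b`, which does mention it, because only the surrounding terms are consulted.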
66 Citations
38 Claims
1. A method for computing similarity between a first text object and a second text object, the method comprising:
a. Using the first text object to derive a context model associated with the first text object; and
b. Using the derived context model to compute similarity between the first text object and the second text object.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16
17. A method for automatic induction of a context model for a term, the method comprising:
a. Selecting a feature set to contain only a feature for the term with no context;
b. Setting the initial weight to one for the feature;
c. Updating the weight for the feature; and
d. Performing feature induction.
Dependent claims: 18, 19
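The four steps of claim 17 can be sketched in Python as follows. The claim does not specify the model family, the weight update rule, or the induction criterion, so this sketch assumes a logistic model, plain gradient ascent, and greedy selection by log-likelihood gain. All names (`induce_context_model`, `update_weights`, the `"BIAS"` key standing in for the context-free feature) and the toy documents are hypothetical.

```python
import math

def predict(doc, weights):
    # "BIAS" is the context-free feature: it fires on every document.
    act = weights["BIAS"] + sum(
        w for f, w in weights.items() if f != "BIAS" and f in doc)
    return 1.0 / (1.0 + math.exp(-act))

def log_likelihood(docs, labels, weights):
    total = 0.0
    for doc, y in zip(docs, labels):
        p = min(max(predict(doc, weights), 1e-12), 1.0 - 1e-12)
        total += math.log(p) if y else math.log(1.0 - p)
    return total

def update_weights(docs, labels, weights, steps=200, lr=0.5):
    # Gradient ascent on the log-likelihood (an assumed update rule).
    for _ in range(steps):
        grad = {f: 0.0 for f in weights}
        for doc, y in zip(docs, labels):
            err = y - predict(doc, weights)
            for f in weights:
                if f == "BIAS" or f in doc:
                    grad[f] += err
        for f in weights:
            weights[f] += lr * grad[f] / len(docs)
    return weights

def induce_context_model(term, docs, rounds=2):
    labels = [1 if term in d else 0 for d in docs]
    contexts = [set(d) - {term} for d in docs]   # hide the term itself
    weights = {"BIAS": 1.0}                      # steps a and b
    update_weights(contexts, labels, weights)    # step c
    candidates = set().union(*contexts)
    for _ in range(rounds):                      # step d: greedy induction
        base = log_likelihood(contexts, labels, weights)
        best, best_gain = None, 0.0
        for cand in candidates - set(weights):
            trial = dict(weights)
            trial[cand] = 0.0
            update_weights(contexts, labels, trial, steps=50)
            gain = log_likelihood(contexts, labels, trial) - base
            if gain > best_gain:
                best, best_gain = cand, gain
        if best is None:
            break
        weights[best] = 0.0
        update_weights(contexts, labels, weights)
    return weights

# Toy corpus (hypothetical): lowercase token lists, so "BIAS" cannot collide.
docs = [s.split() for s in [
    "jaguar rainforest predator", "jaguar rainforest river",
    "jaguar rainforest cat", "car engine road",
    "car engine river", "predator road cat"]]
model = induce_context_model("jaguar", docs, rounds=1)
```

On this corpus the single induction round adds "rainforest", the only term that co-occurs with every "jaguar" document and no other, and it receives a positive weight. A real system would need a stopping criterion and regularization, neither of which the claim text describes.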
20. A computer programming product embodied on a computer readable medium, for computing similarity between a first text object and a second text object, the computer programming product comprising:
a. Code for using the first text object to derive a context model associated with the first text object; and
b. Code for using the derived context model to compute similarity between the first text object and the second text object.
Dependent claims: 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35
36. A computer programming product embodied on a computer readable medium, for automatic induction of a context model for a term, the computer programming product comprising:
a. Code for selecting a feature set to contain only a feature for the term with no context;
b. Code for setting the initial weight to one for the feature;
c. Code for updating the weight for the feature; and
d. Code for performing feature induction.
Dependent claims: 37, 38
Specification