System and method for measuring the quality of document sets
First Claim
1. A method for improving interaction with a collection of information, the method comprising:
- providing an interface wherein interaction with the collection of information occurs through the interface;
generating, by a computer system, a set of results based, at least in part, on a first interpretation of the interaction with the collection of information, wherein the set of results includes a plurality of results, and the set of results having a set size;
evaluating, by the computer system, the set of results using a measure of the distinctiveness of the set of results;
generating, by the computer system, at least one candidate set having a plurality of results and a candidate set size, based at least in part, on a second interpretation of the interaction with the collection of information;
comparing, by the computer system, the measure of distinctiveness of the set of results against a measure of distinctiveness of the at least one candidate set;
generating a second result for the interaction with the collection of information from at least the set of results and the at least one candidate set based on the comparison of the measures of distinctiveness; and
outputting, by the computer system, the second result in response to the act of comparing.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods are described that calculate the interestingness of a set of one or more records in a database, either absolutely (i.e., compared to an overall collection of records) or relative to some other set of records. In one embodiment, the measure is a relative entropy value that has been normalized. Various applications of the measure are described in the context of an information retrieval system. These applications include, for example, guiding query interpretation, guiding view selection and summarization, intelligent ranges, event detection, concept triggers and interpreting user actions, hierarchy discovery, and adaptive data mining.
86 Citations
55 Claims
-
1. A method for improving interaction with a collection of information, the method comprising:
providing an interface wherein interaction with the collection of information occurs through the interface; generating, by a computer system, a set of results based, at least in part, on a first interpretation of the interaction with the collection of information, wherein the set of results includes a plurality of results, and the set of results having a set size;
evaluating, by the computer system, the set of results using a measure of the distinctiveness of the set of results;generating, by the computer system, at least one candidate set having a plurality of results and a candidate set size, based at least in part, on a second interpretation of the interaction with the collection of information; comparing, by the computer system, the measure of distinctiveness of the set of results against a measure of distinctiveness of the at least one candidate set;
generating a second result for the interaction with the collection of information from at least the set of results and the at least one candidate set based on the comparison of the measures of distinctiveness; andoutputting, by the computer system, the second result in response to the act of comparing. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28)
-
29. A non-transitory computer-readable medium having computer-readable instructions stored thereon that define instructions that, as a result of being executed by a computer, instruct the computer to perform a method for improving interaction with a collection of information, the method comprising the acts of:
-
providing an interface wherein interaction with the collection of information occurs through the interface; generating a set of results based, at least in part, on a first interpretation of the interaction with the collection of information, wherein the set of results includes a plurality of results, and the set of results having a set size; evaluating the set of results using a measure of the distinctiveness of the set of results; generating at least one candidate set, having a plurality of results and a candidate set size, based at least in part, on a second interpretation of the interaction with the collection of information; comparing the measure of distinctiveness of the set of results against a measure of distinctiveness of the at least one candidate set, wherein the act of comparing includes generating a second result for the interaction with the collection of information from at least the set of results and the at least one candidate set; and
outputting the second result in response to the act of comparing.
-
-
30. A system for improving interaction with a collection of information, the system comprising:
-
at least one processor operatively connected to a memory, wherein the system is configured to execute system components, and the system further comprises; an I/O engine adapted to output at least a portion of an interactive display, wherein the I/O engine is further adapted to output at least one option in response to the comparison made by an analysis engine; a data retrieval engine adapted to generate a set of results, having a plurality of results and a set size, based, at least in part, on a first interpretation of an interaction with the collection of information; an analysis engine adapted to evaluate the set of results using a measure of distinctiveness, wherein the analysis engine is further adapted to compare the measure of distinctiveness for the set of results against a measure of distinctiveness of a candidate set; and a generation engine adapted to generate at least one candidate set, having a plurality of results and a candidate set size, based, at least in part, on a second interpretation of the interaction with the collection of information, wherein the generation engine is further configured to generate a second result for the interaction with the collection of information from at least the set of results and the at least one candidate set based on the compared measures of distinctiveness. - View Dependent Claims (31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55)
-
Specification