Clustering fuzzy expected value system
First Claim
1. A computer system for processing a database having language text records to evaluate their relevance to a predetermined subject of interest, the system comprising:
- processors for processing instructions and the text records;
database storage means for storing the text records in the database;
a unique word generator for generating and storing a list of unique words contained in the database;
a relevant word generator for generating and storing a list of relevant words including a system user'"'"'s selections from the stored list of unique words;
a modified database generator for generating and storing a modified database, the modified database including the list of relevant words and synonyms, if any, associated with each word in the list of relevant words, whereinafter the unique word generator generates a list of unique words of the modified database;
a relevant word table including the list of unique words of the modified database; and
means for calculating and storing confidence values associated with each word in the relevant word table, each of the confidence values based on input values from a plurality of personnel other than the system user that reflects their perceptions of a relevance of said each word in the relevant word table to the subject of interest;
said means for calculating and storing confidence values including a clustering fuzzy expected value system for determining a membership grade for said each word in the relevant word table;
said clustering fuzzy expected value system determining said membership grade for a word in the relevant word table including grouping the confidence values for said word in the relevant word table into a plurality of clusters according to a predetermined formula, determining a mean of all the confidence values for said word in the relevant word table, and determining a plurality of mean confidence values each associated with one of said plurality of clusters of said confidence values.
1 Assignment
0 Petitions
Accused Products
Abstract
A system provides a tool for computing the most typical fuzzy expected value of a membership function in a fuzzy set. The clustering fuzzy expected value system is used in a question answering system. CFEV is computed by the tool is based on grouping of individual responses, that meet certain criteria, to clusters. Each cluster is considered a "super response" and contributes to the result proportionally to its relative size and the difference in opinion from the mean of the entire sample. In so doing, CFEV represents the opinion of the majority of the population, but it also respects the opinion of the minority. A comparison is made with existed tools such as the FEV and the WFEV and the advantages of CFEV are demonstrated by examples for cases where other methods fail to perform.
-
Citations
13 Claims
-
1. A computer system for processing a database having language text records to evaluate their relevance to a predetermined subject of interest, the system comprising:
-
processors for processing instructions and the text records; database storage means for storing the text records in the database; a unique word generator for generating and storing a list of unique words contained in the database; a relevant word generator for generating and storing a list of relevant words including a system user'"'"'s selections from the stored list of unique words; a modified database generator for generating and storing a modified database, the modified database including the list of relevant words and synonyms, if any, associated with each word in the list of relevant words, whereinafter the unique word generator generates a list of unique words of the modified database; a relevant word table including the list of unique words of the modified database; and means for calculating and storing confidence values associated with each word in the relevant word table, each of the confidence values based on input values from a plurality of personnel other than the system user that reflects their perceptions of a relevance of said each word in the relevant word table to the subject of interest; said means for calculating and storing confidence values including a clustering fuzzy expected value system for determining a membership grade for said each word in the relevant word table; said clustering fuzzy expected value system determining said membership grade for a word in the relevant word table including grouping the confidence values for said word in the relevant word table into a plurality of clusters according to a predetermined formula, determining a mean of all the confidence values for said word in the relevant word table, and determining a plurality of mean confidence values each associated with one of said plurality of clusters of said confidence values. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method for evaluating a relevance of a database having text records to a predetermined subject of interest, the method comprising computer executed steps of:
-
compiling and storing a list of unique words contained in the database having text records; extracting and storing from the list of unique words a list of relevant words selected by a user of the method; storing confidence values furnished by personnel other than the user who are familiar with the subject of interest that reflect their perceptions of a relevance of each word in the list of relevant words to the subject of interest; and calculating and storing a membership grade for each word in the list of relevant words, including grouping the confidence values for one word of the list of relevant words into a plurality of clusters according to a predetermined formula, determining a mean of the confidence values for said one word, and determining for each of said clusters a mean of the confidence values in each of said clusters. - View Dependent Claims (10, 11, 12, 13)
-
Specification