Determining Words Related To A Given Set Of Words
First Claim
Patent Images
1. An apparatus comprising:
- an interface configured to receive an initial set of terms of a query; and
a processor configured to perform a method comprising;
initializing a set of candidate terms;
performing the following for a predetermined number of iterations;
obtaining a predetermined number of temporary terms for each candidate term of the set of candidate terms, the predetermined number of temporary terms having a higher differential affinity to the each candidate term;
placing each temporary term and the associated higher differential affinity into a set of temporary terms, if a temporary term is obtained from more than one candidate term, the differential affinity of the temporary term being related to the differential affinities to the more than one candidate term;
calculating an average differential affinity for each temporary term of the set of temporary terms, the average differential affinity representing an average of differential affinities from the each temporary term to every term of the initial set of terms;
removing from the temporary set one or more terms with an average differential affinity that fails to satisfy a predetermined threshold; and
placing one or more terms of the temporary set with higher differential affinities into the set of candidate terms;
selecting one or more terms of the set of candidate terms; and
returning the one or more selected terms.
1 Assignment
0 Petitions
Accused Products
Abstract
In one embodiment, display of a user entry window of a graphical user interface is initiated. Search terms entered into the user entry window to initiate a first search are received. One or more first search results from a corpus of documents are determined according to the search terms. Display of the search terms at a current search terms window of the graphical user interface is initiated. Display of the first search results at a search results window of the graphical user interface is initiated. Display of the first search suggestions at a search suggestion window of the graphical user interface is initiated.
125 Citations
50 Claims
-
1. An apparatus comprising:
-
an interface configured to receive an initial set of terms of a query; and a processor configured to perform a method comprising; initializing a set of candidate terms; performing the following for a predetermined number of iterations; obtaining a predetermined number of temporary terms for each candidate term of the set of candidate terms, the predetermined number of temporary terms having a higher differential affinity to the each candidate term; placing each temporary term and the associated higher differential affinity into a set of temporary terms, if a temporary term is obtained from more than one candidate term, the differential affinity of the temporary term being related to the differential affinities to the more than one candidate term; calculating an average differential affinity for each temporary term of the set of temporary terms, the average differential affinity representing an average of differential affinities from the each temporary term to every term of the initial set of terms; removing from the temporary set one or more terms with an average differential affinity that fails to satisfy a predetermined threshold; and placing one or more terms of the temporary set with higher differential affinities into the set of candidate terms; selecting one or more terms of the set of candidate terms; and returning the one or more selected terms. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. An method comprising:
-
receiving an initial set of terms of a query; initializing a set of candidate terms; performing the following for a predetermined number of iterations; obtaining a predetermined number of temporary terms for each candidate term of the set of candidate terms, the predetermined number of temporary terms having a higher differential affinity to the each candidate term; placing each temporary term and the associated higher differential affinity into a set of temporary terms, if a temporary term is obtained from more than one candidate term, the differential affinity of the temporary term being related to the differential affinities to the more than one candidate term; calculating an average differential affinity for each temporary term of the set of temporary terms, the average differential affinity representing an average of differential affinities from the each temporary term to every term of the initial set of terms; removing from the temporary set one or more terms with an average differential affinity that fails to satisfy a predetermined threshold; and placing one or more terms of the temporary set with higher differential affinities into the set of candidate terms; selecting one or more terms of the set of candidate terms; and returning the one or more selected terms. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. An apparatus comprising:
-
a tangible computer readable medium configured to store a set of terms; and a processor configured to; determine a term frequency-inverse document frequency (TF-IDF) value for each term of the set of terms of a document; sort the terms according to the TF-IDF values to yield a sorted list, a first term with the highest TF-IDF value being at the top of the sorted list, a second term with the lowest TF-IDF value being at the bottom of the sorted list; calculate a difference in TF-IDF values of successive terms for each of the successive terms of the sorted list to yield a plurality of differences; calculate a standard deviation of the differences; remove one or more terms from the sorted list according to the standard deviation; and output the remaining terms. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
-
21. A method comprising:
-
determining a term frequency-inverse document frequency (TF-IDF) value for each term of a set of terms of a document; sorting the terms according to the TF-IDF values to yield a sorted list, a first term with the highest TF-IDF value being at the top of the sorted list, a second term with the lowest TF-IDF value being at the bottom of the sorted list; calculating a difference in TF-IDF values of successive terms for each of the successive terms of the sorted list to yield a plurality of differences; calculating a standard deviation of the differences; removing one or more terms from the sorted list according to the standard deviation; and outputting the remaining terms. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28)
-
-
29. An apparatus comprising:
-
a tangible computer readable medium configured to store a corpus of documents; and a processor configured to; initiate display of a user entry window of a graphical user interface; receive a plurality of search terms entered into the user entry window to initiate a first search; determine one or more first search results from the corpus according to the search terms; initiate display of the search terms at a current search terms window of the graphical user interface; initiate display of the one or more first search results at a search results window of the graphical user interface; and initiate display of the one or more first search suggestions at a search suggestion window of the graphical user interface. - View Dependent Claims (30, 31, 32, 33, 34, 35, 36, 37, 38, 39)
-
-
40. A method comprising:
-
initiating display of a user entry window of a graphical user interface; receiving a plurality of search terms entered into the user entry window to initiate a first search; determining one or more first search results from a corpus of documents according to the search terms; initiating display of the search terms at a current search terms window of the graphical user interface; initiating display of the one or more first search results at a search results window of the graphical user interface; and initiating display of the one or more first search suggestions at a search suggestion window of the graphical user interface. - View Dependent Claims (41, 42, 43, 44, 45, 46, 47, 48, 49, 50)
-
Specification