Term suggestion for multi-sense query
First Claim
Patent Images
1. A computer-implemented method for related term suggestion, the method comprising:
- mining search results via a multi-sense query, wherein the multi-sense query comprises;
determining terms/phrases semantically related to submitted terms/phrases, wherein semantic relationships are discovered by mining a context of the terms/phrases to determine meaning;
configuring a threshold frequency of occurrence (FOO) value;
assigning historical queries to high FOO or low FOO based on the configured threshold value;
generating term vectors from the search results associated with a set of high FOO historical queries previously submitted to a search engine; and
generating term clusters as a function of calculated similarity of term vectors, wherein calculated similarity, sim(qj, qk), is determined as follows;
wherein d represents vector dimension, q represents a query, k is a dimension index, and wherein weight w for the ith vector'"'"'s jth term is calculated as follows;
wij=TFij×
log(N/DFj); and
wherein TFij represents term frequency, N is a total number of query terms, and DFj is a number of extracted feature records that contain the ith vector'"'"'s jth term;
responsive to receiving a term/phrase from an entity, evaluating the term/phrase via the multi-sense query in view of terms/phrases in the term clusters to identify one or more related term suggestions, wherein the identifying is based on a combination of FOO and a confidence value; and
returning at least one suggested term list ordered by the combination of FOO and confidence value, wherein multiple suggested term lists are generated when the term/phrase matches terms in more than one term cluster.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for related term suggestion are described. In one aspect, term clusters are generated as a function of calculated similarity of term vectors. Each term vector having been generated from search results associated with a set of high frequency of occurrence (FOO) historical queries previously submitted to a search engine. Responsive to receiving a term/phrase from an entity, the term/phrase is evaluated in view of terms/phrases in the term clusters to identify one or more related term suggestions.
-
Citations
43 Claims
-
1. A computer-implemented method for related term suggestion, the method comprising:
-
mining search results via a multi-sense query, wherein the multi-sense query comprises; determining terms/phrases semantically related to submitted terms/phrases, wherein semantic relationships are discovered by mining a context of the terms/phrases to determine meaning; configuring a threshold frequency of occurrence (FOO) value; assigning historical queries to high FOO or low FOO based on the configured threshold value; generating term vectors from the search results associated with a set of high FOO historical queries previously submitted to a search engine; and generating term clusters as a function of calculated similarity of term vectors, wherein calculated similarity, sim(qj, qk), is determined as follows; wherein d represents vector dimension, q represents a query, k is a dimension index, and wherein weight w for the ith vector'"'"'s jth term is calculated as follows;
wij=TFij×
log(N/DFj); andwherein TFij represents term frequency, N is a total number of query terms, and DFj is a number of extracted feature records that contain the ith vector'"'"'s jth term; responsive to receiving a term/phrase from an entity, evaluating the term/phrase via the multi-sense query in view of terms/phrases in the term clusters to identify one or more related term suggestions, wherein the identifying is based on a combination of FOO and a confidence value; and returning at least one suggested term list ordered by the combination of FOO and confidence value, wherein multiple suggested term lists are generated when the term/phrase matches terms in more than one term cluster. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A tangible computer-readable data storage medium comprising computer-executable instructions for executing a method, the method comprising:
-
mining search results via a multi-sense query, wherein the multi-sense query comprises; determining terms/phrases semantically related to submitted terms/phrases, wherein semantic relationships are discovered by mining a context of the terms/phrases to determine meaning; configuring a threshold frequency of occurrence (FOO) value; assigning historical queries to high FOO or low FOO based on the configured threshold value; generating term vectors from the search results associated with a set of high FOO historical queries previously submitted to a search engine; and generating term clusters as a function of calculated similarity of term vectors, wherein calculated similarity, sim(qj, qk), is determined as follows; wherein d represents vector dimension, q represents a query, k is a dimension index, and wherein weight w for the ith vector'"'"'s jth term is calculated as follows;
wij=TFij×
log(N/DFj); andwherein TFj represents term frequency, N is a total number of query terms, and DFj is a number of extracted feature records that contain the ith vector'"'"'s jth term; responsive to receiving a term/phrase from an entity, evaluating the term/phrase via the multi-sense query in view of terms/phrases in the term clusters to identify one or more related term suggestions, wherein the identifying is based on a combination of FOO and a confidence value; and returning at least one suggested term list ordered by the combination of FOO and confidence value, wherein multiple suggested term lists are generated when the term/phrase matches terms in more than one term cluster. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
-
-
25. A computing device comprising:
-
a processor; and a memory couple to the processor, the memory comprising computer-program instructions executable by the processor for; mining search results via a multi-sense query, wherein the multi-sense query comprises; determining terms/phrases semantically related to submitted terms/phrases, wherein semantic relationships are discovered by mining a context of the terms/phrases to determine meaning; configuring a threshold frequency of occurrence (FOO) value; assigning historical queries to high FOO or low FOO based on the configured threshold value; generating term vectors from the search results associated with a set of high FOO historical queries previously submitted to a search engine; and generating term clusters as a function of calculated similarity of term vectors, wherein calculated similarity, sim(qi, qk), is determined as follows; wherein d represents vector dimension, q represents a query, k is a dimension index, and wherein weight w for the ith vector'"'"'s jth term is calculated as follows;
wij=TFij×
log(N/DFj); andwherein TFj represents term frequency, N is a total number of query terms, and DFj is a number of extracted feature records that contain the ith vector'"'"'s jth term; responsive to receiving a term/phrase from an entity, evaluating the term/phrase via the multi-sense query in view of terms/phrases in the term clusters to identify one or more related term suggestions, wherein the identifying is based on a combination of FOO and a confidence value; and returning at least one suggested term list ordered by the combination of FOO and confidence value, wherein multiple suggested term lists are generated when the term/phrase matches terms in more than one term cluster. - View Dependent Claims (26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36)
-
-
37. A computing device comprising at least one processor, the device further comprising:
-
means for mining search results via a multi-sense query, wherein the multi-sense query comprises; means for determining terms/phrases semantically related to submitted terms/phrases, wherein semantic relationships are discovered by mining a context of the terms/phrases to determine meaning; means for configuring a threshold frequency of occurrence (FOO) value; means for assigning historical queries to high FOO or low FOO based on the configured threshold value; means for generating term vectors from the search results associated with a set of high FOO historical queries previously submitted to a search engine; and means for generating term clusters as a function of calculated similarity of term vectors, wherein calculated similarity, sim(qi, qk), is determined as follows; wherein d represents vector dimension, q represents a query, k is a dimension index, and wherein weight w for the ith vector'"'"'s jth term is calculated as follows;
wij=TFij×
log(N/DFj); andwherein TFj represents term frequency, N is a total number of query terms, and DFj is a number of extracted feature records that contain the ith vector'"'"'s jth term; responsive to receiving a term/phrase from an entity, means for evaluating the term/phrase via the multi-sense query in view of terms/phrases in the term clusters to identify one or more related term suggestions, wherein the identifying is based on a combination of FOO and a confidence value; and means for returning at least one suggested term list ordered by the combination of FOO and confidence value, wherein multiple suggested term lists are generated when the term/phrase matches terms in more than one term cluster. - View Dependent Claims (38, 39, 40, 41, 42, 43)
-
Specification