System and method for determining semantically related terms based on sequences of search queries
First Claim
1. A method for determining one or more semantically related terms based on sequences of search queries, the method comprising:
- examining search query information to associate search queries with a user submitting the search query;
establishing a plurality of sequences of search queries, each sequence of search queries comprising one or more search queries associated with a common user and relating to a common concept;
receiving one or more seed terms;
creating a first set of vectors representing for each search query in the search query information, whether the search query is associated with each sequence of the plurality of sequences;
creating a second set of vectors representing for each search query in the search query information, a weight associated with the search query based on the number of sequences of the plurality of sequences the search query is associated with; and
determining one or more terms semantically related to the seed terms based on the first set of vectors, the second set of vectors, and at least one seed term of the received one or more seed terms;
wherein determining one or more terms semantically related to the seed terms comprises determining a weighted average over each sequence of the plurality of sequences as a function of the first and second set of vectors, and the at least one seed term; and
wherein the weighted average is calculated using the formula;
T=Sum of (V1*VectorSimilarity(V2,S)),wherein V1*VectorSimilarity (V2,S) is calculated for a number of sequences found in the search query information;
V1 is the relevant vector of the first set of vectors indicating for each search query in the search query information, whether a search query is associated with the relevant sequence;
V2 is the relevant vector of the second set of vectors including for each search query in the search query information, an entry indicating a weight of a search query associated with the sequence; and
S is the seed term vector indicating for each search query in the search query information, whether the search query is one of the received seed terms.
9 Assignments
0 Petitions
Accused Products
Abstract
The present disclosure is directed to systems and methods for determining semantically related terms based on sequences of search queries. Generally, a semantically related term tool examines search logs to associate search queries with a user submitting the search query. The semantically related term tool establishes a plurality of sequences of search queries, each sequence of search queries comprising one or more search queries associated with a common user and relating to a common concept. The semantically related term tool receives one or more seed terms and determines one or more terms related to the received seed terms based on the established plurality of sequences of search queries.
33 Citations
14 Claims
-
1. A method for determining one or more semantically related terms based on sequences of search queries, the method comprising:
-
examining search query information to associate search queries with a user submitting the search query; establishing a plurality of sequences of search queries, each sequence of search queries comprising one or more search queries associated with a common user and relating to a common concept; receiving one or more seed terms; creating a first set of vectors representing for each search query in the search query information, whether the search query is associated with each sequence of the plurality of sequences; creating a second set of vectors representing for each search query in the search query information, a weight associated with the search query based on the number of sequences of the plurality of sequences the search query is associated with; and determining one or more terms semantically related to the seed terms based on the first set of vectors, the second set of vectors, and at least one seed term of the received one or more seed terms; wherein determining one or more terms semantically related to the seed terms comprises determining a weighted average over each sequence of the plurality of sequences as a function of the first and second set of vectors, and the at least one seed term; and wherein the weighted average is calculated using the formula;
T=Sum of (V1*VectorSimilarity(V2,S)),wherein V1*VectorSimilarity (V2,S) is calculated for a number of sequences found in the search query information;
V1 is the relevant vector of the first set of vectors indicating for each search query in the search query information, whether a search query is associated with the relevant sequence;
V2 is the relevant vector of the second set of vectors including for each search query in the search query information, an entry indicating a weight of a search query associated with the sequence; and
S is the seed term vector indicating for each search query in the search query information, whether the search query is one of the received seed terms.- View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer-readable storage medium comprising a set of instructions for determining one or more semantically related terms based on sequences of search queries, the set of instructions to direct a processor to perform acts of:
-
examining search query information to associate search queries with a user submitting the search query; establishing a plurality of sequences of search queries, each sequence of search queries comprising one or more search queries associated with a common user and relating to a common concept; receiving one or more seed terms; creating a first set of vectors representing for each search query in the search query information, whether the search query is associated with each sequence of the plurality of sequences; creating a second set of vectors representing for each search query in the search query information, a weight associated with the search query based on the number of sequences of the plurality of sequences the search query is associated with; and determining one or more terms semantically related to the seed terms based on the first set of vectors, the second set of vectors, and at least one seed term of the received one or more seed terms; wherein determining one or more terms semantically related to the seed terms comprises determining a weighted average over each sequence of the plurality of sequences as a function of the first and second set of vectors, and the at least one seed term; and wherein the weighted average is calculated using the formula;
T=Sum of (V1*VectorSimilarity(V2,S)),wherein V1*VectorSimilarity (V2,S) is calculated for a number of sequences found in the search query information;
V1 is the relevant vector of the first set of vectors indicating for each search query in the search query information, whether a search query is associated with the relevant sequence;
V2 is the relevant vector of the second set of vectors including for each search query in the search query information, an entry indicating a weight of a search query associated with the sequence; and
S is the seed term vector indicating for each search query in the search query information, whether the search query is one of the received seed terms.- View Dependent Claims (7)
-
-
8. A system for determining one or more semantically related terms based on sequences of search queries, the system comprising:
-
a search query memory module storing search query information, the search query information associating at least search queries received at a search engine and a user submitting the search queries, a semantically related term tool operative to; access the search query memory module storing search query information and determine a plurality of search sequences, each search sequence comprising one or more search queries submitted by a common user and related to a common concept; create a first set of vectors representing for each search query in the search query information of the search query memory module, whether the search query is associated with each sequence of the plurality of sequences; create a second set of vectors representing for each search query in the search query information of the search query memory module, a weight associated with the search query based on the number of sequences of the plurality of sequences the search query is associated with; and determine one or more terms semantically related to the seed terms based on the first set of vectors, the second set of vectors, and at least one seed term of the received one or more seed terms; wherein to determine one or more terms semantically related to the seed terms, the semantically related term tool is further operative to determine a weighted average over each sequence of the plurality of sequences as a function of the first and second set of vectors, and the at least one seed term; and wherein the weighted average is calculated using the formula;
T=Sum of (V1*VectorSimilarity(V2,S)),wherein V1*VectorSimilarity (V2,S) is calculated for a number of sequences found in the search query information;
V1 is the relevant vector of the first set of vectors indicating for each search query in the search query information, whether a search query is associated with the relevant sequence;
V2 is the relevant vector of the second set of vectors including for each search query in the search query information, an entry indicating a weight of a search query associated with the sequence; and
S is the seed term vector indicating for each search query in the search query information, whether the search query is one of the received seed terms.- View Dependent Claims (9, 10, 11, 12, 13, 14)
-
Specification