REPRESENTING QUERIES AND DETERMINING SIMILARITY BASED ON AN ARIMA MODEL
First Claim
1. A method in a computing device for determining similarity between queries, the method comprising:
storing frequencies of the queries during intervals;
for each of the queries, generating ARIMA coefficients for that query based on the stored frequencies for that query; and
for a pair of queries, calculating a similarity score for the queries based on a correlation between the ARIMA coefficients of the queries.
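The three steps of the claim can be sketched in code. This is a minimal illustration under stated assumptions, not the patented implementation: it simplifies ARIMA to ARIMA(p, d, 0) (differencing plus autoregressive terms, with no moving-average terms), fits the coefficients with the Yule-Walker equations solved by Levinson-Durbin recursion, and uses Pearson correlation as the correlation metric. The function names and the defaults `p=3`, `d=1` are hypothetical.

```python
from math import sqrt


def difference(series, d=1):
    """Apply d rounds of first differencing (the 'I' in ARIMA)."""
    for _ in range(d):
        series = [b - a for a, b in zip(series, series[1:])]
    return series


def autocovariance(series, lag):
    """Biased sample autocovariance of the series at the given lag."""
    n = len(series)
    mean = sum(series) / n
    return sum((series[t] - mean) * (series[t + lag] - mean)
               for t in range(n - lag)) / n


def ar_coefficients(frequencies, p=3, d=1):
    """Fit p autoregressive coefficients to the d-times differenced series.

    Assumption: moving-average terms are omitted, so this is ARIMA(p, d, 0).
    """
    x = difference(list(frequencies), d)
    r = [autocovariance(x, k) for k in range(p + 1)]
    phi, err = [], r[0]
    for k in range(1, p + 1):
        if err <= 0:  # degenerate series: pad remaining coefficients with zeros
            phi = phi + [0.0] * (p - len(phi))
            break
        acc = r[k] - sum(phi[j] * r[k - 1 - j] for j in range(k - 1))
        kappa = acc / err  # reflection coefficient for order k
        phi = [phi[j] - kappa * phi[k - 2 - j] for j in range(k - 1)] + [kappa]
        err *= 1 - kappa * kappa
    return phi


def pearson(u, v):
    """Pearson correlation of two equal-length vectors (0.0 if either is constant)."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    su = sqrt(sum((a - mu) ** 2 for a in u))
    sv = sqrt(sum((b - mv) ** 2 for b in v))
    return cov / (su * sv) if su and sv else 0.0


def similarity(freqs_a, freqs_b, p=3, d=1):
    """Similarity score for a pair of queries from their coefficient vectors."""
    return pearson(ar_coefficients(freqs_a, p, d), ar_coefficients(freqs_b, p, d))
```

In this sketch, two queries whose frequencies differ only by a constant scale factor yield the same coefficients, so their score is 1. That is one reason to compare coefficients rather than raw counts.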
Abstract
Representing queries and determining similarity of queries based on an autoregressive integrated moving average (“ARIMA”) model is provided. A query analysis system represents each query by its ARIMA coefficients. The query analysis system may estimate the frequency information for a desired past or future interval based on frequency information for some initial intervals. The query analysis system may also determine the similarity of a pair of queries based on the similarity of their ARIMA coefficients. The query analysis system may use various metrics, such as a correlation metric, to determine the similarity of the ARIMA coefficients.
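Estimating the frequency for a future interval from the coefficients might look like the following. This is only a sketch, assuming an ARIMA(p, 1, 0) model with no constant term and no moving-average terms. The function name `forecast_frequency` and its parameters are hypothetical, and the coefficients `phi` are taken as given (ordered from the most recent lag to the oldest).

```python
def forecast_frequency(history, phi, steps=1):
    """Forecast the frequency `steps` intervals past the end of `history`.

    Assumes ARIMA(p, 1, 0): the first differences of the frequencies follow
    an autoregression with coefficients `phi`, and forecasts are obtained by
    summing the predicted differences back onto the last observed level.
    """
    diffs = [b - a for a, b in zip(history, history[1:])]
    level = history[-1]
    for _ in range(steps):
        recent = diffs[-len(phi):][::-1]  # most recent difference first
        next_diff = sum(c * x for c, x in zip(phi, recent))
        diffs.append(next_diff)           # feed the prediction back in
        level += next_diff
    return level


# Usage: three observed intervals, one AR coefficient.
estimate = forecast_frequency([10, 12, 15], [0.5], steps=2)
```

Only forward estimation is shown. Estimating a past interval, which the abstract also mentions, would need a separate backward recursion that is not sketched here.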
20 Claims
1. A method in a computing device for determining similarity between queries, the method comprising:
storing frequencies of the queries during intervals;
for each of the queries, generating ARIMA coefficients for that query based on the stored frequencies for that query; and
for a pair of queries, calculating a similarity score for the queries based on a correlation between the ARIMA coefficients of the queries.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9.
10. A computer-readable medium encoded with instructions for controlling a computing device to determine frequency of a query at an interval, by a method comprising:
storing frequencies of the query during intervals;
generating ARIMA coefficients representing the query based on the stored frequencies; and
estimating the frequency of the query at the interval based on the ARIMA coefficients for the query.
Dependent claims: 11, 12, 13, 14, 15, 16, 17.
18. A computing device for representing a query comprising:
a query frequency store having, for each of a plurality of intervals, a frequency of the query during the interval;
a component that generates ARIMA coefficients for the query based on the frequencies of the query during the intervals; and
an ARIMA coefficient store for storing the generated ARIMA coefficients representing the query.
Dependent claims: 19, 20.
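The three elements of the device claim (a frequency store, a generating component, and a coefficient store) could be laid out as below. This is a hypothetical sketch of the data structures only: the class and function names are illustrative, and the fitting routine is passed in as a `fit` callable rather than implemented, since the claim does not fix how the coefficients are computed.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Sequence


@dataclass
class QueryFrequencyStore:
    """Maps each query to its frequencies, one entry per interval."""
    counts: Dict[str, List[int]] = field(default_factory=dict)

    def record(self, query: str, frequency: int) -> None:
        """Append the frequency observed for the next interval."""
        self.counts.setdefault(query, []).append(frequency)

    def frequencies(self, query: str) -> List[int]:
        """Return a copy of the stored frequencies for the query."""
        return list(self.counts.get(query, []))


@dataclass
class ArimaCoefficientStore:
    """Maps each query to the coefficient vector that represents it."""
    coefficients: Dict[str, List[float]] = field(default_factory=dict)

    def put(self, query: str, coeffs: Sequence[float]) -> None:
        self.coefficients[query] = list(coeffs)

    def get(self, query: str) -> List[float]:
        return self.coefficients[query]


def generate_coefficients(
    freq_store: QueryFrequencyStore,
    coeff_store: ArimaCoefficientStore,
    fit: Callable[[Sequence[int]], Sequence[float]],
) -> None:
    """Fit every stored query with `fit` and save the resulting coefficients."""
    for query in freq_store.counts:
        coeff_store.put(query, fit(freq_store.frequencies(query)))
```

Keeping the coefficient store separate from the raw frequencies means a query can be represented by a short fixed-length vector once fitting is done, which is the representation the claim describes.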
Specification