Multiple correlation measures for measuring query similarity
First Claim
1. A computer-implemented method performed by data processing apparatus, the method comprising:
- receiving a first query and a plurality of second queries;
determining a temporal correlation score between the first query and each second query based on a comparison of a temporal series of occurrences of elements of the first query in a first corpus comprising a first document of a first document type and a temporal series of occurrences of elements of the second query in a second different textual corpus comprising a second document of a second document type that differs from the first document type, wherein the comparison is based on the first document and the second document having timestamps in a same time period;
computing a similarity score for the first query and a second query, the similarity score between the first query and a second query being computed based on the temporal correlation score between the first document and the second document; and
ranking the second query according to the similarity score.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining query suggestions from multiple correlation measures. In one aspect, a method includes receiving a first query and second queries, each of the first and second queries including one or more terms; for each second query and a linear model, receiving correlation scores measuring the correlation between the first query and the respective second query, each correlation score received from a respective correlation process, and each respective correlation process being different from the other respective correlation processes, and applying the linear model to the plurality of correlation scores to determine a combined correlation score that quantifies a combined correlation between the first query and the respective second query based on the plurality of correlation scores. The second queries are ranked in an order according to their respective combined correlations scores.
14 Citations
20 Claims
-
1. A computer-implemented method performed by data processing apparatus, the method comprising:
-
receiving a first query and a plurality of second queries; determining a temporal correlation score between the first query and each second query based on a comparison of a temporal series of occurrences of elements of the first query in a first corpus comprising a first document of a first document type and a temporal series of occurrences of elements of the second query in a second different textual corpus comprising a second document of a second document type that differs from the first document type, wherein the comparison is based on the first document and the second document having timestamps in a same time period; computing a similarity score for the first query and a second query, the similarity score between the first query and a second query being computed based on the temporal correlation score between the first document and the second document; and ranking the second query according to the similarity score. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system comprising:
-
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; receiving a first query and a plurality of second queries; determining a temporal correlation score between the first query and each second query based on a comparison of a temporal series of occurrences of elements of the first query in a first corpus comprising a first document of a first document type and a temporal series of occurrences of elements of the second query in a second different textual corpus comprising a second document of a second document type that differs from the first document type, wherein the comparison is based on the first document and the second document having timestamps in a same time period; computing a similarity score for the first query and a second query, the similarity score between the first query and a second query being computed based on the temporal correlation score between the first document and the second document; and ranking the second query according to the similarity score. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer program product, encoded on one or more non-transitory computer storage media, comprising instructions that when executed by one or more computers cause the one or more computers to perform operations comprising:
-
receiving a first query and a plurality of second queries; determining a temporal correlation score between the first query and each second query based on a comparison of a temporal series of occurrences of elements of the first query in a first corpus comprising a first document of a first document type and a temporal series of occurrences of elements of the second query in a second different textual corpus comprising a second document of a second document type that differs from the first document type, wherein the comparison is based on the first document and the second document having timestamps in a same time period; computing a similarity score for the first query and a second query, the similarity score between the first query and a second query being computed based on the temporal correlation score between the first document and the second document; and ranking the second query according to the similarity score. - View Dependent Claims (18, 19, 20)
-
Specification