Generating improved document classification data using historical search results
First Claim
1. A computer-implemented method, comprising:
- at a server system having one or more processors and memory,respectively accessing historical query information for queries having search results that correspond to first information items and second information items, wherein the first information items are initially classified and the second information items are initially unclassified;
accessing classification data of the first information items;
generating classification data for the initially unclassified information items based on the classification data of the first information items and the historical query information;
storing the generated classification data in the server system; and
providing customized services associated with the second information items to a plurality of client devices using the corresponding classification data stored in the server system;
wherein generating classification data for an initially unclassified information item includes;
identifying a set of queries in the historical query information, wherein at least a subset of the queries each has an associated search result corresponding to the initially unclassified information item;
generating classification data for the set of queries based on the classification data of the first information items and the historical query information for the set of queries; and
generating classification data for the initially unclassified information item by combining the generated classification data of the subset of the queries, each of which has an associated search result corresponding to the initially unclassified information item; and
wherein generating classification data for the set of queries includes;
for each of at least a subset of the queries,identifying a set of search results corresponding to the query and a set of the first information items corresponding to the set of search results;
weighting the classification data of the identified first information items in accordance with at least one of;
their respective predefined information retrieval scores, their corresponding search results'"'"' positions in the set of search results, and user interaction with the corresponding search results; and
aggregating the weighted classification data of the identified first information items as the query'"'"'s classification data.
4 Assignments
0 Petitions
Accused Products
Abstract
A server system accesses, respectively, historical query information for queries that have search results corresponding to first information items and second information items and classification data of the first information items. Initially, the first information items are classified and the second information items are unclassified. Based on the classification data of the first information items and the historical query information, the server system generates classification data for the second information items and stores the generated classification data therein. In response to requests for service from client devices, the server system provides customized services to the client devices using the second information items and the corresponding classification data generated for the second information items.
88 Citations
36 Claims
-
1. A computer-implemented method, comprising:
-
at a server system having one or more processors and memory, respectively accessing historical query information for queries having search results that correspond to first information items and second information items, wherein the first information items are initially classified and the second information items are initially unclassified; accessing classification data of the first information items; generating classification data for the initially unclassified information items based on the classification data of the first information items and the historical query information; storing the generated classification data in the server system; and providing customized services associated with the second information items to a plurality of client devices using the corresponding classification data stored in the server system; wherein generating classification data for an initially unclassified information item includes; identifying a set of queries in the historical query information, wherein at least a subset of the queries each has an associated search result corresponding to the initially unclassified information item; generating classification data for the set of queries based on the classification data of the first information items and the historical query information for the set of queries; and generating classification data for the initially unclassified information item by combining the generated classification data of the subset of the queries, each of which has an associated search result corresponding to the initially unclassified information item; and wherein generating classification data for the set of queries includes; for each of at least a subset of the queries, identifying a set of search results corresponding to the query and a set of the first information items corresponding to the set of search results; weighting the classification data of the identified first information items in accordance with at least one of;
their respective predefined information retrieval scores, their corresponding search results'"'"' positions in the set of search results, and user interaction with the corresponding search results; andaggregating the weighted classification data of the identified first information items as the query'"'"'s classification data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer-implemented method, comprising:
-
at a server system having one or more processors and memory, respectively accessing historical query information for queries having search results that correspond to first information items and second information items, wherein the first information items are initially classified and the second information items are initially unclassified; accessing classification data of the first information items; generating classification data for the initially unclassified information items based on the classification data of the first information items and the historical query information; storing the generated classification data in the server system; and providing customized services associated with the second information items to a plurality of client devices using the corresponding classification data stored in the server system; wherein generating classification data for an initially unclassified information item includes; identifying a set of queries in the historical query information, wherein at least a subset of the queries each has an associated search result corresponding to the initially unclassified information item; generating classification data for the set of queries based on the classification data of the first information items and the historical query information for the set of queries; and generating classification data for the initially unclassified information item by combining the generated classification data of the subset of queries, each of which has an associated search result corresponding to the initially unclassified information item; and wherein generating classification data for the initially unclassified information item by combining the generated classification data of the subset of queries includes; for each query of the subset of queries, weighting the classification data of the query in accordance with at least one of;
the initially unclassified information item'"'"'s predefined information retrieval score, a search result position of a search result corresponding to the initially unclassified information item in a set of search results for the query, and user interaction with the corresponding search result; andaggregating the weighted classification data of the subset of queries as the initially unclassified information item'"'"'s classification data. - View Dependent Claims (12)
-
-
13. A computer system, comprising:
-
one or more processors; memory; and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including; instructions for respectively accessing historical query information for queries having search results that correspond to first information items and second information items, wherein the first information items are initially classified and the second information items are initially unclassified; instructions for accessing classification data of the first information items; instructions for generating classification data for the second information items based on the classification data of the first information items and the historical query information; instructions for storing the generated classification data in the server system; and instructions for providing customized services associated with the second information items to a plurality of client devices using the corresponding classification data stored in the server system; wherein the instructions for generating classification data for an initially unclassified information item include; instructions for identifying a set of queries in the historical query information, wherein at least a subset of the queries each has an associated search result corresponding to the initially unclassified information item; instructions for generating classification data for the set of queries based on the classification data of the first information items and the historical query information for the set of queries; and instructions for generating classification data for the initially unclassified information item by combining the generated classification data of the subset of queries, each of which has an associated search result corresponding to the initially unclassified information item; and wherein the instructions for generating classification data for the set of queries include; for each of at least a subset of the queries, instructions for identifying a set of search results corresponding to each of at least a subset of the queries and a set of the first information items corresponding to the set of search results; instructions for weighting the classification data of the identified first information items in accordance with at least one of;
their respective predefined information retrieval scores, their corresponding search results'"'"' positions in the set of search results, and user interaction with the corresponding search results; andinstructions for aggregating the weighted classification data of the identified first information items as the query'"'"'s classification data. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A computer system, comprising:
-
one or more processors; memory; and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including instructions for; respectively accessing historical query information for queries having search results that correspond to first information items and second information items, wherein the first information items are initially classified and the second information items are initially unclassified; accessing classification data of the first information items; generating classification data for the second information items based on the classification data of the first information items and the historical query information; storing the generated classification data in the server system; and providing customized services associated with the second information items to a plurality of client devices using the corresponding classification data stored in the server system; wherein the instructions for generating classification data for an initially unclassified information item include instructions for; identifying a set of queries in the historical query information, wherein at least a subset of the queries each has an associated search result corresponding to the initially unclassified information item; generating classification data for the set of queries based on the classification data of the first information items and the historical query information for the set of queries; and generating classification data for the initially unclassified information item by combining the generated classification data of the subset of queries, each of which has an associated search result corresponding to the initially unclassified information item; and wherein the instructions for generating classification data for the initially unclassified information item by combining the generated classification data of the subset of queries include instructions for; for each query of the subset of queries, weighting the classification data of the query in accordance with at least one of;
the initially unclassified information item'"'"'s predefined information retrieval score, a search result position of a search result corresponding to the initially unclassified information item in a set of search results for the query, and user interaction with the corresponding search result; andaggregating the weighted classification data of the subset of queries as the initially unclassified information item'"'"'s classification data. - View Dependent Claims (24)
-
-
25. A non-transitory computer readable storage medium and one or more computer programs embedded therein, the one or more computer programs comprising instructions which, when executed by a computer system, cause the computer system to:
-
respectively access historical query information for queries having search results that correspond to first information items and second information items, wherein the first information items are initially classified and the second information items are initially unclassified; access classification data of the first information items; generate classification data for the second information items based on the classification data of the first information items and the historical query information; store the generated classification data in the server system; and provide customized services associated with the second information items to a pluralit of client devices using the corresponding classification data stored in the server system; wherein the instructions to generate classification data for an initially unclassified information item include instructions to; identify a set of queries in the historical query information, wherein at least a subset of the queries each has an associated search result corresponding to the initially unclassified information item; generate classification data for the set of queries based on the classification data of the first information items and the historical query information for the set of queries; and generate classification data for the initially unclassified information item by combining the generated classification data of the subset of queries, each of which has an associated search result corresponding to the initially unclassified information item; and wherein the instructions to generate classification data for the set of queries include instructions to; for each of at least a subset of the queries, identify a set of search results corresponding to each of at least a subset of the queries and a set of the first information items corresponding to the set of search results; weight the classification data of the identified first information items in accordance with at least one of;
their respective predefined information retrieval scores, their corresponding search results'"'"' positions in the set of search results, and user interaction with the corresponding search results; andaggregate the weighted classification data of the identified first information items as the query'"'"'s classification data. - View Dependent Claims (26, 27, 28, 29, 30, 31, 32, 33, 34)
-
-
35. A non-transitory computer readable storage medium and one or more computer programs embedded therein, the one or more computer programs comprising instructions which, when executed by a computer system, cause the computer system to:
-
respectively access historical query information for queries having search results that correspond to first information items and second information items, wherein the first information items are initially classified and the second information items are initially unclassified; access classification data of the first information items; generate classification data for the second information items based on the classification data of the first information items and the historical query information; store the generated classification data in the server system; and provide customized services associated with the second information items to a plurality of client devices using the corresponding classification data stored in the server system; wherein the instructions to generate classification data for an initially unclassified information item include instructions to; identify a set of queries in the historical query information, wherein at least a subset of the queries each has an associated search result corresponding to the initially unclassified information item; generate classification data for the set of queries based on the classification data of the first information items and the historical query information for the set of queries; and generate classification data for the initially unclassified information item by combining the generated classification data of the subset of queries, each of which has an associated search result corresponding to the initially unclassified information item; and wherein the instructions to generate classification data for the initially unclassified information item by combining the generated classification data of the subset of queries include instructions to; for each query of the subset of queries, weight the classification data of the query in accordance with at least one of;
the initially unclassified information item'"'"'s predefined information retrieval score, a search result position of a search result corresponding to the initially unclassified information item in a set of search results for the query, and user interaction with the corresponding search result; andaggregate the weighted classification data of the subset of queries as the initially unclassified information item'"'"'s classification data. - View Dependent Claims (36)
-
Specification