Systems and methods to tune a general-purpose search engine for a search entry point
First Claim
1. A system that refines a general-purpose search engine, comprising:
- a memory;
a processor;
a component, stored in the memory and executed by the processor, that identifies an entry point that includes a link utilized to access the general-purpose search engine; and
a tuning component, stored in the memory and executed by the processor, that receives search query results of the general-purpose search engine and filters the search results based at least on criteria associated with the entry point through which the general-purpose search engine was accessed, the criteria comprises at least a first set of data categorized as relevant to a user'"'"'s context and a second set of data categorized as non-relevant to the user'"'"'s context, wherein user selection of a query result from a ranked list of the query results causes the selected result to be added to the first set of data and causes the results not selected by the user but ranked higher than the selected result to be automatically added to the second set of data, the first and second sets of data persisted to a computer-readable storage medium.
2 Assignments
0 Petitions
Accused Products
Abstract
The present invention provides systems and methods that employ statistical filtering and ranking techniques to improve content search engine search results by tuning a general-purpose search engine for an entry point for a group of users. The filter can be manually and/or automatically configured via providing training sets of relevant and non-relevant data. For example, a relevant set of data comprising web pages associated with a desired search context for the group utilizing the entry point can be employed. In addition, a non-relevant set of data comprising random and unrelated documents can be employed. The learned filter can then be employed to compare a returned result with the known relevant and the known non-relevant data sets to determine whether the result is relevant to the user for the entry point, and the degree of relevance. The results can then be presented to the user based on the relevance.
17 Citations
42 Claims
-
1. A system that refines a general-purpose search engine, comprising:
- a memory;
a processor;a component, stored in the memory and executed by the processor, that identifies an entry point that includes a link utilized to access the general-purpose search engine; and a tuning component, stored in the memory and executed by the processor, that receives search query results of the general-purpose search engine and filters the search results based at least on criteria associated with the entry point through which the general-purpose search engine was accessed, the criteria comprises at least a first set of data categorized as relevant to a user'"'"'s context and a second set of data categorized as non-relevant to the user'"'"'s context, wherein user selection of a query result from a ranked list of the query results causes the selected result to be added to the first set of data and causes the results not selected by the user but ranked higher than the selected result to be automatically added to the second set of data, the first and second sets of data persisted to a computer-readable storage medium. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
- a memory;
-
13. A system that tunes a general-purpose search engine, comprising:
- a memory;
a processor;a filter component, stored in the memory and executed by the processor, that receives search query results of a general-purpose search engine and parses relevant and non-relevant results based on training data associated with the entry point that provides a link employed to traverse to the general-purpose search engine, the training data comprises a first set of data categorized as relevant to a search context of a user for the entry point and a second set of data categorized as non-relevant to the search context of the user; and a ranking component, stored in the memory and executed by the processor;
that sorts the filtered results in accordance with the training data for presentation to a user, wherein a user clicking a link associated with a search result from the sorted results causes the result to be added to the first set of data and causes the results whose links were not clicked by the user but that are ranked higher than the clicked result to be automatically added to the second set of data, the first and second sets of data persisted to a computer-readable storage medium. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21)
- a memory;
-
22. A computer-implemented method to filter and rank general-purpose search engine results based on criteria associated with an entry point, comprising:
-
executing a query search with the general-purpose search engine accessed through a link associated with the entry point; filtering the general-purpose search engine results by tuning the general-purpose search engine based on a set of training data associated with the entry point employed to access the general purpose search engine; ranking the filtered general-purpose search engine results; automatically storing a first query result selected by a user in a first data set categorized as relevant; automatically storing at least one non-selected query result that is ranked higher than the first query result in a second data set categorized as non-relevant upon selection of the first query result; and including the first data set and second data set in the set of training data associated with the entry point employed to access the general purpose search engine. - View Dependent Claims (23, 24, 25, 26, 27, 28)
-
-
29. A computer-implemented method to customize a general-purpose search engine to improve context search query results, comprising:
-
tuning a general-purpose search engine for an entry point by employing a method further comprising; providing a first set of data categorized as relevant that is used by a component to discern query results relevant to a search context of a user employing the entry point, the entry point provides a link employed to access the general-purpose search engine; providing a second set of data categorized as non-relevant that is used by the component to discern query results unrelated to the search context, the first set of data and the second set of data are manually provided; determining whether a query result is relevant or non-relevant to the search context based on the first set of relevant data and the second set of non-relevant data, each query result is compared with both the first set of data and second set of data to determine the relevance of the query result; executing a search query with the general purpose search engine to obtain a ranked list of query results; selecting a link associated with a query result from the list; automatically adding the selected query result to the first set of data; and automatically adding non-selected results from the list that are ranked higher than the selected query result to the second set of data upon selection of the selected query result. - View Dependent Claims (30, 31, 32, 33)
-
-
34. A computer-implemented method to automatically customize a general-purpose search engine for an entry point, comprising:
-
identifying the entry point; executing a query search via the entry point that includes a link employed to route to the general-purpose search engine; recording a first query result from a ranked list of query results returned from the executed query as relevant when a user views the document associated with the first query result; recording at least one second query result whose associated document was not viewed by the user but that is ranked higher than the first query result as non-relevant when the first result is selected for viewing by the user; and providing the recorded results to automatically train the filter for the entry point, in order to discriminate between results relevant to a search context of the user for the entry point and results non-relevant to the search context. - View Dependent Claims (35, 36, 37, 38, 39, 40)
-
-
41. A computer readable storage device storing computer executable components that tunes a general-purpose search engine to improve context search query results, comprising:
-
a component that, when executed by a processor, receives search query results of a general-purpose search engine and filters the results based on training data sets associated with the search context of a user depending on the entry point that provides a link utilized to arrive at the general-purpose search engine, the training data sets include at least a first category of data explicitly defined to be relevant to the search context and a second category of data explicitly defined to be non-relevant to the search context; and a component that, when executed by a processor, ranks the filtered general-purpose search engine results according to the similarity of the search engine results to the training data sets, wherein selecting a link associated with a first search result from the ranked results causes the first result to be added to the first set of data and causes results that are ranked higher than the first result and have not been selected by the user to be automatically added to the second set of data.
-
-
42. A system that receives, filters and ranks general-purpose search engine results, comprising:
- a memory;
a processor;a component, stored in the memory and executed by the processor, for filtering general-purpose search engine results by determining whether a query result is relevant to a search context of a group of users, the search context is associated with an entry point that includes a link employed to navigate to the general-purpose search engine, the search context further having an associated first set of training data categorized as relevant to the context and an associated second set of training data categorized as non-relevant to the context; and a component, stored in the memory and executed by the processor, for ranking the filtered general-purpose search engine results based on a relevance of the general-purpose search engine results to the search context of the group of users and the entry point as determined by a comparison of the search engine results with the first and second sets of training data, wherein a user viewing a document associated with a first search result from the ranked results causes the first result to be added to the first set of training data and causes the results that are unviewed but ranked higher than the first result to be automatically added to the second set of training data, the first and second sets of training data stored on a computer-readable storage medium.
- a memory;
Specification