METHODS AND APPARATUS FOR CHARACTERIZING A SEARCH RESULT AS POTENTIAL SPAM
First Claim
1. A method of characterizing a search result as potential spam, the method comprising:
- receiving a search result from a search engine, the search result being based on a search query;
comparing data indicative of the search result to data indicative of the search query to determine a spam score, the spam score being based on at least one of a character length and a word length of a longest matching substring, the longest matching substring appearing in the search result and the search query;
comparing the spam score to a threshold; and
characterizing the search result as potential spam if the spam score crosses the threshold.
8 Assignments
0 Petitions
Accused Products
Abstract
Methods and apparatus assessing, ranking, organizing, and presenting search results associated with a user'"'"'s current work context are disclosed. The system disclosed assesses, ranks, organizes and presents search results against a user'"'"'s current work context by comparing statistical and heuristic models of the search results to a statistical and heuristic model of the user'"'"'s current work context. In this manner, search results are assessed, ranked, organized, and/or presented with the benefit of attributes of the user'"'"'s current work context that are predictive of relevance, such as words in a user'"'"'s document (e.g., web page or word processing document) that may not have been included in the search query. In addition, search results from multiple search engines are combined into an organization scheme that best reflects the user'"'"'s current task. As a result, lists of search results from different search engines can be more usefully presented to the user.
-
Citations
33 Claims
-
1. A method of characterizing a search result as potential spam, the method comprising:
-
receiving a search result from a search engine, the search result being based on a search query;
comparing data indicative of the search result to data indicative of the search query to determine a spam score, the spam score being based on at least one of a character length and a word length of a longest matching substring, the longest matching substring appearing in the search result and the search query;
comparing the spam score to a threshold; and
characterizing the search result as potential spam if the spam score crosses the threshold. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. An apparatus for characterizing a search result as potential spam, the apparatus comprising:
-
a processor;
a memory device operatively coupled to the processor; and
a network device operatively coupled to the processor;
wherein the memory device stores a software program to cause the processor to;
receive a search result from a search engine via the network device, the search result being based on a search query;
compare data indicative of the search result to data indicative of the search query to determine a spam score, the spam score being based on at least one of a character length and a word length of a longest matching substring, the longest matching substring appearing in the search result and the search query;
compare the spam score to a threshold; and
characterize the search result as potential spam if the spam score crosses the threshold. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 31, 32, 33)
-
-
26. A computer readable medium storing a software program to cause a computing device to:
-
receive a search result from a search engine via the network device, the search result being based on a search query;
compare data indicative of the search result to data indicative of the search query to determine a spam score, the spam score being based on at least one of a character length and a word length of a longest matching substring, the longest matching substring appearing in the search result and the search query;
compare the spam score to a threshold; and
characterize the search result as potential spam if the spam score crosses the threshold. - View Dependent Claims (27, 28, 29, 30)
-
Specification