Method and system for identifying keywords for use in placing keyword-targeted advertisements
First Claim
1. A computer-implemented method in a computer system for identifying phrases related to an item from a set of one or more documents, the method comprising:
- under control of one or more computer systems configured with executable instructions, receiving a description of the item;
performing a search across a corpus of documents to retrieve a search result set of a plurality of documents, wherein each document in the plurality of documents in the search result set is related to the item based on the description, and wherein each document in the plurality of documents in the search result set comprises a respective plurality of words;
determining a first frequency of each word in the plurality of documents of the search result set, wherein the first frequency corresponds to an average number of times each word appears in each document of the search result set;
determining a second frequency of each word in the plurality of documents of the search result set, wherein the second frequency corresponds to an average number of times each word appears in each document of the corpus of documents;
for each word of the plurality of words, determining a frequency score that is based on a difference between the corresponding first frequency and the corresponding second frequency, wherein the frequency score indicates a level of relevance of the word to the item;
identifying a set of highly related words to the item based on a threshold number of the plurality of words with the highest frequency scores;
selecting at least one anchor word from the set of highly related words; and
identifying at least one phrase including the at least one anchor word by searching each of the plurality of documents in the search result set for the at least one anchor word and identifying at least one word proximal to the at least one anchor word within the plurality of documents.
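The frequency-scoring steps of the claim can be illustrated with a minimal sketch. It assumes documents are already tokenized into lists of lowercase words and that the frequency score is the plain difference between the two averages; the claim only requires a score "based on" that difference. The function names `average_frequency` and `top_related_words` are hypothetical and do not come from the patent.

```python
from collections import Counter


def average_frequency(docs):
    """Average number of times each word appears per document in `docs`.

    `docs` is a list of token lists (an assumption of this sketch).
    """
    totals = Counter()
    for words in docs:
        totals.update(words)
    n = len(docs)
    return {word: count / n for word, count in totals.items()}


def top_related_words(result_docs, corpus_docs, threshold):
    """Score each word in the result set and keep the `threshold` best.

    The score is first frequency (result set) minus second frequency
    (whole corpus); a simple difference is assumed here.
    """
    first = average_frequency(result_docs)
    second = average_frequency(corpus_docs)
    scores = {w: f - second.get(w, 0.0) for w, f in first.items()}
    # Highest score first; ties broken alphabetically for determinism.
    ranked = sorted(scores, key=lambda w: (-scores[w], w))
    return ranked[:threshold], scores


corpus = [
    ["red", "running", "shoes", "for", "the", "trail"],
    ["trail", "running", "shoes", "and", "the", "socks"],
    ["the", "history", "of", "the", "piano"],
    ["the", "best", "soup", "for", "the", "winter"],
]
results = corpus[:2]  # pretend the search returned the first two documents
top_words, scores = top_related_words(results, corpus, threshold=3)
print(top_words)
```

In this toy corpus a common word such as "the" appears more often per document in the corpus than in the result set, so its score is negative and it drops out of the highly related set.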
Abstract
A method and system for identifying search terms for placing advertisements along with search results is provided. The advertisement system selects a description of an item that is to be advertised. The advertisement system then retrieves documents that match the selected description. The advertisement system generates a score for each word of the retrieved documents that indicates relatedness of the word to the item to be advertised. After generating the scores for the words, the advertisement system identifies phrases of the words within the documents that are related to the item. The advertisement system then generates search terms for the item to be advertised from the identified phrases. The advertisement system submits the search terms and an advertisement to a search engine service for placement of a paid-for advertisement for the item.
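The phrase-identification step can be sketched as follows. This is only an illustration under stated assumptions: documents are pre-tokenized word lists, "proximal" is taken to mean the immediately adjacent word, and candidate phrases are two-word phrases. The function name `phrases_with_anchor` is hypothetical; the abstract does not specify how phrases are formed or ranked.

```python
from collections import Counter


def phrases_with_anchor(docs, anchor):
    """Count two-word phrases that contain `anchor` in the given documents.

    Each occurrence of the anchor contributes the phrase formed with the
    word before it and the phrase formed with the word after it, when
    those neighbors exist. Adjacency as the proximity rule is an
    assumption of this sketch.
    """
    counts = Counter()
    for words in docs:
        for i, word in enumerate(words):
            if word != anchor:
                continue
            if i > 0:
                counts[f"{words[i - 1]} {word}"] += 1
            if i + 1 < len(words):
                counts[f"{word} {words[i + 1]}"] += 1
    return counts


retrieved = [
    ["red", "running", "shoes", "for", "the", "trail"],
    ["trail", "running", "shoes", "and", "the", "socks"],
]
candidates = phrases_with_anchor(retrieved, "running")
# The most frequent phrases could then serve as candidate search terms.
print(candidates.most_common(3))
```

Counting how often each phrase occurs gives one simple way to prefer phrases that recur across the retrieved documents; other ranking schemes would fit the abstract equally well.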
21 Claims
1. A computer-implemented method in a computer system for identifying phrases related to an item from a set of one or more documents, the method comprising:
under control of one or more computer systems configured with executable instructions, receiving a description of the item;
performing a search across a corpus of documents to retrieve a search result set of a plurality of documents, wherein each document in the plurality of documents in the search result set is related to the item based on the description, and wherein each document in the plurality of documents in the search result set comprises a respective plurality of words;
determining a first frequency of each word in the plurality of documents of the search result set, wherein the first frequency corresponds to an average number of times each word appears in each document of the search result set;
determining a second frequency of each word in the plurality of documents of the search result set, wherein the second frequency corresponds to an average number of times each word appears in each document of the corpus of documents;
for each word of the plurality of words, determining a frequency score that is based on a difference between the corresponding first frequency and the corresponding second frequency, wherein the frequency score indicates a level of relevance of the word to the item;
identifying a set of highly related words to the item based on a threshold number of the plurality of words with the highest frequency scores;
selecting at least one anchor word from the set of highly related words; and
identifying at least one phrase including the at least one anchor word by searching each of the plurality of documents in the search result set for the at least one anchor word and identifying at least one word proximal to the at least one anchor word within the plurality of documents.
Dependent Claims: 2, 3, 4, 5, 6, 7, 18, 19
8. A non-transitory computer-readable storage medium having stored thereon instructions identifying phrases related to an item from a set of one or more information sources, the instructions upon being executed by one or more processors causing one or more computer systems to:
receive a description of the item;
perform a search across a corpus of documents to retrieve a search result set of a plurality of documents, wherein each document in the plurality of documents is related to the item based on the description, and wherein each document in the plurality of documents in the search result set comprises a respective plurality of words;
determine a first frequency of each word in the plurality of documents of the search result set, wherein the first frequency corresponds to an average number of times the word appears in each document of the search result set;
determine a second frequency of each word in the plurality of documents in the search result set, wherein the second frequency corresponds to an average number of times the word appears in each document of the corpus of documents;
for each word of the plurality of words, determine a frequency score that is based on a difference between the corresponding first frequency and the corresponding second frequency, wherein the frequency score indicates a level of relevance of the word to the item;
identify a predetermined number of highly related words to the item based on the predetermined number of words with the highest frequency scores;
select at least one anchor word from the predetermined number of highly related words; and
identify at least one phrase including the at least one anchor word by searching each of the plurality of documents in the search result set for the at least one anchor word and identifying at least one word proximal to the at least one anchor word within the plurality of documents.
Dependent Claims: 9, 10, 11, 12, 20
13. A computer system for identifying phrases related to an item from a set of one or more documents, comprising:
at least one processor;
memory including instructions that, upon being executed by the at least one processor, cause the computer system to:
receive a description of the item;
perform a search across a corpus of documents and retrieve a search result set of a plurality of documents, wherein each document in the plurality of documents is related to the item based on the description, and wherein each document in the plurality of documents in the search result set comprises a respective plurality of words;
determine a first frequency of each word in the plurality of documents in the search result set, wherein the first frequency corresponds to an average number of times the word appears in each document of the search result set;
determine a second frequency of each word in the plurality of documents in the search result set, wherein the second frequency corresponds to an average number of times the word appears in each document of the corpus of documents;
determine a frequency score that is based on a difference between the corresponding first frequency and the corresponding second frequency of each word, wherein the frequency score indicates a level of relevance of the word to the item;
identify a predetermined number of highly related words to the item based on the predetermined number of words with the highest frequency scores;
select at least one anchor word from the predetermined number of highly related words; and
identify at least one phrase including the at least one anchor word by searching each of the plurality of documents in the search result set for the at least one anchor word and identifying at least one word proximal to the at least one anchor word within the plurality of documents.
Dependent Claims: 14, 15, 16, 17, 21
Specification