Method and system for identifying keywords for use in placing keyword-targeted advertisements
First Claim
1. A computer system for identifying phrases related to an item, comprising:
- a data store storing information from one or more documents related to the item;
at least one processor operable to access the information from the data store, and the at least one processor operable to cause the computer system to;
determine a result set including a plurality of documents from a corpus of documents, wherein each document in the plurality of documents of the result set is related to the item;
determine a first frequency of at least one word in the plurality of documents of the result set, wherein the first frequency corresponds to an average number of times that the at least one word appears in each document of the result set;
determine a second frequency of the at least one word in the plurality of documents of the corpus of documents, wherein the second frequency corresponds to an average number of times the at least one word appears in each document of the corpus of documents;
determine a frequency score that is based on a difference between the corresponding first frequency and the corresponding second frequency for the at least one word;
identify a set of highly related words to the item based on a threshold number of the at least one word having a highest frequency score;
select at least one anchor word from the set of highly related words; and
identify at least one phrase in the plurality of documents in the result set that contains the selected at least one anchor word by searching the plurality of documents in the search result set for the at least one anchor word and identifying at least one word proximal to the at least one anchor word within the plurality of documents.
0 Assignments
0 Petitions
Accused Products
Abstract
A method and system for identifying search terms for placing advertisements along with search results is provided. The advertisement system selects a description of an item that is to be advertised. The advertisement system then retrieves documents that match the selected description. The advertisement system generates a score for each word of the retrieved documents that indicates relatedness of the word to the item to be advertised. After generating the scores for the words, the advertisement system identifies phrases of the words within the documents that are related to the item. The advertisement system then generates search terms for the item to be advertised from the identified phrases. The advertisement system submits the search terms and an advertisement to a search engines service for placement of a paid-for advertisement for the item.
102 Citations
20 Claims
-
1. A computer system for identifying phrases related to an item, comprising:
-
a data store storing information from one or more documents related to the item; at least one processor operable to access the information from the data store, and the at least one processor operable to cause the computer system to; determine a result set including a plurality of documents from a corpus of documents, wherein each document in the plurality of documents of the result set is related to the item; determine a first frequency of at least one word in the plurality of documents of the result set, wherein the first frequency corresponds to an average number of times that the at least one word appears in each document of the result set; determine a second frequency of the at least one word in the plurality of documents of the corpus of documents, wherein the second frequency corresponds to an average number of times the at least one word appears in each document of the corpus of documents; determine a frequency score that is based on a difference between the corresponding first frequency and the corresponding second frequency for the at least one word; identify a set of highly related words to the item based on a threshold number of the at least one word having a highest frequency score; select at least one anchor word from the set of highly related words; and identify at least one phrase in the plurality of documents in the result set that contains the selected at least one anchor word by searching the plurality of documents in the search result set for the at least one anchor word and identifying at least one word proximal to the at least one anchor word within the plurality of documents. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computing-implemented method for identifying phrases related to an item, the method comprising:
-
determining a result set including a plurality of documents from a corpus of documents, wherein each document in the plurality of documents of the result set is related to the item; determining a first frequency of at least one word in the plurality of documents of the result set, wherein the first frequency corresponds to an average number of times that the at least one word appears in each document of the result set; determining a second frequency of the at least one word in the plurality of documents of the corpus of documents, wherein the second frequency corresponds to an average number of times the at least one word appears in each document of the corpus of documents; determining a frequency score that is based on a difference between the corresponding first frequency and the corresponding second frequency for the at least one word; identifying a set of highly related words to the item based on a threshold number of the at least one word having a highest frequency score; selecting at least one anchor word from the set of highly related words; and identifying at least one phrase in the plurality of documents in the result set that contains the selected at least one anchor word by searching the plurality of documents in the search result set for the at least one anchor word and identifying at least one word proximal to the at least one anchor word within the plurality of documents. - View Dependent Claims (8, 9, 10, 11, 12, 13)
-
-
14. A computer-readable storage medium having stored thereon instructions for causing one or more computing systems to perform a method of identifying phrases related to an item, the method comprising:
-
determining a result set including a plurality of documents from a corpus of documents, wherein each document in the plurality of documents of the result set is related to the item; determining a first frequency of at least one word in the plurality of documents of the result set, wherein the first frequency corresponds to an average number of times that the at least one word appears in each document of the result set; determining a second frequency of the at least one word in the plurality of documents of the corpus of documents, wherein the second frequency corresponds to an average number of times the at least one word appears in each document of the corpus of documents; determining a frequency score that is based on a difference between the corresponding first frequency and the corresponding second frequency for the at least one word; identifying a set of highly related words to the item based on a threshold number of the at least one word having a highest frequency score; selecting at least one anchor word from the set of highly related words; and identifying at least one phrase in the plurality of documents in the result set that contains the selected at least one anchor word by searching the plurality of documents in the search result set for the at least one anchor word and identifying at least one word proximal to the at least one anchor word within the plurality of documents. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification