Authoritative document identification
First Claim
1. A computer implemented method for identifying an authoritative web page corresponding to a business, the method comprising:
identifying, using a processor associated with the computer, a plurality of candidate web pages that are associated with the business;
identifying, using the processor, a plurality of signals respectively associated with each of the plurality of candidate web pages;
determining, using the processor, an authoritative score for each of the plurality of candidate web pages based on the plurality of signals respectively associated with each of the plurality of candidate web pages, the authoritative score for each web page being determined based on one or more of:
  a number of outlinks, in one or more of the plurality of candidate web pages, that point to the web page,
  a match between anchor text associated with the outlinks that point to the web page and a name of the business,
  a match between a title of the web page and the name of the business,
  a number of geographic locations identified in the web page, or
  a match between a domain name associated with the web page and the name of the business,
determining the authoritative score for each web page of the plurality of candidate web pages including:
  weighting and combining a respective plurality of scores associated with the respective plurality of signals for each web page to determine the authoritative score for each web page; and
identifying, using the processor, a particular web page of the plurality of candidate web pages as an authoritative web page for the business, the particular web page having a highest authoritative score of the authoritative scores for the plurality of candidate web pages.
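The steps recited in claim 1 can be sketched in Python as follows. This is an illustrative sketch only, not the patented implementation. The claim gives no weights, score ranges, or matching rules, so the weight values, the substring-based name matching, the cap of 10 on the link count, and the choice to score pages with fewer geographic locations higher are all assumptions. The names CandidatePage, signal_scores, authoritative_score, and pick_authoritative are hypothetical.

```python
import re
from dataclasses import dataclass, field
from typing import Dict, List
from urllib.parse import urlparse

# Hypothetical weights; the claim only says the signal scores are
# "weighted and combined", not how.
WEIGHTS: Dict[str, float] = {
    "inlinks": 0.30,
    "anchor_match": 0.25,
    "title_match": 0.20,
    "geo_locations": 0.10,
    "domain_match": 0.15,
}


@dataclass
class CandidatePage:
    url: str
    title: str
    # Anchor text of each link, found in other candidate pages,
    # that points to this page.
    inbound_anchor_texts: List[str] = field(default_factory=list)
    # Number of geographic locations identified in the page.
    geo_location_count: int = 0


def _normalize(text: str) -> str:
    """Lowercase and drop everything except letters and digits."""
    return re.sub(r"[^a-z0-9]", "", text.lower())


def signal_scores(page: CandidatePage, business_name: str) -> Dict[str, float]:
    """Map each claimed signal to a score in [0, 1] (scaling is assumed)."""
    name = _normalize(business_name)
    anchors = page.inbound_anchor_texts
    n_links = len(anchors)
    anchor_hits = sum(1 for a in anchors if name in _normalize(a))
    domain = urlparse(page.url).netloc
    geo = page.geo_location_count
    return {
        # Assumption: link count saturates at 10.
        "inlinks": min(n_links, 10) / 10,
        "anchor_match": anchor_hits / n_links if n_links else 0.0,
        "title_match": 1.0 if name in _normalize(page.title) else 0.0,
        # Assumption: a page listing many locations is less specific
        # to one business, so its score falls as the count grows.
        "geo_locations": 1.0 / geo if geo else 0.0,
        "domain_match": 1.0 if name in _normalize(domain) else 0.0,
    }


def authoritative_score(page: CandidatePage, business_name: str) -> float:
    """Weight and combine the per-signal scores into one score."""
    scores = signal_scores(page, business_name)
    return sum(WEIGHTS[key] * value for key, value in scores.items())


def pick_authoritative(
    candidates: List[CandidatePage], business_name: str
) -> CandidatePage:
    """Return the candidate with the highest authoritative score."""
    return max(candidates, key=lambda p: authoritative_score(p, business_name))
```

A caller would build one CandidatePage per candidate and pass the list, together with the business name, to pick_authoritative. A linear weighted sum is used here because it is the simplest reading of "weighting and combining"; other combinations would fit the claim language equally well.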
Abstract
A system determines documents that are associated with a location, identifies a group of signals associated with each of the documents, and determines authoritativeness of the documents for the location based on the signals.
20 Claims
7. A system to identify an authoritative web page corresponding to a business, the system comprising:
one or more computers configured to:
identify a plurality of candidate web pages that are associated with the business;
identify a plurality of signals respectively associated with each of the plurality of candidate web pages;
determine an authoritative score for each of the plurality of candidate web pages based on the plurality of signals respectively associated with each of the plurality of candidate web pages, the authoritative score for each web page being determined based on one or more of:
  a number of outlinks in one or more of the plurality of candidate web pages that point to the web page,
  a match between anchor text associated with the outlinks that point to the web page and a name of the business,
  a match between a title of the web page and the name of the business,
  a number of geographic locations identified in the web page, or
  a match between a domain name associated with the web page and the name of the business,
when determining the authoritative score for each candidate web page of the plurality of candidate web pages, the one or more computers being to:
  weight and combine a respective plurality of scores associated with the respective plurality of signals for each candidate web page to determine the authoritative score for each candidate web page; and
identify a particular web page of the plurality of candidate web pages as an authoritative web page for the business, the particular web page having a highest authoritative score of the authoritative scores for the plurality of candidate web pages.
14. A non-transitory computer-readable medium to store instructions, the instructions comprising:
one or more instructions which, when executed by a processor, cause the processor to identify a plurality of candidate web pages that are associated with a business;
one or more instructions which, when executed by the processor, cause the processor to identify a plurality of signals respectively associated with each of the plurality of candidate web pages;
one or more instructions which, when executed by the processor, cause the processor to determine an authoritative score for each of the plurality of candidate web pages based on the respective plurality of signals, the authoritative score for each web page being determined based on one or more of:
  a number of outlinks in one or more of the plurality of candidate web pages that point to the web page,
  a match between anchor text associated with the outlinks that point to the web page and a name of the business,
  a match between a title of the web page and the name of the business,
  a number of geographic locations identified in the web page, or
  a match between a domain name associated with the web page and the name of the business, and
the one or more instructions to determine the authoritative score for each web page of the plurality of candidate web pages, when executed by the processor, further cause the processor to:
  weight and combine a respective plurality of scores associated with the respective plurality of signals for each candidate web page to determine the authoritative score for each candidate web page; and
one or more instructions which, when executed by the processor, cause the processor to identify a particular web page of the plurality of candidate web pages as an authoritative web page for the business, the particular web page having a highest authoritative score of the authoritative scores for the plurality of candidate web pages.
Specification