Internet profile service
Abstract
Systems and methods for creating and using a domain profile include identifying a status of a first page associated with a domain. The first page is retrieved and additional pages from the domain are identified based on hyperlinks from the first page. The status of the additional pages is identified and the hyperlinks are prioritized based on the status and/or a comparison with predetermined data. Content is extracted from the first page and selected pages from among the additional pages. The specific additional pages may be selected based on the prioritization. The retrieved content may be processed through a signature marker set to determine a contextual match. A purpose of the domain is determined according to the status of the first page, the status of the additional pages and results of the processing of the content. The domain profile can be displayed, stored, sent and/or searched to identify web sites or attributes of interest.
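The prioritization and signature-matching steps in the abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the abstract does not list the signature markers, say how a page status is represented, or define the ranking rule, so `SIGNATURE_MARKERS`, `PRIORITY_KEYWORDS`, the `Link` record, and the use of an HTTP-style status code are all hypothetical.

```python
import re
from dataclasses import dataclass

# Hypothetical signature marker set (label -> pattern); not taken from the patent.
SIGNATURE_MARKERS = {
    "shopping": re.compile(r"\b(add to cart|checkout)\b", re.I),
    "parking": re.compile(r"\b(domain (is )?for sale|sponsored listings)\b", re.I),
}

# Hypothetical predetermined data used to rank hyperlinks.
PRIORITY_KEYWORDS = ("about", "contact", "products")


@dataclass
class Link:
    url: str
    status: int  # assumed to be an HTTP-style status code for the linked page


def prioritize(links):
    """Order links so reachable pages come first, then by keyword hits in the URL."""
    def score(link):
        reachable = 1 if link.status == 200 else 0
        hits = sum(keyword in link.url.lower() for keyword in PRIORITY_KEYWORDS)
        return (reachable, hits)
    return sorted(links, key=score, reverse=True)


def contextual_matches(text):
    """Return the sorted labels of every marker whose pattern occurs in the text."""
    return sorted(label for label, rx in SIGNATURE_MARKERS.items() if rx.search(text))
```

Under these assumptions, pages would be fetched in the order returned by `prioritize`, and the text of each selected page would be passed to `contextual_matches`.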
21 Claims
1. A computer-implemented method for generating a domain profile, comprising:
determining, using a processor, a purpose associated with a first domain by accessing a first web page associated with the first domain, wherein the determining comprises:
determining whether content is obtainable by resolving the first domain and accessing the first web page;
in response to determining that content is obtainable, determining whether the first web page results in a redirection;
in response to determining that the first web page does not result in a redirection, determining whether retrieved content of the first web page is content from another web page;
comparing the retrieved content with predetermined data to determine contextual matches;
in response to determining that the retrieved content is not content from another web page, determining, based on the contextual matches, whether the purpose associated with the first domain is to receive advertising traffic; and
in response to determining that the purpose of the first domain is not to receive advertising traffic, determining, based on the contextual matches, whether the purpose of the first domain is commercial;
determining, based on the contextual matches, a category associated with the first domain, wherein the category associated with the first domain is distinct from the purpose associated with the first domain and defines a commercial sector of the first domain; and
storing, in a memory, the purpose associated with the first domain and the category associated with the first domain.
Dependent claims: 2, 3, 4, 5, 6
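The order of the determinations recited in claim 1 can be read as a decision cascade. The sketch below is one possible illustration of that ordering, not the patented implementation. The `PageObservation` record, the marker sets, the category table, and every purpose label are hypothetical. In particular, the claim does not say what purpose is assigned when content is not obtainable, when the page redirects, or when the content comes from another web page, so the labels returned on those branches are assumptions.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class PageObservation:
    """Hypothetical record of what was seen when accessing the first web page."""
    content: Optional[str]           # None if resolving or fetching yielded nothing
    redirected: bool = False         # the page resulted in a redirection
    from_another_page: bool = False  # the content is content from another web page
    matches: set = field(default_factory=set)  # contextual match labels


# Hypothetical match labels; the claim does not name any.
AD_MARKERS = {"pay_per_click", "sponsored_listings"}
COMMERCIAL_MARKERS = {"shopping_cart", "pricing"}
CATEGORY_BY_MARKER = {"shopping_cart": "retail", "pricing": "services"}


def determine_purpose(obs):
    """Apply the checks in the order recited in the claim."""
    if obs.content is None:
        return "no_content"                 # assumed label
    if obs.redirected:
        return "redirect"                   # assumed label
    if obs.from_another_page:
        return "content_from_another_page"  # assumed label
    if obs.matches & AD_MARKERS:
        return "advertising_traffic"
    if obs.matches & COMMERCIAL_MARKERS:
        return "commercial"
    return "non_commercial"


def determine_category(obs):
    """Pick a commercial sector from the matches, independently of the purpose."""
    for marker in sorted(obs.matches):
        if marker in CATEGORY_BY_MARKER:
            return CATEGORY_BY_MARKER[marker]
    return None


def build_profile(domain, obs, store):
    """Store the purpose and the category for the domain in the given mapping."""
    profile = {"purpose": determine_purpose(obs), "category": determine_category(obs)}
    store[domain] = profile
    return profile
```

Keeping `determine_category` separate from `determine_purpose` mirrors the claim's requirement that the category be distinct from the purpose. The plain dictionary passed as `store` stands in for the memory recited in the final step.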
7. A non-transitory computer-readable storage medium storing instructions that, when executed by a computer, cause the computer to perform a method for generating a domain profile, comprising:
determining a purpose associated with a first domain by accessing a first web page associated with the first domain, wherein the determining comprises:
determining whether content is obtainable by resolving the first domain and accessing the first web page;
in response to determining that content is obtainable, determining whether the first web page results in a redirection;
in response to determining that the first web page does not result in a redirection, determining whether retrieved content of the first web page is content from another web page;
comparing the retrieved content with predetermined data to determine contextual matches;
in response to determining that the retrieved content is not content from another web page, determining, based on the contextual matches, whether the purpose associated with the first domain is to receive advertising traffic; and
in response to determining that the purpose of the first domain is not to receive advertising traffic, determining, based on the contextual matches, whether the purpose of the first domain is commercial;
determining, based on the contextual matches, a category associated with the first domain, wherein the category associated with the first domain is distinct from the purpose associated with the first domain and defines a commercial sector of the first domain; and
storing, in a memory, the purpose associated with the first domain and the category associated with the first domain.
Dependent claims: 8, 9, 10, 11, 12
13. An apparatus comprising:
a memory storing data and instructions; and
a processor configured to access the memory and execute the instructions, causing the processor to:
determine a purpose associated with a first domain by accessing a first web page associated with the first domain, wherein the determining comprises:
determining whether content is obtainable by resolving the first domain and accessing the first web page;
in response to determining that content is obtainable, determining whether the first web page results in a redirection;
in response to determining that the first web page does not result in a redirection, determining whether retrieved content of the first web page is content from another web page;
comparing the retrieved content with predetermined data to determine contextual matches;
in response to determining that the retrieved content is not content from another web page, determining, based on the contextual matches, whether the purpose associated with the first domain is to receive advertising traffic; and
in response to determining that the purpose of the first domain is not to receive advertising traffic, determining, based on the contextual matches, whether the purpose of the first domain is commercial;
determine, based on the contextual matches, a category associated with the first domain, wherein the category associated with the first domain is distinct from the purpose associated with the first domain and defines a commercial sector of the first domain; and
store, in the memory, the purpose associated with the first domain and the category associated with the first domain.
Dependent claims: 14, 15, 16, 17, 18
19. A computer-implemented method for generating a domain profile, comprising:
determining, using a processor, a purpose associated with a first domain, wherein the determining comprises:
determining whether content is obtainable by resolving the first domain and accessing the first web page;
in response to determining that content is obtainable, determining whether the first web page results in a redirection;
in response to determining that the first web page does not result in a redirection, determining whether retrieved content of the first web page is content from another web page, wherein the retrieved content comprises at least one hyperlink corresponding to one or more subsequent web page;
comparing the retrieved content with predetermined data to determine first contextual matches;
determining a purpose of the one or more subsequent web pages, wherein the determining comprises determining whether content is obtainable by accessing the one or more subsequent web page;
in response to determining that content is obtainable from the one or more subsequent web page, determining whether the one or more subsequent web page results in a redirection;
in response to determining that the one or more subsequent web page does not result in a redirection, determining whether retrieved content of the one or more subsequent web page is content from another web page;
comparing the retrieved content of the one or more subsequent web page with predetermined data to determine second contextual matches;
in response to determining that the retrieved content of the first web page and the retrieved content of the one or more subsequent web page are not content from another web page, determining, based on the first contextual matches and the second contextual matches, whether the purpose associated with the first domain is to receive advertising traffic; and
in response to determining that the purpose of the first domain is not to receive advertising traffic, determining, based on the first contextual matches and the second contextual matches, whether the purpose of the first domain is commercial;
determining, based on the first contextual matches and the second contextual matches, a category associated with the first domain, wherein the category associated with the first domain is distinct from the purpose associated with the first domain and defines a commercial sector of the first domain; and
storing, in a memory, the purpose of the first domain that is determined.
Dependent claims: 20, 21
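Claim 19 differs from claim 1 mainly in that matches from the first web page and from the subsequent web pages are considered together. A minimal sketch of that combination follows, under one possible reading: the purpose is decided only when every page passed the obtainable, redirection, and content-from-another-page checks. The `PageResult` record, the marker sets, and the `None` result for the branches the claim leaves open are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class PageResult:
    """Hypothetical outcome of the per-page checks recited in the claim."""
    obtainable: bool
    redirected: bool = False
    from_another_page: bool = False
    matches: frozenset = frozenset()  # contextual match labels for this page


# Hypothetical match labels; the claim does not name any.
AD_MARKERS = frozenset({"sponsored_listings"})
COMMERCIAL_MARKERS = frozenset({"shopping_cart"})


def usable(page):
    """True when the page passed all three per-page checks."""
    return page.obtainable and not page.redirected and not page.from_another_page


def domain_purpose(first, subsequent):
    """Decide the purpose from the first and second contextual matches combined."""
    pages = [first, *subsequent]
    if not all(usable(page) for page in pages):
        return None  # assumed: the claim does not specify these branches
    combined = frozenset().union(*(page.matches for page in pages))
    if combined & AD_MARKERS:
        return "advertising_traffic"
    if combined & COMMERCIAL_MARKERS:
        return "commercial"
    return "non_commercial"
```

Taking the union of the match sets is the simplest way to combine the two sets of matches. A weighted vote across pages would be an equally plausible reading of the claim language.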
Specification