Internet profile service
First Claim
1. A computer-implemented method, comprising:
- receiving a first webpage associated with a domain;
determining a plurality of additional webpages associated with the domain based on hyperlinks in the first webpage;
extracting content from the first webpage and the plurality of additional webpages;
determining technical data associated with the first webpage and the plurality of additional webpages;
processing the content and the technical data through a signature marker set to determine a contextual match;
determining a purpose of the domain based on the contextual match;
determining, based on the content, a category associated with the domain, wherein the category is distinct from the purpose and defines a commercial sector of the domain;
generating a domain profile for the domain, wherein the domain profile comprises an indication of the purpose of the domain and an indication of the category associated with the domain;
storing, in a memory, the domain profile;
receiving a query comprising at least one of a purpose and a category;
adding the domain to a list of identified domains based on a determination that the at least one of a purpose and a category corresponds to at least one of the purpose and the category associated with the domain profile; and
outputting the list of identified domains for display in response to the query.
0 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for creating and using a domain profile include identifying a status of a first page associated with a domain. The first page is retrieved and additional pages from the domain are identified based on hyperlinks from the first page. The status of the additional pages is identified and the hyperlinks are prioritized based on the status and/or a comparison with predetermined data. Content is extracted from the first page and selected pages from among the additional pages. The specific additional pages may be selected based on the prioritization. The retrieved content may be processed through a signature marker set to determine a contextual match. A purpose of the domain is determined according to the status of the first page, the status of the additional pages and results of the processing of the content. The domain profile can be displayed, stored, sent and/or searched to identify web sites or attributes of interest.
82 Citations
18 Claims
-
1. A computer-implemented method, comprising:
-
receiving a first webpage associated with a domain; determining a plurality of additional webpages associated with the domain based on hyperlinks in the first webpage; extracting content from the first webpage and the plurality of additional webpages; determining technical data associated with the first webpage and the plurality of additional webpages; processing the content and the technical data through a signature marker set to determine a contextual match; determining a purpose of the domain based on the contextual match; determining, based on the content, a category associated with the domain, wherein the category is distinct from the purpose and defines a commercial sector of the domain; generating a domain profile for the domain, wherein the domain profile comprises an indication of the purpose of the domain and an indication of the category associated with the domain; storing, in a memory, the domain profile; receiving a query comprising at least one of a purpose and a category; adding the domain to a list of identified domains based on a determination that the at least one of a purpose and a category corresponds to at least one of the purpose and the category associated with the domain profile; and outputting the list of identified domains for display in response to the query. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system, comprising:
-
a processing system comprising one or more processors; and a memory system comprising one or more non-transitory computer-readable media, wherein the one or more non-transitory computer-readable media contain instructions that, when executed by the processing system, cause the processing system to perform operations comprising; receiving a first webpage associated with a domain; determining a plurality of additional webpages associated with the domain based on hyperlinks in the first webpage; extracting content from the first webpage and the plurality of additional webpages; determining technical data associated with the first webpage and the plurality of additional webpages; processing the content and the technical data through a signature marker set to determine a contextual match; determining a purpose of the domain based on the contextual match; determining, based on the content, a category associated with the domain, wherein the category is distinct from the purpose and defines a commercial sector of the domain; generating a domain profile for the domain, wherein the domain profile comprises an indication of the purpose of the domain and an indication of the category associated with the domain; and storing, in the one or more non-transitory computer-readable media, the domain profile; receiving a query comprising at least one of a purpose and a category; adding the domain to a list of identified domains based on a determination that the at least one of a purpose and a category corresponds to at least one of the purpose and the category associated with the domain profile; and outputting the list of identified domains for display in response to the query. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification