Methods and systems for scanning and monitoring content on a network

  • US 20060106866A1
  • Filed: 10/29/2004
  • Published: 05/18/2006
  • Est. Priority Date: 10/29/2004
  • Status: Active Grant
First Claim

1. A method for scanning and monitoring content on a computer network, the method comprising:

    scanning the network to identify network resources on which the relevant content appears and the location of said resources on the network;

    scanning each of the identified network locations to determine any network address information, said network address information identifying the computer or computer system related to that network location;

    resolving each of said identified network locations by network address information into one or more network addresses;

    profiling said network resources by classifying the content available at said resource;

    acquiring said relevant content, said acquiring including copying said network resource and said content to another computer system;

    analyzing said network resource by breaking up the content at that network resource into one or more constituent elements;

    identifying and analyzing any links on a network resource to any other network resource and categorizing said links into local, neighbor, or remote links based on their relationship with the current resource being analyzed;

    identifying broken links, in which the network resource linked to is not available;

    keyword scanning of content on said network resource for predetermined keywords or phrases and determining whether said keywords are present on the network resource being analyzed;

    analyzing content for language patterns to obtain instances of information such as street addresses, phone numbers, email addresses and other pattern-defined items;

    fingerprinting content on said network resource to obtain a quantitative fingerprint for at least one of said constituent elements of said content; and

    fingerprinting said network location to obtain a single quantitative measurement for all of said network resources and content at said network location.
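
Several of the steps recited above (link categorization, keyword scanning, pattern extraction, and fingerprinting) can be illustrated with a minimal Python sketch. The claim does not specify how any of these steps are implemented, so everything below is an assumption made for illustration: the function names are hypothetical, "neighbor" is approximated as "different host sharing the last two domain labels", the regular expressions cover only simple email and US-style phone formats, and SHA-256 stands in for whatever quantitative fingerprint the method actually uses.

```python
import hashlib
import re
from urllib.parse import urljoin, urlparse


def classify_link(current_url, href):
    """Categorize a link as 'local', 'neighbor', or 'remote'.

    Assumption: 'local' means same host; 'neighbor' means a different
    host sharing the last two domain labels (a rough heuristic that is
    wrong for suffixes such as .co.uk); everything else is 'remote'.
    """
    current_host = urlparse(current_url).hostname or ""
    target_host = urlparse(urljoin(current_url, href)).hostname or ""
    if target_host == current_host:
        return "local"

    def base(host):
        return ".".join(host.split(".")[-2:])

    return "neighbor" if base(target_host) == base(current_host) else "remote"


def scan_keywords(text, keywords):
    """Report, per predetermined keyword, whether it occurs in the text."""
    lowered = text.lower()
    return {kw: kw.lower() in lowered for kw in keywords}


# Illustrative patterns only; real address and phone formats vary widely.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE_RE = re.compile(r"\(?\d{3}\)?[-.\s]\d{3}[-.\s]\d{4}")


def extract_patterns(text):
    """Collect pattern-defined items (emails, phone numbers) from text."""
    return {
        "emails": EMAIL_RE.findall(text),
        "phones": PHONE_RE.findall(text),
    }


def fingerprint_element(element):
    """Fingerprint one constituent element (assumed: SHA-256 hex digest)."""
    return hashlib.sha256(element.encode("utf-8")).hexdigest()


def fingerprint_location(elements):
    """Combine element fingerprints into one value for the whole location.

    Fingerprints are sorted first so the result does not depend on the
    order in which elements were collected.
    """
    combined = hashlib.sha256()
    for fp in sorted(fingerprint_element(e) for e in elements):
        combined.update(fp.encode("ascii"))
    return combined.hexdigest()
```

Under these assumptions, comparing the value returned by `fingerprint_location` across two scans of the same location would indicate whether any constituent element changed, while the per-element fingerprints would indicate which one.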

  • 7 Assignments