Methods and systems for detecting unwanted web contents

  • US 9,811,664 B1
  • Filed: 08/15/2011
  • Issued: 11/07/2017
  • Est. Priority Date: 08/15/2011
  • Status: Active Grant
First Claim

1. A method of detecting unwanted web contents, the method to be performed by a first computer and a second computer that each comprises a processor and a memory, the method comprising:

  • the first computer receiving a first web page from a first website;

    the first computer extracting a plurality of hypertext markup language (HTML) tags from the first web page;

    the first computer generating page structure traits of the first web page by forming the plurality of HTML tags together into a pattern that comprises the plurality of HTML tags;

    the first computer comparing the page structure traits of the first web page to page structure traits of a normal web page;

    to prevent false positives, the first computer removing from the page structure traits of the first web page a feature that makes the page structure traits of the normal web page match the page structure traits of the first web page;

    the second computer receiving the page structure traits of the first web page after the feature has been removed from the page structure traits of the first web page; and

    the second computer detecting unwanted web content in a second web page received from a second website by comparing page structure traits of the second web page against the page structure traits of the first web page.
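
The steps recited in claim 1 can be illustrated with a minimal sketch. This is not the patented implementation: representing the "pattern" as a set of consecutive tag sequences (n-grams), the overlap threshold, and every function name below (`extract_tags`, `page_structure_traits`, `remove_shared_features`, `looks_unwanted`) are assumptions made only for illustration. Both computers are simulated in one process.

```python
from html.parser import HTMLParser


class TagCollector(HTMLParser):
    """Collects opening HTML tag names in document order."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        self.tags.append(tag)


def extract_tags(html):
    """Step: extract a plurality of HTML tags from a web page."""
    collector = TagCollector()
    collector.feed(html)
    collector.close()
    return collector.tags


def page_structure_traits(html, n=3):
    """Step: form the tags into patterns.

    Assumption: each trait is a run of n consecutive tag names.
    """
    tags = extract_tags(html)
    return {tuple(tags[i:i + n]) for i in range(len(tags) - n + 1)}


def remove_shared_features(first_traits, normal_traits):
    """Step: drop features that also occur in a normal page,
    so that normal pages are not flagged (false positives)."""
    return first_traits - normal_traits


def looks_unwanted(page_html, signature, threshold=0.5):
    """Step (second computer): compare a second page against the
    filtered traits. The threshold is a hypothetical parameter."""
    if not signature:
        return False
    traits = page_structure_traits(page_html)
    overlap = len(traits & signature) / len(signature)
    return overlap >= threshold


# First computer: build filtered traits from a known unwanted page.
first_page = ("<html><body><div><iframe></iframe><script></script>"
              "<a></a></div></body></html>")
normal_page = "<html><body><div><p></p></div></body></html>"
signature = remove_shared_features(
    page_structure_traits(first_page),
    page_structure_traits(normal_page),
)

# Second computer: screen a page received from another website.
second_page = ("<html><body><div><iframe></iframe><script></script>"
               "<a></a><p></p></div></body></html>")
flagged = looks_unwanted(second_page, signature)
```

In this sketch the trait `("html", "body", "div")` appears in both the first page and the normal page, so it is removed before the traits are handed to the second computer; only the traits specific to the first page are used for detection.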
