System and method for identifying and blocking pornogarphic and other web content on the internet
First Claim
1. A system for identifying possibly pornographic web sites comprising:
- a feature extraction module, the feature extraction module comprising;
a first module for extracting the URL of the website from a request for web content;
a second module for extracting text from text portions of the web page;
a third module for extracting image portions from the web page that likely correspond to the skin of an individual; and
a fusion module for evaluating the output from the feature extraction module and determining whether the web page comprises possibly pornographic content.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method are disclosed for identifying and blocking unacceptable web content, including pornographic web content. In a preferred embodiment, the system comprises a proxy server connected between a client and the Internet that checks a requested URL against a block list that may include URLs identified by a web spider. If the URL is not on the block list, the proxy server requests the web content. When the web content is received, the proxy server processes its text content and compares the processing results using a thresholder. If necessary, the proxy server then processes the image content of the retrieved web content to determine if it comprises skin tones and textures. Based on these processing results, the proxy server may either block the retrieved web content or permit user access to it. Also disclosed is a system and method for inserting advertisements into retrieved web content.
-
Citations
16 Claims
-
1. A system for identifying possibly pornographic web sites comprising:
-
a feature extraction module, the feature extraction module comprising;
a first module for extracting the URL of the website from a request for web content;
a second module for extracting text from text portions of the web page;
a third module for extracting image portions from the web page that likely correspond to the skin of an individual; and
a fusion module for evaluating the output from the feature extraction module and determining whether the web page comprises possibly pornographic content. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
- 11. The system of claim further comprising an image analysis engine.
-
14. A method for inserting an advertisement into retrieved web content, comprising:
-
retrieving web content;
retrieving an advertisement;
inserting the advertisement into the web content in a computer that is either the client computer that requested the web content or a server connected to the same LAN or WAN as the computer that requested the web content. - View Dependent Claims (15, 16)
-
Specification