SYSTEMS AND METHODS FOR FILTERING WEB PAGE CONTENTS
First Claim
1. A method of selectively filtering web page contents for web page analysis, comprising:
generating a document object model (DOM) structure and a visual information of the web page contents;
analyzing the DOM structure and the visual information to determine multiple web page content attributes for filtering;
selecting one or more filtering parameters from the multiple web page content attributes; and
filtering the web page contents based on the selected one or more filtering parameters for the web page analysis.
Abstract
A system and method for selectively filtering web page contents are disclosed. In one example embodiment, a document object model (DOM) structure and visual information of the web page contents are generated. The DOM structure and the visual information are analyzed to determine multiple web page content attributes. One or more filtering parameters are selected from the multiple web page content attributes. The web page is filtered based on the one or more filtering parameters.
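The four steps named in the abstract (generate, analyze, select, filter) can be illustrated with a minimal Python sketch. This is an illustration only and not the implementation disclosed in the patent. The `Node` and `DomBuilder` classes, the `analyze` and `filter_contents` functions, the sample markup, and the thresholds are all hypothetical. In particular, the `VISUAL` dictionary is an assumed stand-in for visual information that a real system would presumably obtain from a rendering engine.

```python
from dataclasses import dataclass, field
from html.parser import HTMLParser


@dataclass
class Node:
    """A simplified DOM element (hypothetical structure for this sketch)."""
    tag: str
    attrs: dict
    text: str = ""
    children: list = field(default_factory=list)


class DomBuilder(HTMLParser):
    """Step 1a: build a small DOM-like tree from markup."""
    VOID = {"br", "img", "hr", "input", "meta", "link"}

    def __init__(self):
        super().__init__()
        self.root = Node("#document", {})
        self.stack = [self.root]

    def handle_starttag(self, tag, attrs):
        node = Node(tag, dict(attrs))
        self.stack[-1].children.append(node)
        if tag not in self.VOID:  # void elements never have children
            self.stack.append(node)

    def handle_endtag(self, tag):
        # Pop back to the nearest matching open element, if there is one.
        for i in range(len(self.stack) - 1, 0, -1):
            if self.stack[i].tag == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        self.stack[-1].text += data.strip()


def analyze(root, visual):
    """Step 2: merge DOM and visual data into per-element attributes."""
    records = []

    def walk(node):
        box = visual.get(node.attrs.get("id"), {})
        records.append({
            "node": node,
            "tag": node.tag,
            "text_length": len(node.text),
            "area": box.get("width", 0) * box.get("height", 0),
            "visible": box.get("visible", False),
        })
        for child in node.children:
            walk(child)

    for child in root.children:
        walk(child)
    return records


def filter_contents(records, params):
    """Step 4: keep elements that pass every selected parameter test."""
    return [r["node"] for r in records
            if all(test(r[name]) for name, test in params.items())]


HTML = ('<div id="ad">Buy now</div>'
        '<p id="body">Main article text goes here.</p>'
        '<span id="hidden">tracking</span>')

# Step 1b: assumed visual information, keyed by element id (pixels).
VISUAL = {
    "ad": {"width": 728, "height": 90, "visible": True},
    "body": {"width": 600, "height": 400, "visible": True},
    "hidden": {"width": 0, "height": 0, "visible": False},
}

builder = DomBuilder()
builder.feed(HTML)
records = analyze(builder.root, VISUAL)

# Step 3: select two of the available attributes as filtering parameters.
params = {"visible": lambda v: v, "area": lambda a: a >= 100_000}

kept = filter_contents(records, params)
print([n.attrs.get("id") for n in kept])  # ['body']
```

With these assumed thresholds, the banner is dropped because its rendered area is too small and the tracking element is dropped because it is not visible, so only the main paragraph remains. Choosing a different subset of attributes in `params` changes what is filtered without touching the analysis step.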
15 Claims
1. A method of selectively filtering web page contents for web page analysis, comprising:
generating a document object model (DOM) structure and a visual information of the web page contents;
analyzing the DOM structure and the visual information to determine multiple web page content attributes for filtering;
selecting one or more filtering parameters from the multiple web page content attributes; and
filtering the web page contents based on the selected one or more filtering parameters for the web page analysis.
(Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9)
10. A system for selectively filtering web page contents for web page extraction, comprising:
a processor; and
a memory operatively coupled to the processor, wherein the memory includes a web page filtering module for filtering the web page contents, having instructions capable of;
generating a document object model (DOM) structure and a visual information of the web page contents;
analyzing the DOM structure and the visual information to determine multiple web page content attributes;
selecting one or more filtering parameters from the multiple web page content attributes; and
filtering the web page contents based on the selected one or more filtering parameters for the web page extraction.
(Dependent claims: 11, 12, 13, 14)
15. A non-transitory computer-readable storage medium for selective filtering of web page contents for web page extraction, having instructions that, when executed by a computing device, causes the computing device to perform a method comprising:
generating a document object model (DOM) structure and a visual information of the web page contents;
analyzing the DOM structure and the visual information to determine multiple web page content attributes;
selecting one or more filtering parameters from the multiple web page content attributes; and
filtering the web page contents based on the selected one or more filtering parameters for the web page extraction.
Specification