×

Visual web page analysis system and method

  • US 10,664,647 B2
  • Filed: 10/28/2014
  • Issued: 05/26/2020
  • Est. Priority Date: 10/28/2014
  • Status: Active Grant
First Claim
Patent Images

1. A visual web page analysis system for analyzing data of a web page based on vision, the system comprising:

  • a processor;

    an image analyzing program executed by a processer to enable a processer to load information of a web page and to segment content of the web page into a plurality of blocks based on at least a visual feature of the web page;

    a block analyzing program executed by the processer to enable the processer to classify the plurality of blocks based on at least an attribute of each block;

    a vision identifying program executed by the processer to enable the processer to compare at least a relative feature of each block to determine a function of each classified block on the web page; and

    an output program executed by the processer to enable the processer to collect the plurality of blocks and their functions into an information interface and to output the information interface,wherein the processor provides an analyzed result shown on the information interface, andwherein the block analyzing program executed by the processer further enables the processer to receive a plurality of web page tags, to determine the attribute of each block in accordance with the following formulas for Degree of Picture Hyperlink (DoPH), Picture Text Ratio (PTR), Degree of Local Text Hyperlink (DoLTH), Text Ratio, and Degree of Local Picture Hyperlink (DoLPH);

    (DoTH)=(number of text hyperlinks in the block)/(number of text tags in the block), where a text tag is any HTML grammar instruction that can be used to present texts;

    (DoPH)=(number of picture hyperlinks in the block)/(number of picture tags in the block), where a picture tag is any HTML grammar instruction that can be used to present pictures;

    Text Ratio=(number of characters in the block)/(number of characters in the web page);

    (PTR)=(number of image tags in the block)/(number of text tags in the block), where the PTR is used to measure a pictures-versus-text ratio in the block;

    (DoLTH)=(number of local text hyperlinks in the block)/(number of text hyperlinks in the block), where the local text hyperlinks are text hyperlinks linked to the same web domain; and

    (DoLPH)=(number of local picture hyperlinks in the block)/(number of picture hyperlinks in the block), where the local picture hyperlinks are picture hyperlinks linked to the same web domain.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×