×

Discrete Wavelet Transform Method for Document Structure Similarity

  • US 20140236968A1
  • Filed: 10/31/2011
  • Published: 08/21/2014
  • Est. Priority Date: 10/31/2011
  • Status: Active Grant
First Claim
Patent Images

1. A method for determining document structure similarity, comprising:

  • segmenting path sequences (206) of Document Object. Model (DOM) trees (120, 462) from a number of web pages (202) into B components (561);

    determining path signals (210) corresponding to the path sequences (206) based on a count of the occurrences of particular paths in the Bth component (571);

    transforming unique path signals (210) into discrete wavelet signals (214) (572); and

    analyzing the discrete wavelet signals (214) multiple DOM ee resolution levels (573).

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×