×

Classification of images as advertisement images or non-advertisement images of web pages

  • US 7,840,502 B2
  • Filed: 06/13/2007
  • Issued: 11/23/2010
  • Est. Priority Date: 06/13/2007
  • Status: Active Grant
First Claim
Patent Images

1. A method in a computing device for identifying advertisement images of web pages, the method comprising:

  • providing training images of web pages, each provided training image being referenced by a web page;

    labeling the images as advertisement images or non-advertisement images;

    generating a feature vector for each of the training images, the feature vector including visual layout features derived from the web page of the image and content features derived from content of the image, the content features being selected from the group consisting of number of different colors in the content of the image, an indication of whether the image has high contrast, and an indication of whether the image is a photograph;

    training a binary classifier using the feature vectors and labels of the images; and

    classifying an image as an advertisement image or non-advertisement image by generating a feature vector for the image and applying the trained binary classifier to the generated feature vector of the image.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×