CLASSIFICATION OF IMAGES AS ADVERTISEMENT IMAGES OR NON-ADVERTISEMENT IMAGES
First Claim
1. A method in a computing device for identifying advertisement images of display pages, the method comprising:
- providing training images of display pages;
labeling the images as advertisement images or non-advertisement images;
generating a feature vector for each of the training images, the feature vector including features derived from the display page of the image;
training a binary classifier using the feature vectors and labels of the images; and
classifying an image as an advertisement image or non-advertisement image by generating a feature vector for the image and applying the trained binary classifier to the generated feature vector of the image.
2 Assignments
0 Petitions
Accused Products
Abstract
An advertisement image classification system trains a binary classifier to classify images as advertisement images or non-advertisement images and then uses the binary classifier to classify images of web pages as advertisement images or non-advertisement images. During a training phase, the classification system generates training data of feature vectors representing the images and labels indicating whether an image is an advertisement image or a non-advertisement image. The classification system trains a binary classifier to classify images using training data. During a classification phase, the classification system inputs a web page with an image and generates a feature vector for the image. The classification system then applies the trained binary classifier to the feature vector to generate a score indicating whether the image is an advertisement image or a non-advertisement image.
35 Citations
20 Claims
-
1. A method in a computing device for identifying advertisement images of display pages, the method comprising:
-
providing training images of display pages; labeling the images as advertisement images or non-advertisement images; generating a feature vector for each of the training images, the feature vector including features derived from the display page of the image; training a binary classifier using the feature vectors and labels of the images; and classifying an image as an advertisement image or non-advertisement image by generating a feature vector for the image and applying the trained binary classifier to the generated feature vector of the image. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer-readable medium for generating a binary classifier for classifying images of web pages as advertisement images or non-advertisement images, by a method comprising:
-
providing training web pages; identifying the images of the training web pages; receiving labels for the images indicating whether an image is an advertisement image or non-advertisement image; generating a feature vector for each of the identified images, the feature vector including visual layout features of the image on the web page; and training a binary classifier using the feature vectors and labels of the images wherein the training identifies weights of the features for use in classifying images. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A computing device for identifying features of images of web pages for use in classifying images as advertisement images or non-advertisement images, comprising:
-
a training data store including feature vectors for images of web pages, the images being labeled as advertisement images or non-advertisement images, the feature vectors including candidate features; a component that trains a classifier using the feature vectors with candidate features and labels of the training data store; and a component that selects as features for use in classifying images those candidate features whose weights indicate they are effective at distinguishing advertisement images from non-advertisement images. - View Dependent Claims (18, 19, 20)
-
Specification