AUTOMATIC CLASSIFICATION OF DISPLAY ADS USING AD IMAGES AND LANDING PAGES
Abstract
A system and method for automatically classifying ads into a taxonomy of categories, the method including: extracting text features from ad images using OCR (optical character recognition) techniques; identifying objects of interest from ad images using object detection and recognition techniques in computer vision; extracting text features from the web-page of the advertiser to which the user is re-directed when clicking the ad; training statistical models using the extracted features mentioned above as well as advertiser attributes from a historical dataset of ads labeled by human editors; and determining the relevant categories of unlabeled ads using the trained models.
3 Claims
1. A method for classifying ads automatically in a taxonomy of categories, the method comprising steps of:
extracting text features from ad images using optical character recognition (OCR) techniques;
identifying objects of interest from ad images using object detection and recognition techniques in computer vision;
extracting text features from the web-page of the advertiser that the user is redirected to when clicking the ad;
training statistical models using the extracted features as well as the advertiser attributes from a historical dataset of ads labeled by human editors; and
determining the relevant categories of unlabeled ads using the trained models.
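The training and classification steps of claim 1 can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the claim does not name a model, so a multinomial naive Bayes classifier with add-one smoothing stands in for the "statistical models"; the OCR and object-detection stages are not implemented and their outputs are passed in as plain strings and labels; the helper names (`build_features`, `NaiveBayes`), the category names, and the sample ads are all hypothetical.

```python
import math
from collections import Counter, defaultdict


def build_features(ocr_text, objects, landing_text, advertiser):
    """Merge the four feature sources into one bag of namespaced tokens."""
    feats = Counter()
    feats.update("ocr:" + t for t in ocr_text.lower().split())      # text read from the ad image
    feats.update("obj:" + o for o in objects)                        # detected object labels
    feats.update("lp:" + t for t in landing_text.lower().split())   # landing-page text
    feats.update("adv:%s=%s" % (k, v) for k, v in sorted(advertiser.items()))
    return feats


class NaiveBayes:
    """Multinomial naive Bayes with add-one smoothing (an illustrative stand-in)."""

    def fit(self, examples):
        # examples: list of (feature Counter, editor-assigned category)
        self.class_counts = Counter()
        self.token_counts = defaultdict(Counter)
        self.vocab = set()
        for feats, label in examples:
            self.class_counts[label] += 1
            self.token_counts[label].update(feats)
            self.vocab.update(feats)
        return self

    def scores(self, feats):
        total = sum(self.class_counts.values())
        out = {}
        for label, n in self.class_counts.items():
            denom = sum(self.token_counts[label].values()) + len(self.vocab)
            logp = math.log(n / total)
            for tok, cnt in feats.items():
                if tok not in self.vocab:
                    continue  # tokens never seen in training carry no evidence
                logp += cnt * math.log((self.token_counts[label][tok] + 1) / denom)
            out[label] = logp
        return out

    def predict(self, feats, k=1):
        """Return the k highest-scoring categories, best first."""
        s = self.scores(feats)
        return sorted(s, key=s.get, reverse=True)[:k]


# Hypothetical historical dataset of ads labeled by human editors.
labeled = [
    (build_features("50% off running shoes", ["shoe"],
                    "buy sneakers and running shoes online",
                    {"vertical": "retail"}), "Apparel/Footwear"),
    (build_features("low apr car loans", ["car"],
                    "auto loan rates and financing",
                    {"vertical": "finance"}), "Finance/Auto Loans"),
]
model = NaiveBayes().fit(labeled)

# An unlabeled ad, described by the same four feature sources.
new_ad = build_features("new running shoes sale", ["shoe"],
                        "shop shoes", {"vertical": "retail"})
print(model.predict(new_ad))
```

Prefixing each token with its source (`ocr:`, `obj:`, `lp:`, `adv:`) keeps the same word from different sources as separate features, which is one simple way to let a model weight image text, detected objects, landing-page text, and advertiser attributes independently.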
3. A system of display ad categorization, the system comprising:
a processor device;
a storage device operably coupled with the processor device, said storage device comprising instructions that are executed by said processor device;
wherein the instructions cause a computer to perform a method comprising steps of:
reading an ad image and a landing page for an advertisement from the storage device;
extracting text features from an ad image using optical character recognition (OCR) techniques;
executing object detection and recognition to identify objects of interest from the ad image;
parsing the landing page to extract text features;
storing the extracted features from the ad image and the landing page in the storage device;
training statistical models using the extracted features as well as advertiser attributes from a historical dataset of ads labeled by human editors; and
determining the relevant categories of unlabeled ads using the trained models.
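The "parsing the landing page" and "storing the extracted features" steps of claim 3 might look like the sketch below. It is only one possible reading: the claim does not specify a parser, a tokenization scheme, or a storage layout, so the standard-library `html.parser` and an in-memory SQLite table are assumptions, and the names `VisibleText`, `landing_page_features`, `store_features`, `load_features`, the `ad_features` table, and the sample page are hypothetical.

```python
import json
import sqlite3
from collections import Counter
from html.parser import HTMLParser


class VisibleText(HTMLParser):
    """Collect text nodes, skipping the contents of <script> and <style>."""

    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.parts.append(data)


def landing_page_features(html):
    """Return word counts for the visible text of a landing page."""
    parser = VisibleText()
    parser.feed(html)
    parser.close()
    words = " ".join(parser.parts).lower().split()
    # Keep tokens containing at least one letter; trim surrounding punctuation.
    return Counter(w.strip(".,!?:;()\"'") for w in words
                   if any(c.isalpha() for c in w))


def store_features(conn, ad_id, features):
    """Persist one ad's features as a JSON document keyed by ad id."""
    conn.execute("CREATE TABLE IF NOT EXISTS ad_features "
                 "(ad_id TEXT PRIMARY KEY, features TEXT)")
    conn.execute("INSERT OR REPLACE INTO ad_features VALUES (?, ?)",
                 (ad_id, json.dumps(features)))
    conn.commit()


def load_features(conn, ad_id):
    """Read an ad's features back, or None if the ad is unknown."""
    row = conn.execute("SELECT features FROM ad_features WHERE ad_id = ?",
                       (ad_id,)).fetchone()
    return Counter(json.loads(row[0])) if row else None


# Hypothetical landing page and ad id.
page = ("<html><head><style>p{color:red}</style>"
        "<script>var x = 1;</script></head>"
        "<body><h1>Running Shoes</h1><p>Free shipping on shoes!</p></body></html>")
feats = landing_page_features(page)

conn = sqlite3.connect(":memory:")
store_features(conn, "ad-001", feats)
restored = load_features(conn, "ad-001")
```

Storing features separately from training means the extraction steps run once per ad, and the stored rows can later be joined with editor labels to form the historical dataset used for training.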
Specification