Smart image filtering method with domain rules application
First Claim
1. A method for building computing components of an image filtering system prior to a runtime assessment of images for compliance by the image filtering system, comprising:
- defining, by the image filtering system, a vocabulary for a given domain, comprising:
  - extracting a set of entities from a set of corpus documents related to the given domain; and
  - grouping the set of entities into a set of image categories based on probabilities of co-occurrence of the set of entities;
- building, by the image filtering system, an image catalog, comprising:
  - identifying one or more objects in one or more images from the image catalog; and
  - associating the one or more objects with the set of entities in the set of image categories;
- generating, by the image filtering system, a set of compliance rules, comprising:
  - identifying a given image category of the set of image categories that relate to topics in corpus data sources; and
  - generating the set of compliance rules, using the set of entities grouped in the given image category; and
- training, by the image filtering system, an image classification pipeline, comprising:
  - identifying one or more entities in a given image of the one or more images from the image catalog;
  - classifying the given image according to one or more of the set of compliance rules; and
  - repeating the identifying of the one or more entities in the given image and the classifying of additional images until a set of performance metrics are met.
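The vocabulary step of the claim groups entities by their probabilities of co-occurrence, but the claim does not fix a particular estimator or grouping algorithm. The following is a minimal sketch under stated assumptions: the probability is estimated as the fraction of corpus documents in which two entities appear together, and entities are merged into one category whenever that fraction reaches a threshold. The function names, the `threshold` parameter, and the union-find grouping are all illustrative choices, not taken from the patent.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_probabilities(docs):
    """Estimate, for each entity pair, the fraction of documents containing both.

    `docs` is a list of entity lists, one list per corpus document
    (an assumed input shape for this sketch).
    """
    pair_counts = Counter()
    for entities in docs:
        for a, b in combinations(sorted(set(entities)), 2):
            pair_counts[(a, b)] += 1
    total = len(docs)
    return {pair: count / total for pair, count in pair_counts.items()}


def group_entities(docs, threshold=0.5):
    """Group entities into categories (assumed rule: merge pairs with p >= threshold)."""
    parent = {}

    def find(x):
        # Union-find lookup with path halving.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Register every entity so that unpaired entities form singleton categories.
    for entities in docs:
        for entity in entities:
            find(entity)

    # Merge entities that co-occur often enough.
    for (a, b), probability in cooccurrence_probabilities(docs).items():
        if probability >= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for entity in list(parent):
        groups.setdefault(find(entity), set()).add(entity)
    return sorted(sorted(group) for group in groups.values())


if __name__ == "__main__":
    corpus = [
        ["bottle", "glass", "wine"],
        ["bottle", "glass"],
        ["syringe", "pill"],
        ["pill", "syringe", "bottle"],
    ]
    print(group_entities(corpus, threshold=0.5))
```

A transitive merge like this can chain loosely related entities into one large category; a real system might prefer a clustering method with a tighter cohesion criterion.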
Abstract
An image filtering system for compliance assessment is built and modified dynamically. The system defines a vocabulary for a domain by extracting entities from corpus documents and grouping the entities into image categories. An image catalog is built by identifying objects in images from a system image catalog and associating the objects with entities in the image categories. Compliance rules are generated by identifying and using image categories related to topics in corpus data sources. An image classification pipeline is then trained to classify images according to the compliance rules until performance metrics are met. The training process may be repeated to extract new and updated knowledge from the corpus documents and/or corpus data sources. For compliance assessment of an image, entities in related image categories are identified. The image is then classified according to actions associated with any applicable compliance rules.
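The runtime assessment described at the end of the abstract (identify entities, find related image categories, apply the actions of any applicable rules) can be sketched as follows. This is an illustration under assumptions, not the patented implementation: the `ComplianceRule` structure, its `trigger_entities` field, the action strings, and the `default_action` fallback are all hypothetical, and the entities are assumed to have been detected in the image already.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ComplianceRule:
    """Hypothetical rule shape: fires when all trigger entities are present."""
    category: str
    trigger_entities: frozenset
    action: str  # e.g. "reject" or "flag" (illustrative action names)


def assess(image_entities, categories, rules, default_action="accept"):
    """Return the actions of every rule applicable to the detected entities.

    `categories` maps a category name to the set of entities grouped in it.
    """
    found = set(image_entities)
    # Categories related to the image: those sharing at least one entity with it.
    related = {name for name, members in categories.items() if found & members}
    actions = [
        rule.action
        for rule in rules
        if rule.category in related and rule.trigger_entities <= found
    ]
    return actions or [default_action]


if __name__ == "__main__":
    categories = {
        "alcohol": {"bottle", "glass", "wine"},
        "medical": {"pill", "syringe"},
    }
    rules = [
        ComplianceRule("alcohol", frozenset({"wine"}), "reject"),
        ComplianceRule("medical", frozenset({"syringe"}), "flag"),
    ]
    print(assess(["wine", "glass"], categories, rules))
    print(assess(["tree"], categories, rules))
```

Returning a list of actions rather than a single verdict leaves the conflict-resolution policy (for example, "reject" overriding "flag") to the caller, since the abstract does not specify one.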
38 Citations
20 Claims
1. A method for building computing components of an image filtering system prior to a runtime assessment of images for compliance by the image filtering system, comprising:
- defining, by the image filtering system, a vocabulary for a given domain, comprising:
  - extracting a set of entities from a set of corpus documents related to the given domain; and
  - grouping the set of entities into a set of image categories based on probabilities of co-occurrence of the set of entities;
- building, by the image filtering system, an image catalog, comprising:
  - identifying one or more objects in one or more images from the image catalog; and
  - associating the one or more objects with the set of entities in the set of image categories;
- generating, by the image filtering system, a set of compliance rules, comprising:
  - identifying a given image category of the set of image categories that relate to topics in corpus data sources; and
  - generating the set of compliance rules, using the set of entities grouped in the given image category; and
- training, by the image filtering system, an image classification pipeline, comprising:
  - identifying one or more entities in a given image of the one or more images from the image catalog;
  - classifying the given image according to one or more of the set of compliance rules; and
  - repeating the identifying of the one or more entities in the given image and the classifying of additional images until a set of performance metrics are met.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A computer program product for building computing components of an image filtering system prior to a runtime assessment of images for compliance by the image filtering system, the computer program product comprising a computer readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause the processor to:
- define a vocabulary for a given domain, comprising:
  - extracting a set of entities from a set of corpus documents related to the given domain; and
  - grouping the set of entities into a set of image categories based on probabilities of co-occurrence of the set of entities;
- build an image catalog, comprising:
  - identifying one or more objects in one or more images from the image catalog; and
  - associating the one or more objects with the set of entities in the set of image categories;
- generate a set of compliance rules, comprising:
  - identifying a given image category of the set of image categories that relate to topics in corpus data sources; and
  - generating the set of compliance rules, using the set of entities grouped in the given image category; and
- train an image classification pipeline, comprising:
  - identifying one or more entities in a given image of the one or more images from the image catalog;
  - classifying the given image according to one or more of the set of compliance rules; and
  - repeating the identifying of the one or more entities in the given image and the classifying of additional images until a set of performance metrics are met.
Dependent claims: 9, 10, 11, 12, 13, 14
15. An image filtering system, comprising:
- a processor; and
- a computer readable storage medium having program instructions embodied therewith for building computing components of the image filtering system prior to a runtime assessment of images for compliance by the image filtering system, the program instructions executable by the processor to cause the processor to:
  - define a vocabulary for a given domain, comprising:
    - extracting a set of entities from a set of corpus documents related to the given domain; and
    - grouping the set of entities into a set of image categories based on probabilities of co-occurrence of the set of entities;
  - build an image catalog, comprising:
    - identifying one or more objects in one or more images from the image catalog; and
    - associating the one or more objects with the set of entities in the set of image categories;
  - generate a set of compliance rules, comprising:
    - identifying a given image category of the set of image categories that relate to topics in corpus data sources; and
    - generating the set of compliance rules, using the set of entities grouped in the given image category; and
  - train an image classification pipeline, comprising:
    - identifying one or more entities in a given image of the one or more images from the image catalog;
    - classifying the given image according to one or more of the set of compliance rules; and
    - repeating the identifying of the one or more entities in the given image and the classifying of additional images until a set of performance metrics are met.
Dependent claims: 16, 17, 18, 19, 20
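Each independent claim ends with a training step that repeats until a set of performance metrics are met. The claims do not say which metrics are used or how the loop is bounded, so the sketch below makes both explicit as assumptions: `targets` maps metric names to minimum values, `train_step` and `evaluate` are hypothetical callables supplied by the caller, and `max_rounds` is an added safety limit.

```python
def train_until_metrics_met(train_step, evaluate, targets, max_rounds=100):
    """Repeat training rounds until every target metric reaches its floor.

    train_step: callable that performs one round of identification and
                classification over a batch of images (hypothetical).
    evaluate:   callable returning a dict of metric name -> current value.
    targets:    dict of metric name -> minimum acceptable value.
    Returns (rounds_used, final_metrics).
    """
    for round_number in range(1, max_rounds + 1):
        train_step()
        metrics = evaluate()
        if all(metrics.get(name, 0.0) >= floor for name, floor in targets.items()):
            return round_number, metrics
    raise RuntimeError(f"targets {targets} not met after {max_rounds} rounds")


if __name__ == "__main__":
    # Toy stand-in for a classifier: one more correct answer per round, out of 10.
    state = {"correct": 5}

    def train_step():
        state["correct"] = min(10, state["correct"] + 1)

    def evaluate():
        return {"accuracy": state["correct"] / 10}

    print(train_until_metrics_met(train_step, evaluate, {"accuracy": 0.8}))
```

Bounding the loop with `max_rounds` is a design choice of this sketch; without some limit, a pipeline that cannot reach its targets would never terminate.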
Specification