DETECTING RECURRING THEMES IN CONSUMER IMAGE COLLECTIONS
First Claim
1. A method of identifying groups of related digital images in a digital image collection including a plurality of digital images, comprising:
- analyzing each of the digital images in the digital image collection to generate associated feature descriptors related to image content or image capture conditions;
storing the feature descriptors associated with the digital images in a metadata database;
using a data processor to automatically analyze the metadata database to identify a plurality of frequent itemsets, wherein each of the frequent itemsets is a co-occurring feature descriptor group that occurs in at least a predefined fraction of the digital images, each frequent itemset being associated with a subset of the digital images;
determining a probability of occurrence for each the identified frequent itemsets based on one or more probability distributions determined from an analysis of a large number of image collections;
determining a quality score for each of the identified frequent itemsets responsive to the determined probability of occurrence;
ranking the frequent itemsets based at least on the determined quality scores;
identifying one or more groups of related digital images corresponding to one or more of the top ranked frequent itemsets; and
storing an indication of the identified groups of related digital images in a processor-accessible memory.
5 Assignments
0 Petitions
Accused Products
Abstract
A method of identifying groups of related digital images in a digital image collection, comprising: analyzing each of the digital images to generate associated feature descriptors related to image content or image capture conditions; storing the feature descriptors associated with the digital images in a metadata database; automatically analyzing the metadata database to identify a plurality of frequent itemsets, wherein each of the frequent itemsets is a co-occurring feature descriptor group that occurs in at least a predefined fraction of the digital images; determining a probability of occurrence for each the identified frequent itemsets; determining a quality score for each of the identified frequent itemsets responsive to the determined probability of occurrence; ranking the frequent itemsets based at least on the determined quality scores; and identifying one or more groups of related digital images corresponding to one or more of the top ranked frequent itemsets.
-
Citations
16 Claims
-
1. A method of identifying groups of related digital images in a digital image collection including a plurality of digital images, comprising:
-
analyzing each of the digital images in the digital image collection to generate associated feature descriptors related to image content or image capture conditions; storing the feature descriptors associated with the digital images in a metadata database; using a data processor to automatically analyze the metadata database to identify a plurality of frequent itemsets, wherein each of the frequent itemsets is a co-occurring feature descriptor group that occurs in at least a predefined fraction of the digital images, each frequent itemset being associated with a subset of the digital images; determining a probability of occurrence for each the identified frequent itemsets based on one or more probability distributions determined from an analysis of a large number of image collections; determining a quality score for each of the identified frequent itemsets responsive to the determined probability of occurrence; ranking the frequent itemsets based at least on the determined quality scores; identifying one or more groups of related digital images corresponding to one or more of the top ranked frequent itemsets; and storing an indication of the identified groups of related digital images in a processor-accessible memory. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A system comprising:
-
a data processing system; and a memory system communicatively connected to the data processing system and storing instructions configured to cause the data processing system to implement a method for identifying groups of related digital images in a digital image collection including a plurality of digital images, wherein the method comprises; analyzing each of the digital images in the digital image collection to generate associated feature descriptors related to image content or image capture conditions; storing the feature descriptors associated with the digital images in a metadata database; automatically analyzing the metadata database to identify a plurality of frequent itemsets, wherein each of the frequent itemsets is a co-occurring feature descriptor group that occurs in at least a predefined fraction of the digital images, each frequent itemset being associated with a subset of the digital images; determining a probability of occurrence for each the identified frequent itemsets based on one or more probability distributions determined from an analysis of a large number of image collections; determining a quality score for each of the identified frequent itemsets responsive to the determined probability of occurrence; ranking the frequent itemsets based at least on the determined quality scores; identifying one or more groups of related digital images corresponding to one or more of the top ranked frequent itemsets; and storing an indication of the identified groups of related digital images in a processor-accessible memory.
-
Specification