DETECTING RECURRING THEMES IN CONSUMER IMAGE COLLECTIONS
First Claim
1. A method comprising:
- storing, in a computing device, feature descriptors for each image in a first plurality of images, wherein the feature descriptors are related to image content or image capture conditions;
identifying, by the computing device, a plurality of itemsets, wherein each itemset of the plurality of itemsets comprises a set of co-occurring feature descriptors that occurs in a respective subset of images in the first plurality of images,associating, by the computing device, each itemset with the respective subset of images in which the itemset occurs;
determining, by the computing device, a quality score for each itemset, wherein the quality score is related to a probability that the itemset occurs in a general population of images based on a probability distribution determined from an analysis of a second plurality of images;
identifying, by the computing device, a first itemset based on the determined quality score; and
storing, by the computing device, an indication of a respective subset of images associated with the identified first itemset.
2 Assignments
0 Petitions
Accused Products
Abstract
A method of identifying groups of related digital images in a digital image collection, comprising: analyzing each of the digital images to generate associated feature descriptors related to image content or image capture conditions; storing the feature descriptors associated with the digital images in a metadata database; automatically analyzing the metadata database to identify a plurality of frequent itemsets, wherein each of the frequent itemsets is a co-occurring feature descriptor group that occurs in at least a predefined fraction of the digital images; determining a probability of occurrence for each the identified frequent itemsets; determining a quality score for each of the identified frequent itemsets responsive to the determined probability of occurrence; ranking the frequent itemsets based at least on the determined quality scores; and identifying one or more groups of related digital images corresponding to one or more of the top ranked frequent itemsets.
-
Citations
20 Claims
-
1. A method comprising:
-
storing, in a computing device, feature descriptors for each image in a first plurality of images, wherein the feature descriptors are related to image content or image capture conditions; identifying, by the computing device, a plurality of itemsets, wherein each itemset of the plurality of itemsets comprises a set of co-occurring feature descriptors that occurs in a respective subset of images in the first plurality of images, associating, by the computing device, each itemset with the respective subset of images in which the itemset occurs; determining, by the computing device, a quality score for each itemset, wherein the quality score is related to a probability that the itemset occurs in a general population of images based on a probability distribution determined from an analysis of a second plurality of images; identifying, by the computing device, a first itemset based on the determined quality score; and storing, by the computing device, an indication of a respective subset of images associated with the identified first itemset. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A computing system comprising:
-
a memory configured to store feature descriptors for each image in a first plurality of images, wherein the feature descriptors are related to image content or image capture conditions; and a processing system configured to; identify a plurality of itemsets, wherein each itemset of the plurality of itemsets comprises a set of co-occurring feature descriptors that occurs in a respective subset of images in the first plurality of images, associate each itemset with the respective subset of images in which the itemset occurs; determine a quality score for each respective itemset, wherein the quality score is related to a probability that the itemset occurs in a general population of images based on a probability distribution determined from an analysis of a second plurality of images; identify a first itemset based on the determined quality score; and store, in the memory, an indication of a respective subset of images associated with the identified first itemset. - View Dependent Claims (18, 19)
-
-
20. A non-transitory computer readable medium having stored thereon instructions executable by a computing device to cause the computing device to perform functions, the functions comprising:
-
storing feature descriptors for each image in a first plurality of images, wherein the feature descriptors are related to image content or image capture conditions; identifying a plurality of itemsets, wherein each of the identified itemsets is a set of co-occurring feature descriptors that occurs in a respective subset of images in the first plurality of images, associating each itemset with the respective subset of images in which the itemset occurs; determining a quality score for each respective itemset, wherein the quality score is related to a probability that the itemset occurs in a general population of images based on a probability distribution determined from an analysis of a second plurality of images; identifying a first itemset based on the determined quality score; and storing an indication of a respective subset of images associated with the identified first itemset.
-
Specification