Approach for near duplicate image detection
First Claim
1. A computer-implemented method for processing image data, the computer-implemented method comprising:
- generating, based upon first color image data, first color histogram descriptor data that indicates a number of occurrences in the first color image data of each of a plurality of colors;
generating first gray-scaled image data based upon the first color image data;
generating, based upon the first gray-scaled image data, first texture histogram descriptor data that indicates a number of occurrences in the first gray-scaled image data of each of a plurality of edge orientations;
generating, based upon second color image data, second color histogram descriptor data that indicates a number of occurrences in the second color image data of each of a plurality of colors;
generating second gray-scaled image data based upon the second color image data;
generating, based upon the second gray-scaled image data, second texture histogram descriptor data that indicates a number of occurrences in the second gray-scaled image data of each of a plurality of edge orientations; and
determining whether the first color image data is a near duplicate of the second color image data bycomparing the first color histogram descriptor data to the second color histogram descriptor data, andcomparing the first texture histogram descriptor data to the second texture histogram descriptor data;
wherein one or more of the steps are performed using one or more processors.
9 Assignments
0 Petitions
Accused Products
Abstract
A content-based approach is provided for detecting near duplicate images. The approach generally involves analyzing the content of images to be compared and generating color and texture histogram descriptor data for each image. The images may then be compared based upon the color and texture histogram descriptor data to determine whether the images are near duplicate images. Content-based image signature data may also be generated for each of the images based upon the color and texture histogram descriptor data. The image signature data may then be compared to determine whether the corresponding images are near duplicate images.
-
Citations
36 Claims
-
1. A computer-implemented method for processing image data, the computer-implemented method comprising:
-
generating, based upon first color image data, first color histogram descriptor data that indicates a number of occurrences in the first color image data of each of a plurality of colors; generating first gray-scaled image data based upon the first color image data; generating, based upon the first gray-scaled image data, first texture histogram descriptor data that indicates a number of occurrences in the first gray-scaled image data of each of a plurality of edge orientations; generating, based upon second color image data, second color histogram descriptor data that indicates a number of occurrences in the second color image data of each of a plurality of colors; generating second gray-scaled image data based upon the second color image data; generating, based upon the second gray-scaled image data, second texture histogram descriptor data that indicates a number of occurrences in the second gray-scaled image data of each of a plurality of edge orientations; and determining whether the first color image data is a near duplicate of the second color image data by comparing the first color histogram descriptor data to the second color histogram descriptor data, and comparing the first texture histogram descriptor data to the second texture histogram descriptor data; wherein one or more of the steps are performed using one or more processors. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer-implemented method for processing image data, the computer-implemented method comprising:
-
generating, based upon first color image data, first color histogram descriptor data that indicates a number of occurrences in the first color image data of each of a plurality of colors; generating first gray-scaled image data based upon the first color image data; generating, based upon the first gray-scaled image data, first texture histogram descriptor data that indicates a number of occurrences in the first gray-scaled image data of each of a plurality of edge orientations; generating, based upon second color image data, second color histogram descriptor data that indicates a number of occurrences in the second color image data of each of a plurality of colors; generating second gray-scaled image data based upon the second color image data; generating, based upon the second gray-scaled image data, second texture histogram descriptor data that indicates a number of occurrences in the second gray-scaled image data of each of a plurality of edge orientations; generating first signature data based upon the first color histogram data and the first texture histogram data; generating second signature data based upon the second color histogram data and the second texture histogram data; determining whether the first color image data is a near duplicate of the second color image data by comparing the first signature data to the second signature data; wherein one or more of the steps are performed using one or more processors. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A non-transitory computer-readable storage medium for processing image data, the non-transitory computer-readable medium storing instructions which, when processed by one or more processors, cause:
-
generating, based upon first color image data, first color histogram descriptor data that indicates a number of occurrences in the first color image data of each of a plurality of colors; generating first gray-scaled image data based upon the first color image data; generating, based upon the first gray-scaled image data, first texture histogram descriptor data that indicates a number of occurrences in the first gray-scaled image data of each of a plurality of edge orientations; generating, based upon second color image data, second color histogram descriptor data that indicates a number of occurrences in the second color image data of each of a plurality of colors; generating second gray-scaled image data based upon the second color image data; generating, based upon the second gray-scaled image data, second texture histogram descriptor data that indicates a number of occurrences in the second gray-scaled image data of each of a plurality of edge orientations; and determining whether the first color image data is a near duplicate of the second color image data by comparing the first color histogram descriptor data to the second color histogram descriptor data, and comparing the first texture histogram descriptor data to the second texture histogram descriptor data. - View Dependent Claims (14, 15, 16, 17, 18)
-
-
19. A non-transitory computer-readable storage medium for processing image data, the non-transitory computer-readable medium storing instructions which, when processed by one or more processors, cause:
-
generating, based upon first color image data, first color histogram descriptor data that indicates a number of occurrences in the first color image data of each of a plurality of colors; generating first gray-scaled image data based upon the first color image data; generating, based upon the first gray-scaled image data, first texture histogram descriptor data that indicates a number of occurrences in the first gray-scaled image data of each of a plurality of edge orientations; generating, based upon second color image data, second color histogram descriptor data that indicates a number of occurrences in the second color image data of each of a plurality of colors; generating second gray-scaled image data based upon the second color image data; generating, based upon the second gray-scaled image data, second texture histogram descriptor data that indicates a number of occurrences in the second gray-scaled image data of each of a plurality of edge orientations; generating first signature data based upon the first color histogram data and the first texture histogram data; generating second signature data based upon the second color histogram data and the second texture histogram data; determining whether the first color image data is a near duplicate of the second color image data by comparing the first signature data to the second signature data. - View Dependent Claims (20, 21, 22, 23, 24)
-
-
25. An apparatus for processing image data, the apparatus comprising a memory storing instructions which, when processed by one or more processors, cause:
-
generating, based upon first color image data, first color histogram descriptor data that indicates a number of occurrences in the first color image data of each of a plurality of colors; generating first gray-scaled image data based upon the first color image data; generating, based upon the first gray-scaled image data, first texture histogram descriptor data that indicates a number of occurrences in the first gray-scaled image data of each of a plurality of edge orientations; generating, based upon second color image data, second color histogram descriptor data that indicates a number of occurrences in the second color image data of each of a plurality of colors; generating second gray-scaled image data based upon the second color image data; generating, based upon the second gray-scaled image data, second texture histogram descriptor data that indicates a number of occurrences in the second gray-scaled image data of each of a plurality of edge orientations; and determining whether the first color image data is a near duplicate of the second color image data by comparing the first color histogram descriptor data to the second color histogram descriptor data, and comparing the first texture histogram descriptor data to the second texture histogram descriptor data. - View Dependent Claims (26, 27, 28, 29, 30)
-
-
31. An apparatus for processing image data, the apparatus comprising a memory storing instructions which, when processed by one or more processors, cause:
-
generating, based upon first color image data, first color histogram descriptor data that indicates a number of occurrences in the first color image data of each of a plurality of colors; generating first gray-scaled image data based upon the first color image data; generating, based upon the first gray-scaled image data, first texture histogram descriptor data that indicates a number of occurrences in the first gray-scaled image data of each of a plurality of edge orientations; generating, based upon second color image data, second color histogram descriptor data that indicates a number of occurrences in the second color image data of each of a plurality of colors; generating second gray-scaled image data based upon the second color image data; generating, based upon the second gray-scaled image data, second texture histogram descriptor data that indicates a number of occurrences in the second gray-scaled image data of each of a plurality of edge orientations; generating first signature data based upon the first color histogram data and the first texture histogram data; generating second signature data based upon the second color histogram data and the second texture histogram data; determining whether the first color image data is a near duplicate of the second color image data by comparing the first signature data to the second signature data. - View Dependent Claims (32, 33, 34, 35, 36)
-
Specification