Statistical approach to large-scale image annotation
Abstract
Statistical approaches to large-scale image annotation are described. Generally, the annotation technique includes compiling visual features and textual information from a number of images, hashing the images' visual features, and clustering the images based on their hash values. An example system builds statistical language models from the clustered images and annotates the image by applying one of the statistical language models.
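The hashing and clustering steps named in the abstract can be illustrated with a minimal sketch. The abstract does not fix a particular hash function, so the scheme below (one bit per feature dimension, set when the value exceeds that dimension's mean) is an assumption made for illustration, as are the function names and the toy feature values.

```python
from collections import defaultdict

def hash_features(vectors):
    """Turn each feature vector into a bit string.

    Assumed scheme: bit d is 1 when the value exceeds the mean of
    dimension d across all vectors. The source does not specify this.
    """
    dims = len(vectors[0])
    means = [sum(v[d] for v in vectors) / len(vectors) for d in range(dims)]
    return ["".join("1" if v[d] > means[d] else "0" for d in range(dims))
            for v in vectors]

def cluster_by_hash(image_ids, codes):
    """Group images that share the same hash code."""
    clusters = defaultdict(list)
    for image_id, code in zip(image_ids, codes):
        clusters[code].append(image_id)
    return dict(clusters)

# Hypothetical three-dimensional visual features for four images.
features = {
    "beach1": [0.9, 0.1, 0.8],
    "beach2": [0.8, 0.2, 0.9],
    "forest1": [0.1, 0.9, 0.2],
    "forest2": [0.2, 0.8, 0.1],
}
codes = hash_features(list(features.values()))
clusters = cluster_by_hash(list(features), codes)
print(clusters)
```

Because images are grouped by exact hash code, clustering takes a single pass over the collection, which is one plausible reason a hash-based approach suits large image sets.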
20 Claims
1. A method of annotating a personal image comprising:
- compiling visual features and textual information from a plurality of images;
- hashing the visual features;
- clustering the plurality of images based at least in part on a hash value, the clustering creating clustered images;
- building one or more statistical language models based at least in part on the clustered images; and
- annotating the personal image by selecting words with a maximum joint probability between the personal image and the clustered images.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 20
9. A computer readable storage device comprising computer executable instructions that when executed by one or more processors cause one or more computing devices to perform a method comprising:
- compiling visual information and textual information from a plurality of images;
- extracting the visual information from the plurality of images by using a gray block methodology;
- reducing the visual information by employing a projection matrix;
- hashing the reduced visual information;
- clustering the plurality of images based at least in part on a hash value to create image clusters;
- building one or more statistical language models based at least in part on the image clusters; and
- annotating a personal image by selecting words with a maximum joint probability with the personal image.
Dependent claims: 10, 11, 12, 13, 14
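The extract, reduce, and hash steps recited in claim 9 can be sketched as follows. The claim names a gray block methodology and a projection matrix without giving parameters, so the grid size, the matrix values, and the thresholds here are all hypothetical, and the helper names are illustrative only. The sketch assumes a gray block feature is the average luminance of each cell in a regular grid.

```python
def gray_block_features(pixels, blocks_per_side):
    """Average luminance of each cell in a blocks_per_side x blocks_per_side grid.

    pixels is a 2D list of luminance values. Rows and columns left over
    when the size is not divisible by blocks_per_side are ignored.
    """
    block_h = len(pixels) // blocks_per_side
    block_w = len(pixels[0]) // blocks_per_side
    features = []
    for by in range(blocks_per_side):
        for bx in range(blocks_per_side):
            block = [pixels[y][x]
                     for y in range(by * block_h, (by + 1) * block_h)
                     for x in range(bx * block_w, (bx + 1) * block_w)]
            features.append(sum(block) / len(block))
    return features

def project(features, matrix):
    """Reduce dimensionality: one output value per row of the projection matrix."""
    return [sum(m * f for m, f in zip(row, features)) for row in matrix]

def hash_bits(reduced, thresholds):
    """One bit per reduced dimension: 1 when the value exceeds its threshold."""
    return "".join("1" if r > t else "0" for r, t in zip(reduced, thresholds))

# Hypothetical 4x4 grayscale image, split into a 2x2 grid of blocks.
pixels = [
    [10, 10, 200, 200],
    [10, 10, 200, 200],
    [50, 50, 90, 90],
    [50, 50, 90, 90],
]
features = gray_block_features(pixels, blocks_per_side=2)  # 4 values
matrix = [[0.5, 0.5, 0.0, 0.0],   # assumed projection: mean of the top blocks
          [0.0, 0.0, 0.5, 0.5]]   # and mean of the bottom blocks
reduced = project(features, matrix)                        # 2 values
code = hash_bits(reduced, thresholds=[100, 100])
print(features, reduced, code)
```

In practice a projection matrix would typically be learned from the image collection (for example by principal component analysis) rather than written by hand; the hand-written matrix above only keeps the arithmetic easy to follow.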
15. A computer readable storage device comprising:
- a personal image; and
- a textual annotation associated with the personal image, the textual annotation being associated with the personal image by:
  - compiling visual features and textual information from a plurality of images;
  - extracting the visual features from the plurality of images by using a gray block methodology;
  - hashing the visual features to generate a hash value;
  - clustering the plurality of images based at least in part on the hash value;
  - building one or more statistical language models based at least in part on the clustered images; and
  - associating the textual annotation with the personal image by selecting words with a maximum joint probability between the personal image and the clustered images.
Dependent claims: 16, 17, 18, 19
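The final steps shared by the independent claims, building statistical language models from the clusters and selecting words with maximum joint probability, can be sketched under stated assumptions. The claims do not define the model or the probability terms, so this sketch assumes a unigram model per cluster and scores each word as the sum over clusters of P(word | cluster) x P(image | cluster) x P(cluster). The image likelihood (fraction of matching hash bits) and the cluster prior (share of images) are illustrative choices, not taken from the source.

```python
from collections import Counter

def build_language_model(texts):
    """Unigram model: relative frequency of each word in a cluster's text."""
    counts = Counter(word for text in texts for word in text.lower().split())
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}

def annotate(image_code, clusters, top_k=1):
    """Return the top_k words ranked by the assumed joint probability.

    clusters maps a hash code to the list of texts (one per image)
    attached to the images in that cluster.
    """
    total_images = sum(len(texts) for texts in clusters.values())
    scores = Counter()
    for code, texts in clusters.items():
        prior = len(texts) / total_images  # assumed P(cluster)
        # Assumed P(image | cluster): fraction of hash bits that agree.
        likelihood = sum(a == b for a, b in zip(image_code, code)) / len(code)
        for word, p_word in build_language_model(texts).items():
            scores[word] += p_word * likelihood * prior
    return [word for word, _ in scores.most_common(top_k)]

# Hypothetical clusters keyed by hash code, with text from each image.
clusters = {
    "101": ["sunny beach", "beach sand"],
    "010": ["green forest", "forest trail"],
}
print(annotate("101", clusters, top_k=1))
```

A personal image whose hash code matches the first cluster draws its annotation from that cluster's most frequent word, which is the behaviour the claim language suggests; a smoothed model or a different likelihood would slot into the same structure.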
Specification