Estimating word correlations from images
First Claim
1. A system for estimating word correlations, comprising:
- a processor; and
a computer readable storage media having stored thereupon a plurality of instructions that, when executed by the processor, cause the processor to perform acts comprising;
providing a first image representation set having a first plurality of images representing a first word in response to a first search based at least in part on the first word and a second image representation set having a second plurality of images representing a second word in response to a second search based at least in part on the second word;
extracting, from at least one image of the first image representation set and from at least one image of the second image representation set, multi-modal visual features; and
estimating a correlation between the first word and the second word at least partially based on a calculation of a mathematical function having at least the extracted multi-modal visual features corresponding to the first image representation set and at least the extracted multi-modal visual features corresponding to the second image representation set as variables thereto, wherein estimating the correlation includes;
selecting one of a first correlation function for a calculation of an inter-set visual aspect correlation based on multiple images of the first image representation set and multiple images of the second image representation set and a second correlation function for a calculation of an intra-set visual aspect correlation based on multiple images within one of the first image representation set and the second image representation set; and
calculating the selected correlation function.
2 Assignments
0 Petitions
Accused Products
Abstract
Word correlations are estimated using a content-based method, which uses visual features of image representations of the words. The image representations of the subject words may be generated by retrieving images from data sources (such as the Internet) using image search with the subject words as query words. One aspect of the techniques is based on calculating the visual distance or visual similarity between the sets of retrieved images corresponding to each query word. The other is based on calculating the visual consistence among the set of the retrieved images corresponding to a conjunctive query word. The combination of the content-based method and a text-based method may produce even better result.
-
Citations
21 Claims
-
1. A system for estimating word correlations, comprising:
-
a processor; and a computer readable storage media having stored thereupon a plurality of instructions that, when executed by the processor, cause the processor to perform acts comprising; providing a first image representation set having a first plurality of images representing a first word in response to a first search based at least in part on the first word and a second image representation set having a second plurality of images representing a second word in response to a second search based at least in part on the second word; extracting, from at least one image of the first image representation set and from at least one image of the second image representation set, multi-modal visual features; and estimating a correlation between the first word and the second word at least partially based on a calculation of a mathematical function having at least the extracted multi-modal visual features corresponding to the first image representation set and at least the extracted multi-modal visual features corresponding to the second image representation set as variables thereto, wherein estimating the correlation includes; selecting one of a first correlation function for a calculation of an inter-set visual aspect correlation based on multiple images of the first image representation set and multiple images of the second image representation set and a second correlation function for a calculation of an intra-set visual aspect correlation based on multiple images within one of the first image representation set and the second image representation set; and calculating the selected correlation function. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system for estimating word correlations, comprising:
-
a processor; and a computer readable storage media having stored thereupon a plurality of instructions that, when executed by the processor, cause the processor to perform acts comprising; providing a first image representation set having a first plurality of images representing a first word in response to a first search based at least in part on the first word and a second image representation set having a second plurality of images representing a second word in response to a second search based at least in part on the second word; extracting, from at least one image of the first image representation set and from at least one image of the second image representation set, multi-modal visual features; and estimating a correlation between the first word and the second word at least partially based on a calculation of a mathematical function having at least the extracted multi-modal visual features corresponding to the first image representation set and at least the extracted multi-modal visual features corresponding to the second image representation set as variables thereto, wherein estimating the correlation between the first word and the second word includes; calculating a content correlation between the multi-modal visual features of the image representations of the first word and the second word; calculating a text correlation between the first word and the second word using a text-based method; and estimating the correlation between the first word and the second word by combining the content correlation and the text correlation. - View Dependent Claims (11, 12, 13)
-
-
14. A method for estimating word correlations, the method comprising:
-
providing a computing device with a conjunctive image representation of a first word and a second word, the conjunctive image representation comprising a first set of multiple images; providing the computing device with a first image representation of the first word and a second image representation of the second word, the first image representation of the first word comprising a second set of multiple images, and the second image representation of the second word comprising a third set of multiple images; and estimating, by the computing device, a correlation between the first word and second word at least partially based on a calculation of visual features calculated from the first set of multiple images comprising the conjunctive image representation of the first word and the second word, the second set of multiple images comprising the first image representation of the first word and the third set of multiple images comprising the second image representation of the second word. - View Dependent Claims (15, 16, 17, 18)
-
-
19. One or more computer readable storage media device having stored thereupon a plurality of instructions that, when executed by a processor, causes the processor to:
-
provide a first image representation of a first word in response to a first search based at least in part on the first word and a second image representation of a second word in response to a second search based at least in part on the second word, the first image representation comprising a first set of images having a first plurality of images, the second image representation comprising a second set of images having a second plurality of images; select one of a first correlation function for a calculation of an inter-set visual aspect correlation based on multiple images of the first set of images and multiple images of the second set of images or a second correlation function for a calculation of an intra-set visual aspect correlation based on multiple images within one of the first set of images and the second set of images; calculate the selected correlation function; and estimate a correlation between the first word and the second word at least partially based on the calculation of the selected correlation function. - View Dependent Claims (20, 21)
-
Specification