Autocaptioning of images
First Claim
Patent Images
1. A system, comprising:
- a set of information modules, individual information modules configured to operate on an image or metadata associated with the image, the set of information modules including;
a scene analysis module configured to identify a scenario of the image, the scenario involving a human and a non-human object, anda proxemics module configured to receive the scenario identified by the scene analysis module and utilize the scenario to identify a relative relationship between the human and the non-human object; and
,a set of sentence generation modules, individual sentence generation modules configured to produce a sentence caption for the image that reflects the scenario identified by the scene analysis module and the relative relationship between the human and the non-human object identified by the proxemics module; and
,a processing device that executes computer-executable instructions associated with at least the set of sentence generation modules.
3 Assignments
0 Petitions
Accused Products
Abstract
The description relates to sentence autocaptioning of images. One example can include a set of information modules and a set of sentence generation modules. The set of information modules can include individual information modules configured to operate on an image or metadata associated with the image to produce image information. The set of sentence generation modules can include individual sentence generation modules configured to operate on the image information to produce a sentence caption for the image.
46 Citations
20 Claims
-
1. A system, comprising:
-
a set of information modules, individual information modules configured to operate on an image or metadata associated with the image, the set of information modules including; a scene analysis module configured to identify a scenario of the image, the scenario involving a human and a non-human object, and a proxemics module configured to receive the scenario identified by the scene analysis module and utilize the scenario to identify a relative relationship between the human and the non-human object; and
,a set of sentence generation modules, individual sentence generation modules configured to produce a sentence caption for the image that reflects the scenario identified by the scene analysis module and the relative relationship between the human and the non-human object identified by the proxemics module; and
,a processing device that executes computer-executable instructions associated with at least the set of sentence generation modules. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-readable storage media having instructions stored thereon that when executed by a computing device cause the computing device to perform acts, comprising:
-
obtaining an image comprising image data and associated metadata; producing information about the image using the image data and the associated metadata; receive a label from a user, the label corresponding to an individual non-human element that is visible in the image; automatically generating multiple sentence captions or sentence fragment captions for the image from the information and the label of the corresponding individual non-human element in the image; presenting a display of the multiple sentence captions or the sentence fragment captions for the user; and
,utilizing a user selection of an individual sentence caption or sentence fragment caption to automatically generate a subsequent sentence caption for a subsequent image. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A computing device, comprising:
-
an image sensor configured to capture an image comprising image data; a processor configured to associate metadata with the image; an information fuser configured to; determine weighted reliabilities of portions of the metadata, the weighted reliabilities being particular to the image, and filter the metadata for the image based on the weighted reliabilities that are particular to the image; a set of sentence generation modules configured to generate sentences for the image from at least some of the image data and the filtered metadata; an evaluator configured to evaluate the sentences generated by the set of sentence generation modules and to select an individual sentence as a sentence caption for the image; and
,a display configured to present the image and the sentence caption. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
Specification