Natural Language Image Tags
First Claim
Patent Images
1. A method comprising:
- defining at least a portion of an image displayed by a display device based on a gesture, the gesture identified from one or more touch inputs detected using touchscreen functionality of the display device;
locating text received in a natural language input; and
tagging the portion of the image using one or more items of the text received in the natural language input.
2 Assignments
0 Petitions
Accused Products
Abstract
Natural language image tags are described. In one or more implementations, at least a portion of an image displayed by a display device is defined based on a gesture. The gesture is identified from one or more touch inputs detected using touchscreen functionality of the display device. Text received in a natural language input is located and used to tag the portion of the image using one or more items of the text received in the natural language input.
-
Citations
20 Claims
-
1. A method comprising:
-
defining at least a portion of an image displayed by a display device based on a gesture, the gesture identified from one or more touch inputs detected using touchscreen functionality of the display device; locating text received in a natural language input; and tagging the portion of the image using one or more items of the text received in the natural language input. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method comprising:
-
receiving a natural language input converted from audio data using a speech-to-text engine; and responsive to a determination that the natural language input includes a tag and specifies one or more image editing operations; identifying at least a portion of an image that corresponds to the tag; and initiating performance of the one or more image editing operations on at least the portion of the image. - View Dependent Claims (11, 12, 13, 14)
-
-
15. A system comprising:
-
a speech-to-text engine configured to convert audio data captured by one or more audio-capture devices into a natural language input comprising text; a gesture module configured to recognize a gesture from one or more touch inputs detected using one or more touch sensors, the gesture involving an image displayed by a display device; an object identification module configured to identify one or more objects in the image including a boundary of the identified one or more objects, respectively; and a natural language processing module configured to; identify a name from the natural language input; initiate operation of the object identification module to identify at least one said object in the image that corresponds to the name; and tag the identified object in the image using the name such that a subsequent natural language input that includes the proper and specifies an operation is usable to initiate performance of the operation using the identified object. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification