Natural language image tags
Abstract
Natural language image tags are described. In one or more implementations, at least a portion of an image displayed by a display device is defined based on a gesture. The gesture is identified from one or more touch inputs detected using touchscreen functionality of the display device. Text received in a natural language input is located and used to tag the portion of the image using one or more items of the text received in the natural language input.
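The tagging flow summarized in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the `Region`, `TaggedImage`, `region_from_touches`, and `locate_items` names are hypothetical, the gesture is reduced to a bounding box around its touch points, and "locating items" is approximated by dropping a small hand-picked stop-word list.

```python
from dataclasses import dataclass, field

# Hypothetical stop-word list; the source does not say how items are located.
STOP_WORDS = {"this", "that", "is", "a", "an", "the", "my", "here", "it"}


@dataclass(frozen=True)
class Region:
    """Axis-aligned box in pixel coordinates."""
    left: int
    top: int
    right: int
    bottom: int


def region_from_touches(points):
    """Bound the touch points of a gesture (e.g. a circling motion) with a box."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return Region(min(xs), min(ys), max(xs), max(ys))


def locate_items(text):
    """Return the lower-cased words of `text` that are not stop words."""
    words = (w.strip(".,!?").lower() for w in text.split())
    return [w for w in words if w and w not in STOP_WORDS]


@dataclass
class TaggedImage:
    width: int
    height: int
    tags: dict = field(default_factory=dict)  # item text -> Region

    def tag(self, region, items):
        """Associate every located item with the gesture-defined region."""
        for item in items:
            self.tags[item] = region


# Example: the user circles part of the image and says "This is Dad."
image = TaggedImage(width=640, height=480)
touches = [(120, 80), (200, 60), (260, 140), (180, 210), (110, 150)]
region = region_from_touches(touches)
image.tag(region, locate_items("This is Dad."))
```

Storing the tag as a mapping from item text to region is one simple way to make the tagged portion identifiable from the rest of the image; a real system would likely store a mask or object boundary rather than a box.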
20 Claims
1. A method comprising:
displaying an image by a display device;
defining at least a portion of the image displayed based on a gesture, the gesture identified from one or more touch inputs detected using touchscreen functionality of the display device;
receiving a processed natural language input subsequent to displaying the image, the processed natural language input processed from audio data that is based at least on a speech input from a user;
locating one or more items in text received in the processed natural language input;
tagging the portion of the image defined by the gesture with the one or more items of the text received in the processed natural language input, the tag effective to enable identification of the portion from an entirety of the image; and
editing the portion of the image defined by the gesture and the processed natural language input.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A method comprising:
receiving a processed natural language input converted from audio data using a speech-to-text engine, the processed natural language input processed from the audio data, the audio data based on at least a speech input from a user; and
responsive to a determination that the processed natural language input includes a tag corresponding to a portion of an image, the tag effective to enable identification of the portion from an entirety of the image, and specifies one or more image editing operations:
identifying the portion of the image that corresponds to the tag; and
initiating performance of the one or more image editing operations on the portion of the image based on the tag and the processed natural language input.
Dependent claims: 11, 12, 13, 14
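The determination step in claim 10 (does the spoken input both name an existing tag and specify an editing operation?) might be sketched as below. This is an illustrative assumption, not the claimed method: the `TAGS` store, the `OPERATIONS` vocabulary, and the `parse_command` and `initiate_edits` helpers are hypothetical, and matching is done by simple word lookup on text assumed to come from a speech-to-text engine.

```python
# Hypothetical tag store: tag text -> (left, top, right, bottom) in pixels.
TAGS = {"dad": (110, 60, 260, 210), "sky": (0, 0, 640, 120)}

# Hypothetical vocabulary mapping spoken verbs to editing operation names.
OPERATIONS = {
    "brighten": "brightness+",
    "darken": "brightness-",
    "blur": "blur",
    "sharpen": "sharpen",
}


def parse_command(text, tags=TAGS, operations=OPERATIONS):
    """Return the tagged region and requested operations, or None.

    None means the input did not both include a known tag and specify
    at least one editing operation, so nothing should be initiated.
    """
    words = [w.strip(".,!?").lower() for w in text.split()]
    found_tags = [w for w in words if w in tags]
    found_ops = [operations[w] for w in words if w in operations]
    if not found_tags or not found_ops:
        return None
    tag = found_tags[0]
    return {"tag": tag, "region": tags[tag], "operations": found_ops}


def initiate_edits(command, apply_edit):
    """Call `apply_edit(operation, region)` once per requested operation."""
    for operation in command["operations"]:
        apply_edit(operation, command["region"])


# Example: text produced from the utterance "Brighten Dad a little."
performed = []
command = parse_command("Brighten Dad a little.")
if command is not None:
    initiate_edits(command, lambda op, region: performed.append((op, region)))
```

Passing `apply_edit` in as a callback keeps the sketch independent of any particular image-editing library.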
15. A system comprising:
a speech-to-text engine configured to convert audio data captured by one or more audio-capture devices into a processed natural language input comprising text, the processed natural language input processed from the audio data, the audio data based on at least a speech input from a user;
a gesture module configured to recognize a gesture from one or more touch inputs detected using one or more touch sensors, the gesture involving a portion of an image displayed by a display device, the portion comprising less than an entirety of the image;
an object identification module configured to identify one or more objects in the image corresponding to the portion including a boundary of the identified one or more objects, respectively; and
a natural language processing module configured to:
identify a name from the processed natural language input;
initiate operation of the object identification module to identify at least one said object in the image corresponding to the portion that corresponds to the name; and
tag the identified object in the image corresponding to the portion using the name such that a subsequent processed natural language input that includes the name and specifies an editing operation is usable to initiate performance of the editing operation using the identified object corresponding to the portion, the tag effective to enable identification of the portion from the entirety of the image for the editing operation, the editing operation performed on the portion of the image based on the tag and the subsequent processed natural language input.
Dependent claims: 16, 17, 18, 19, 20
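One way the modules of claim 15 could cooperate is for the gesture to select which identified object receives the name. The sketch below assumes, purely for illustration, that the object identification module returns bounding boxes and that the object overlapping the gestured portion the most is the one being named; `overlap_area`, `object_for_gesture`, and `tag_object` are hypothetical helpers, and the claim itself does not prescribe this selection rule.

```python
def overlap_area(a, b):
    """Area of the intersection of two (left, top, right, bottom) boxes."""
    width = min(a[2], b[2]) - max(a[0], b[0])
    height = min(a[3], b[3]) - max(a[1], b[1])
    return max(width, 0) * max(height, 0)


def object_for_gesture(gesture_box, object_boxes):
    """Return the index of the object box overlapping the gesture most, or None."""
    best_index, best_area = None, 0
    for index, box in enumerate(object_boxes):
        area = overlap_area(gesture_box, box)
        if area > best_area:
            best_index, best_area = index, area
    return best_index


def tag_object(name, gesture_box, object_boxes, tags):
    """Tag the gestured object's boundary with `name`; return its index or None."""
    index = object_for_gesture(gesture_box, object_boxes)
    if index is not None:
        tags[name.lower()] = object_boxes[index]
    return index


# Example: two detected objects; the gesture roughly covers the first one.
detected = [(100, 50, 270, 400), (300, 40, 420, 380)]
gesture = (110, 60, 260, 210)
tags = {}
chosen = tag_object("Dad", gesture, detected, tags)
```

Because the tag stores the object's boundary rather than the rough gesture box, a later command that includes the name can act on the whole identified object.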
Specification