Natural language image spatial and tonal localization
First Claim
Patent Images
1. A method comprising:
- receiving a natural language input by one or more computing devices, the natural language input formed using arbitrary language;
parsing the natural language input by the one or more computing devices into action data containing arbitrary vocabulary, the arbitrary vocabulary comprising nouns and verbs referencing an action to be performed;
translating portions of the arbitrary vocabulary contained in the action data to constrained vocabulary data by the one or more computing devices and mapping non-translated portions of the arbitrary vocabulary contained in the action data to a generalized vocabulary specifying parameters for image processing;
determining spatial and tonal localizations of one or more image editing operations as specified by the generalized vocabulary and constrained vocabulary data by the one or more computing devices; and
initiating performance of the one or more image editing operations on image data using the determined spatial and tonal localization by the one or more computing devices.
2 Assignments
0 Petitions
Accused Products
Abstract
Natural language image spatial and tonal localization techniques are described. In one or more implementations, a natural language input is processed to determine spatial and tonal localization of one or more image editing operations specified by the natural language input. Performance is initiated of the one or more image editing operations on image data using the determined spatial and tonal localization.
-
Citations
20 Claims
-
1. A method comprising:
-
receiving a natural language input by one or more computing devices, the natural language input formed using arbitrary language; parsing the natural language input by the one or more computing devices into action data containing arbitrary vocabulary, the arbitrary vocabulary comprising nouns and verbs referencing an action to be performed; translating portions of the arbitrary vocabulary contained in the action data to constrained vocabulary data by the one or more computing devices and mapping non-translated portions of the arbitrary vocabulary contained in the action data to a generalized vocabulary specifying parameters for image processing; determining spatial and tonal localizations of one or more image editing operations as specified by the generalized vocabulary and constrained vocabulary data by the one or more computing devices; and initiating performance of the one or more image editing operations on image data using the determined spatial and tonal localization by the one or more computing devices. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A method comprising:
-
generating one or more localization masks by one or more computing devices based on a phrase in a natural language input, where the natural language input comprises arbitrary verbs and nouns, each of the one or more localization masks being a combination of a spatial localization mask and a tonal region localization mask, respectively; identifying one or more image editing operations that are included in the phrase by the one or more computing devices, where the one or more image editing operations are identified by using lexical ontologies and semantic distances to map the arbitrary verbs and nouns included in the phrase to the one or more specific image editing operations; and initiating performance of the one or more identified image editing operations on image data by the one or more computing devices using the generated one or more localization masks. - View Dependent Claims (15, 16, 17)
-
-
18. One or more computer-readable storage media comprising instructions stored thereon that, responsive to execution on a computing device, causes the computing device to perform operations comprising:
-
determining a strength as well as spatial and tonal localization of one or more image editing operations specified by a natural language input, where the natural language input comprises arbitrary nouns and verbs, the determining of the spatial and tonal localization based at least in part of identification of a direction and a modifier included in the natural language input, the one or more image editing operations specified identified by using lexical ontologies and semantic distances to map the arbitrary nouns and verbs in the natural language input to one or more specific image editing operations; and initiating performance of the one or more image editing operations on image data using the determined spatial and tonal localization and strength. - View Dependent Claims (19, 20)
-
Specification