Voice-Based Image Tagging and Searching
Abstract
An electronic device with one or more processors and memory provides a digital photograph of a real-world scene. The electronic device provides a natural language text string corresponding to a speech input associated with the digital photograph. The electronic device performs natural language processing on the text string to identify one or more terms associated with an entity, an activity, or a location. The electronic device tags the digital photograph with the one or more terms and their associated entity, activity, or location.
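The flow described in the abstract can be illustrated with a minimal sketch. It assumes the speech input has already been transcribed to a text string, and it substitutes a toy keyword lexicon for real natural language processing. The names `LEXICON`, `Photo`, `identify_terms`, and `tag_photo` are hypothetical and do not come from the patent.

```python
import re
from dataclasses import dataclass, field

# Hypothetical toy lexicon standing in for a real natural language processor.
LEXICON = {
    "entity": {"mom", "dad", "dog"},
    "activity": {"hiking", "swimming", "birthday"},
    "location": {"paris", "beach", "yosemite"},
}


@dataclass
class Photo:
    """A digital photograph plus its tags (term -> entity/activity/location)."""
    path: str
    tags: dict = field(default_factory=dict)


def identify_terms(text):
    """Return (term, category) pairs for lexicon words found in the text string."""
    words = re.findall(r"[a-z']+", text.lower())
    found = []
    for word in words:
        for category, vocabulary in LEXICON.items():
            if word in vocabulary:
                found.append((word, category))
    return found


def tag_photo(photo, text):
    """Tag the photo with each identified term and its associated category."""
    for term, category in identify_terms(text):
        photo.tags[term] = category
    return photo


# The text string is assumed to be the transcription of the user's speech input.
photo = tag_photo(Photo("IMG_0001.jpg"), "This is me and Mom hiking in Yosemite")
print(photo.tags)
# {'mom': 'entity', 'hiking': 'activity', 'yosemite': 'location'}
```

Storing the category alongside each term mirrors the claim language, which tags the photograph with both the terms and their associated entity, activity, or location. A practical system would likely replace the lexicon lookup with a trained language model.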
27 Claims
1. A method for tagging or searching images using a voice-based digital assistant, comprising:
at an electronic device with a processor and memory storing instructions for execution by the processor:
providing a digital photograph of a real-world scene;
providing a natural language text string corresponding to a speech input associated with the digital photograph;
performing natural language processing on the text string to identify one or more terms associated with an entity, an activity, or a location; and
tagging the digital photograph with the one or more terms and their associated entity, activity, or location.
Dependent claims: 2-25.
26. A computer system, comprising:
one or more processors; and
memory storing one or more programs for execution by the one or more processors, the one or more programs including instructions for:
providing a digital photograph of a real-world scene;
providing a natural language text string corresponding to a speech input associated with the digital photograph;
performing natural language processing on the text string to identify one or more terms associated with an entity, an activity, or a location; and
tagging the digital photograph with the one or more terms and their associated entity, activity, or location.
27. A non-transitory computer readable storage medium storing one or more programs configured for execution by an electronic device, the one or more programs comprising instructions for:
providing a digital photograph of a real-world scene;
providing a natural language text string corresponding to a speech input associated with the digital photograph;
performing natural language processing on the text string to identify one or more terms associated with an entity, an activity, or a location; and
tagging the digital photograph with the one or more terms and their associated entity, activity, or location.
Specification