VOICE DIRECTED CONTEXT SENSITIVE VISUAL SEARCH
2 Assignments
0 Petitions
Accused Products
Abstract
Various technologies described herein pertain to voice directed context sensitive visual searching. Visual content can be rendered on a display, and a voice directed query related to the visual content can be received. Contextual information related to the visual content can also be identified. Moreover, a search word recognized from the voice directed query and/or the contextual information can be used to detect an object from the visual content, where the object can be a part of the visual content. Further, a search can be performed using the object detected from the visual content, and a result of the search can be rendered on the display.
17 Citations
40 Claims
-
1-20. -20. (canceled)
-
21. A method of searching, comprising:
-
receiving a voice directed query related to visual content rendered on a display, wherein the visual content is one of a frame from a video stream, a two-dimensional image, or a three-dimensional image; detecting an object from the visual content based on a search word from the voice directed query, wherein; detecting the object from the visual content further comprises performing image processing on the visual content to identify an image of the object from the visual content based on the search word from the voice directed query; the image of the object is a portion of the visual content and the visual content comprises a remainder of the visual content other than the image of the object; and an edge of the image of the object is not delineated in the visual content prior to the performing of the image processing on the visual content; using the image of the object identified from the visual content as an input for a reverse visual search, wherein the reverse visual search is executed based upon the image of the object identified from the visual content, and wherein the reverse visual search returns a result; and rendering the result of the reverse visual search on the display. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33)
-
-
34. A device, comprising:
-
a camera; a display; at least one processor; and memory that comprises computer-executable instructions that, when executed by the at least one processor, cause the at least one processor to perform acts including; capturing visual content using the camera, wherein the visual content is one of a frame from a video stream, a two-dimensional image, or a three-dimensional image; rendering the visual content on the display; receiving a voice directed query related to the visual content rendered on the display; detecting an object from the visual content based on a search word from the voice directed query, wherein; detecting the object from the visual content further comprises performing image processing on the visual content to identify an image of the object from the visual content based on the search word from the voice directed query; the image of the object is a portion of the visual content and the visual content comprises a remainder of the visual content other than the image of the object; and an edge of the image of the object is not delineated in the visual content prior to the performing of the image processing on the visual content; using the image of the object identified from the visual content as an input for a reverse visual search, wherein the reverse visual search is executed based upon the image of the object identified from the visual content, and wherein the reverse visual search returns a result; and rendering the result of the reverse visual search on the display. - View Dependent Claims (35, 36)
-
-
37. A system, comprising:
-
at least one processor; and memory that comprises computer-executable instructions that, when executed by the at least one processor, cause the at least one processor to perform acts including; rendering a video stream on a display; receiving a voice directed query related to the video stream rendered on the display; capturing a frame from the video stream in response to the voice directed query; detecting an object from the frame based on a search word from the voice directed query, wherein; detecting the object from the frame further comprises performing image processing on the frame to identify an image of the object from the frame based on the search word from the voice directed query; the image of the object is a portion of the frame and the frame comprises a remainder other than the image of the object; and an edge of the image of the object is not delineated in the frame prior to the performing of the image processing on the frame; using the image of the object identified from the frame as an input for a reverse visual search, wherein the reverse visual search is executed based upon the image of the object identified from the frame, and wherein the reverse visual search returns a result; and rendering the result of the reverse visual search on the display. - View Dependent Claims (38, 39, 40)
-
Specification