Verbal queries relative to video content
First Claim
Patent Images
1. A non-transitory computer-readable medium embodying a program executable in at least one computing device, wherein when executed the program causes the at least one computing device to at least:
- receive a verbal query via a microphone associated with a user account;
perform natural language processing on the verbal query to determine a region of a video frame expressed in relative terms in the verbal query;
identify a portion of video content that is currently being presented via a display associated with the user account;
identify an item depicted in the portion of the video content at the region;
determine information about the item as an answer to the verbal query; and
cause the information about the item to be presented via an audio device associated with the user account using a speech synthesizer.
1 Assignment
0 Petitions
Accused Products
Abstract
Disclosed are various embodiments for processing verbal queries relative to video content. A verbal query that is associated with a portion of video content is received. The verbal query specifies a relative frame location. An item depicted in the portion of the video content at the relative frame location is identified. Information about the item is determined as an answer to the verbal query. Information about the item is then presented to a user.
-
Citations
20 Claims
-
1. A non-transitory computer-readable medium embodying a program executable in at least one computing device, wherein when executed the program causes the at least one computing device to at least:
-
receive a verbal query via a microphone associated with a user account; perform natural language processing on the verbal query to determine a region of a video frame expressed in relative terms in the verbal query; identify a portion of video content that is currently being presented via a display associated with the user account; identify an item depicted in the portion of the video content at the region; determine information about the item as an answer to the verbal query; and cause the information about the item to be presented via an audio device associated with the user account using a speech synthesizer. - View Dependent Claims (2, 3)
-
-
4. A system, comprising:
-
at least one computing device; and at least one application executed in the at least one computing device, wherein when executed the at least one application causes the at least one computing device to at least; receive a verbal query associated with a user account, the verbal query specifying a region of a video frame expressed in relative terms; identify a portion of video content that is currently being presented via the user account; identify an item depicted in the portion of the video content at the region; determine information about the item as an answer to the verbal query; and cause the information about the item to be presented. - View Dependent Claims (5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method, comprising:
-
receiving, via at least one computing device, a verbal query associated with a portion of video content, the verbal query specifying a region in a video frame expressed in relative terms; identifying, via the at least one computing device, an item depicted in the portion of the video content at the region; determining, via the at least one computing device, information about the item as an answer to the verbal query; and causing, via the at least one computing device, the information about the item to be presented to a user. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification