Event-based speech interactive media player
First Claim
1. A method, comprising:
- playing first media content from a content provider to a user, wherein the first media content embeds speech interactive content that includes a prompt that instructs the user to respond with a predefined action phrase;
activating a microphone at a user device of the user in response to detecting a wakeup word spoken by the user to detect voice commands spoken by the user;
while the microphone is activated, determining whether a voice command spoken by the user is received at the user device;
in response to determining that the voice command is received, transmitting the voice command from the user device to a server configured to analyze the voice command to generate instructions for responding to the voice command;
receiving the instructions from the server; and
performing, at the user device, an action corresponding to the voice command based on the instructions received from the server.
0 Assignments
0 Petitions
Accused Products
Abstract
Interactive content containing audio or video may be provided in conjunction with non-interactive content containing audio or video to enhance user engagement and interest with the contents and to increase the effectiveness of the distributed information. Interactive content may be directly inserted into the existing, non-interactive content. Additionally or alternatively, interactive content may be streamed in parallel to the existing content, with minimal modification to the existing content. For example, the server may monitor content from a content provider; detect an event (e.g., a marker embedded in the content stream, or in a data source external to the content stream); upon detecting the event, play interactive content at a designated time while silencing the content stream of the content provider (e.g., by muting, pausing, playing silence.) The marker may be a sub-audible tone or metadata associated with the content stream. The user may respond to the interactive content by voice.
91 Citations
18 Claims
-
1. A method, comprising:
-
playing first media content from a content provider to a user, wherein the first media content embeds speech interactive content that includes a prompt that instructs the user to respond with a predefined action phrase; activating a microphone at a user device of the user in response to detecting a wakeup word spoken by the user to detect voice commands spoken by the user; while the microphone is activated, determining whether a voice command spoken by the user is received at the user device; in response to determining that the voice command is received, transmitting the voice command from the user device to a server configured to analyze the voice command to generate instructions for responding to the voice command; receiving the instructions from the server; and performing, at the user device, an action corresponding to the voice command based on the instructions received from the server. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system comprising:
-
one or more processors; and a memory storing one or more computer programs that include computer instructions, which when executed by the one or more processors, cause the one or more processors to; play first media content from a content provider to a user, wherein the first media content embeds speech interactive content that includes a prompt that instructs the user to respond with a predefined action phrase; activate a microphone at a user device of the user in response to detecting a wakeup word spoken by the user to detect voice commands spoken by the user; while the microphone is activated, determine whether a voice command spoken by the user is received at the user device; in response to determining that the voice command is received, transmit the voice command from the user device to a server configured to analyze the voice command to generate instructions for responding to the voice command; receive the instructions from the server; and perform, at the user device, an action corresponding to the voice command based on the instructions received from the server. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification