Event-based speech interactive media player
First Claim
1. A device comprising:
- one or more processors;
a memory storing one or more computer programs that include computer instructions, which when executed by the one or more processors, cause the device to;
monitor media content streamed from a content provider to a user, the media content comprising both non-interactive content and interactive content in at least an audio or video format;
detect a first event in the media content streamed to the user, the first event comprising information associated with activating a speech recognizer to detect voice commands spoken by the user;
in response to detecting the first event in the media content streamed to the user, activate the speech recognizer;
while the speech recognizer is activated, receive a voice command spoken by the user and detected by the speech recognizer;
in accordance with a determination that the voice command spoken by the user is an action phrase, cause an action associated with the action phrase to be performed; and
in accordance with a determination that the voice command spoken by the user is not the action phrase, cause the media content to continue to be played.
1 Assignment
0 Petitions
Accused Products
Abstract
Interactive content containing audio or video may be provided in conjunction with non-interactive content containing audio or video to enhance user engagement and interest with the contents and to increase the effectiveness of the distributed information. Interactive content may be directly inserted into the existing, non-interactive content. Additionally or alternatively, interactive content may be streamed in parallel to the existing content, with only minimal modification to the existing content. For example, the server may monitor content from a content provider; detect an event (e.g., detecting a marker embedded in the content stream, or in a data source external to the content stream); upon detection of the event, play interactive content at one or more designated times while silencing the content stream of the content provider (e.g., by muting, pausing, playing silence.) The marker may be a sub-audible tone or metadata associated with the content stream. The user may respond to the interactive content by voice, touch or gesture.
69 Citations
41 Claims
-
1. A device comprising:
-
one or more processors; a memory storing one or more computer programs that include computer instructions, which when executed by the one or more processors, cause the device to; monitor media content streamed from a content provider to a user, the media content comprising both non-interactive content and interactive content in at least an audio or video format; detect a first event in the media content streamed to the user, the first event comprising information associated with activating a speech recognizer to detect voice commands spoken by the user; in response to detecting the first event in the media content streamed to the user, activate the speech recognizer; while the speech recognizer is activated, receive a voice command spoken by the user and detected by the speech recognizer; in accordance with a determination that the voice command spoken by the user is an action phrase, cause an action associated with the action phrase to be performed; and in accordance with a determination that the voice command spoken by the user is not the action phrase, cause the media content to continue to be played. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method for providing speech interactive content, the method comprising:
-
monitoring media content streamed from a content provider to a user, the media content comprising both non-interactive content and interactive content in at least an audio or video format; detecting a first event in the media content streamed to the user, the first event comprising information associated with activating a speech recognizer to detect voice commands spoken by the user; in response to detecting the first event in the media content streamed to the user, activating the speech recognizer; while the speech recognizer is activated, receive a voice command spoken by the user and detected by the speech recognizer; in accordance with a determination that the voice command spoken by the user is an action phrase, cause an action associated with the action phrase to be performed; and in accordance with a determination that the voice command spoken by the user is not the action phrase, causing the media content to continue to be played. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A non-transitory computer readable medium comprising one or more computer programs, which when executed by a device with one or more processors, cause the device to:
-
monitor media content streamed from a content provider to a user, the media content comprising both non-interactive content and interactive content in at least an audio or video format; detect a first event in the media content streamed to the user, the first event comprising information associated with activating a speech recognizer to detect voice commands spoken by the user; in response to detecting the first event in the media content streamed to the user, activate the speech recognizer; while the speech recognizer is activated, receive a voice command spoken by the user and detected by the speech recognizer; in accordance with a determination that the voice command spoken by the user is an action phrase, cause an action associated with the action phrase to be performed; and in accordance with a determination that the voice command spoken by the user is not the action phrase, cause the media content to continue to be played. - View Dependent Claims (20)
-
-
21. A device comprising:
-
one or more processors; a memory storing one or more computer programs that include computer instructions, which when executed by the one or more processors, cause the device to; monitor non-interactive media content in at least an audio or video format that is streamed from a first source to a user; detect a first event in the non-interactive media content streamed to the user, the first event indicating a start time of a playback of interactive media content and comprising information associated with activating a speech recognizer to detect voice commands spoken by the user; in response to detecting the first event in the non-interactive media content streamed to the user; cause the interactive media content in at least an audio or video format to be played by a second source to the user at the indicated start time, and activate the speech recognizer; while the speech recognizer is activated, receive a voice command spoken by the user and detected by the speech recognizer; in accordance with a determination that the voice command spoken by the user is an action phrase associated with the interactive media content, cause an action associated with the action phrase to be performed; and in accordance with a determination that the voice command spoken by the user is not the action phrase associated with the interactive media content, cause the non-interactive media content to continue to be played. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30)
-
-
31. A method for providing speech interactive content, the method comprising:
-
monitoring non-interactive media content in at least an audio or video format that is streamed from a first source to a user; detecting a first event in the non-interactive media content streamed to the user, the first event indicating a start time of a playback of interactive media content and comprising information associated with activating a speech recognizer to detect voice commands spoken by the user; in response to detecting the first event in the non-interactive media content streamed to the user; causing the interactive media content in at least an audio or video format to be played by a second source to the user at the indicated start time, and activating the speech recognizer; while the speech recognizer is activated, receiving a voice command spoken by the user and detected by the speech recognizer; in accordance with a determination that the voice command spoken by the user is an action phrase associated with the interactive media content, cause an action associated with the action phrase to be performed; and in accordance with a determination that the voice command spoken by the user is not the action phrase associated with the interactive media content, cause the non-interactive media content to continue to be played. - View Dependent Claims (32, 33, 34, 35, 36, 37, 38, 39, 40)
-
-
41. A non-transitory computer readable medium comprising one or more computer programs, which when executed by a device with one or more processors, cause the device to:
-
monitor non-interactive media content in at least an audio or video format that is streamed by a first source to a user; detect a first event in the non-interactive media content streamed to the user, the first event indicating a start time of a playback of interactive media content and comprising information associated with activating a speech recognizer to detect voice commands spoken by the user; in response to detecting the first event in the non-interactive media content streamed to the user; cause the interactive media content in at least an audio or video format to be played by a second source to the user at the indicated start time, and activate the speech recognizer; while the speech recognizer is activated, receive a voice command spoken by the user and detected by the speech recognizer; in accordance with a determination that the voice command spoken by the user is an action phrase associated with the interactive media content, cause an action associated with the action phrase to be performed; and in accordance with a determination that the voice command spoken by the user is not the action phrase associated with the interactive media content, cause the non-interactive media content to continue to be played.
-
Specification