AUDIO, VIDEO, SIMULATION, AND USER INTERFACE PARADIGMS
Abstract
Consumer electronic devices have been developed with enormous information processing capabilities, high quality audio and video outputs, and large amounts of memory, and may also include wired and/or wireless networking capabilities. Additionally, relatively unsophisticated and inexpensive sensors, such as microphones, video cameras, and GPS or other position sensors, when coupled with devices having these enhanced capabilities, can be used to detect subtle features about users and their environments. A variety of audio, video, simulation and user interface paradigms have been developed to utilize the enhanced capabilities of these devices. These paradigms can be used separately or together in any combination. One paradigm includes automatically creating user identities using speaker identification. Another paradigm includes a control button with 3-axis pressure sensitivity for use with game controllers and other input devices.
20 Claims
1. A method comprising:
acquiring activity data of a user located within range of one or more sensors, the one or more sensors associated with a display screen displaying video content, the displayed video content having a volume level, the activity data being in the form of one or more temporal audio or video samples;
analyzing, using at least one processor operatively coupled with a memory, one or more of the temporal video samples to determine if the user has looked away from the display screen for a first predetermined period of time by searching successive images of the temporal video samples to detect a presence or absence of a frontal face corresponding to the user; and
analyzing one or more of the temporal audio samples to determine if the user has had an emotional response relative to the activity on the display screen for a second predetermined period of time by searching a plurality of successive subsets of the temporal audio samples having a predetermined duration to detect a presence or absence of the user's voice activity in each subset, and determining whether or not the user's voice activity is present in a predetermined consecutive number of the subsets at a volume greater than a predetermined level.
Dependent claims: 2, 3, 4, 5, 6, 7
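The video-analysis element of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes frames arrive as (timestamp, image) pairs and that a frontal-face detector is supplied by the caller; the names `looked_away`, `has_frontal_face`, and `min_away_seconds` are hypothetical and do not come from the patent.

```python
from typing import Any, Callable, Iterable, Optional, Tuple


def looked_away(
    frames: Iterable[Tuple[float, Any]],
    has_frontal_face: Callable[[Any], bool],
    min_away_seconds: float,
) -> bool:
    """Return True if no frontal face is found for at least min_away_seconds.

    frames: successive (timestamp_in_seconds, image) pairs in time order.
    has_frontal_face: hypothetical detector supplied by the caller; any
        frontal-face detector could be plugged in here.
    min_away_seconds: stands in for the "first predetermined period of time".
    """
    away_since: Optional[float] = None  # time of first frame with no face
    for timestamp, image in frames:
        if has_frontal_face(image):
            away_since = None  # face present again, so reset the timer
            continue
        if away_since is None:
            away_since = timestamp
        if timestamp - away_since >= min_away_seconds:
            return True
    return False


# Usage with stand-in "images": each image is just a bool saying whether
# a frontal face is visible, so the detector is the identity function.
demo_frames = [(0.0, True), (0.5, False), (1.0, False),
               (1.5, False), (2.0, False), (2.5, True)]
print(looked_away(demo_frames, lambda img: img, min_away_seconds=1.5))
```

The sketch resets its timer whenever a frontal face reappears, so brief glances away shorter than the threshold are not reported.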
8. A system comprising:
one or more sensors configured for acquiring activity data of a user located within range of the one or more sensors associated with a display screen displaying video content, the displayed video content having a volume level, the activity data being in the form of one or more temporal audio or video samples;
a processor configured for analyzing one or more of the temporal video samples to determine if the user has looked away from the display screen for a first predetermined period of time by searching successive images of the temporal video samples to detect a presence or absence of a frontal face corresponding to the user; and
the processor configured for analyzing one or more of the temporal audio samples to determine if the user has had an emotional response relative to the activity on the display screen for a second predetermined period of time by searching a plurality of successive subsets of the temporal audio samples having a predetermined duration to detect a presence or absence of the user's voice activity in each subset, and determining whether or not the user's voice activity is present in a predetermined consecutive number of the subsets at a volume greater than a predetermined level.
Dependent claims: 9, 10, 11, 12, 13, 14
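The audio-analysis element recited in the independent claims can be sketched in the same way. The sketch below is a simplification and not the patented implementation: it treats per-subset RMS level as a stand-in for both voice activity detection and volume, whereas a real system would presumably use a proper voice activity detector to separate the user's voice from other sounds. The names `emotional_response`, `subset_len`, `volume_threshold`, and `required_consecutive` are hypothetical.

```python
import math
from typing import Sequence


def rms(samples: Sequence[float]) -> float:
    """Root-mean-square level of a block of audio samples."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def emotional_response(
    samples: Sequence[float],
    subset_len: int,
    volume_threshold: float,
    required_consecutive: int,
) -> bool:
    """Return True if enough consecutive subsets exceed the volume threshold.

    subset_len: samples per subset (the "predetermined duration").
    volume_threshold: the "predetermined level", compared against RMS.
    required_consecutive: the "predetermined consecutive number" of subsets.
    Assumption: loudness above the threshold is treated as voice activity.
    """
    run = 0  # current count of consecutive loud subsets
    for start in range(0, len(samples) - subset_len + 1, subset_len):
        subset = samples[start:start + subset_len]
        if rms(subset) > volume_threshold:
            run += 1
            if run >= required_consecutive:
                return True
        else:
            run = 0  # a quiet subset breaks the streak
    return False


# Usage with synthetic audio: four-sample subsets, quiet or loud.
quiet, loud = [0.01] * 4, [0.5] * 4
audio = quiet + loud + loud + quiet + loud + loud + loud
print(emotional_response(audio, subset_len=4,
                         volume_threshold=0.1, required_consecutive=3))
```

Non-overlapping subsets keep the example short; overlapping windows would be an equally plausible reading of "successive subsets".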
15. A machine-readable non-transitory medium embodying information indicative of instructions for causing one or more machines to perform operations comprising:
acquiring activity data of a user located within range of one or more sensors, the one or more sensors associated with a display screen displaying video content, the displayed video content having a volume level, the activity data being in the form of one or more temporal audio or video samples;
analyzing one or more of the temporal video samples to determine if the user has looked away from the display screen for a first predetermined period of time by searching successive images of the temporal video samples to detect a presence or absence of a frontal face corresponding to the user; and
analyzing one or more of the temporal audio samples to determine if the user has had an emotional response relative to the activity on the display screen for a second predetermined period of time by searching a plurality of successive subsets of the temporal audio samples having a predetermined duration to detect a presence or absence of the user's voice activity in each subset, and determining whether or not the user's voice activity is present in a predetermined consecutive number of the subsets at a volume greater than a predetermined level.
Dependent claims: 16, 17, 18, 19, 20
Specification