COMPOUND GESTURE-SPEECH COMMANDS
First Claim
1. A method for controlling a computing system using voice commands, comprising:
accessing multiple depth images from a depth sensor system;
recognizing a gesture from the multiple depth images;
in response to recognizing a gesture, choosing a subset of a set of sound commands based on the recognized gesture, the set of sound commands includes multiple subsets, each subset is associated with one or more gestures and sound command recognition data for the respective subset;
receiving sound input;
recognizing a sound command from the chosen subset based on the sound input; and
performing an action in response to the recognized sound command.
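The steps of this claim can be illustrated with a minimal sketch. It is not the patented implementation: the gesture names, the command phrases, and the helper functions `choose_subset`, `recognize_sound_command`, and `handle` are all hypothetical, and the sound input is assumed to arrive already transcribed as text, with the gesture already recognized from the depth images.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional


@dataclass(frozen=True)
class SoundCommand:
    phrase: str
    action: Callable[[], str]


# Hypothetical table: each recognized gesture selects one subset of the
# full set of sound commands (names are illustrative assumptions).
COMMAND_SUBSETS: Dict[str, Dict[str, SoundCommand]] = {
    "point_at_screen": {
        "play": SoundCommand("play", lambda: "playback started"),
        "pause": SoundCommand("pause", lambda: "playback paused"),
    },
    "raise_hand": {
        "volume up": SoundCommand("volume up", lambda: "volume raised"),
        "mute": SoundCommand("mute", lambda: "sound muted"),
    },
}


def choose_subset(gesture: str) -> Dict[str, SoundCommand]:
    """Choose the subset of sound commands associated with the gesture."""
    return COMMAND_SUBSETS.get(gesture, {})


def recognize_sound_command(
    sound_input: str, subset: Dict[str, SoundCommand]
) -> Optional[SoundCommand]:
    """Match the (assumed pre-transcribed) sound input against the subset only."""
    return subset.get(sound_input.strip().lower())


def handle(gesture: str, sound_input: str) -> Optional[str]:
    """Gesture -> subset -> sound command -> action."""
    subset = choose_subset(gesture)
    command = recognize_sound_command(sound_input, subset)
    return command.action() if command else None
```

In this sketch, `handle("point_at_screen", "Play")` runs the play action, while `handle("point_at_screen", "mute")` returns `None` because "mute" belongs to a subset that the gesture did not select.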
Abstract
A multimedia entertainment system combines both gestures and voice commands to provide an enhanced control scheme. A user's body position or motion may be recognized as a gesture, and may be used to provide context to recognize user-generated sounds, such as speech input. Likewise, speech input may be recognized as a voice command, and may be used to provide context to recognize a body position or motion as a gesture. Weights may be assigned to the inputs to facilitate processing. When a gesture is recognized, a limited set of voice commands associated with the recognized gesture are loaded for use. Further, additional sets of voice commands may be structured in a hierarchical manner such that speaking a voice command from one set of voice commands leads to the system loading a next set of voice commands.
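The hierarchical structure described at the end of the abstract could look roughly like the following sketch. The set names, the phrases, and the `HierarchicalCommands` class are assumptions made for illustration; the abstract does not specify a data structure.

```python
from typing import Dict, Optional

# Hypothetical command hierarchy: each phrase maps to the name of the next
# command set to load, or None if the command does not change the active set.
COMMAND_SETS: Dict[str, Dict[str, Optional[str]]] = {
    "root": {"music": "music_menu", "video": "video_menu"},
    "music_menu": {"play": None, "next track": None, "back": "root"},
    "video_menu": {"play": None, "rewind": None, "back": "root"},
}


class HierarchicalCommands:
    """Tracks which set of voice commands is currently loaded."""

    def __init__(self, sets: Dict[str, Dict[str, Optional[str]]], start: str):
        self.sets = sets
        self.active = start  # e.g. the set loaded after a recognized gesture

    def speak(self, phrase: str) -> bool:
        """Accept the phrase only if it is in the active set; load the next set if any."""
        commands = self.sets[self.active]
        key = phrase.strip().lower()
        if key not in commands:
            return False
        next_set = commands[key]
        if next_set is not None:
            self.active = next_set
        return True
```

Starting from `"root"`, speaking "music" loads `"music_menu"`; "rewind" is then rejected because it exists only in `"video_menu"`.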
20 Claims
1. A method for controlling a computing system using voice commands, comprising the steps recited in full under First Claim.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
12. A computing system, comprising:
a monitor for displaying multimedia content;
a depth sensor for capturing depth images;
a microphone for capturing sounds; and
a processor in communication with the depth sensor, the microphone and the monitor;
the processor communicates with the monitor to display an object, the processor receives multiple depth images from the depth sensor and recognizes a gesture from the multiple depth images, the processor chooses a subset of a set of sound commands based on and in response to the recognized gesture, the set of sound commands includes multiple subsets, the processor loads sound command recognition data for the chosen subset of sound commands in response to the recognized gesture, the processor receives sound input from the microphone and recognizes a sound command from the chosen subset based on the sound input and the loaded sound command recognition data without searching all of the set of sound commands, the processor performs an action in response to the recognized sound command.
Dependent claims: 13, 14, 15, 16, 17
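The processor behavior in claim 12, loading recognition data only for the chosen subset and matching against that data alone, might be sketched as below. This is an assumption-laden illustration: `RECOGNITION_DATA` stands in for whatever recognition data a real system would load, here reduced to plain phrase lists, and `SubsetRecognizer`, the gesture names, and the subset names are hypothetical.

```python
from typing import Dict, List, Optional

# Hypothetical stand-in for per-subset sound command recognition data.
RECOGNITION_DATA: Dict[str, List[str]] = {
    "media": ["play", "pause", "stop"],
    "navigation": ["up", "down", "select"],
}

# Hypothetical association between gestures and subsets.
GESTURE_TO_SUBSET: Dict[str, str] = {"point": "media", "swipe": "navigation"}


class SubsetRecognizer:
    def __init__(self) -> None:
        self.loaded_subset: Optional[str] = None
        self.loaded_phrases: List[str] = []
        self.load_count = 0  # counts how often recognition data was loaded

    def on_gesture(self, gesture: str) -> None:
        """Load recognition data for the subset chosen by the gesture."""
        subset = GESTURE_TO_SUBSET.get(gesture)
        if subset is None or subset == self.loaded_subset:
            return  # unknown gesture, or data already loaded
        self.loaded_phrases = list(RECOGNITION_DATA[subset])
        self.loaded_subset = subset
        self.load_count += 1

    def recognize(self, sound_input: str) -> Optional[str]:
        """Search only the loaded subset, never the full command set."""
        phrase = sound_input.strip().lower()
        return phrase if phrase in self.loaded_phrases else None
```

Before any gesture is seen, nothing is loaded and every sound input is rejected; after a "point" gesture, only the media phrases can match.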
18. A processor readable storage device having instructions thereon, the instructions for programming one or more processors to perform a method, comprising:
receiving sound input;
accessing multiple depth images from a depth sensor;
recognizing a gesture from the multiple depth images based on tracking motion of an object moving in a scene captured in the depth images;
in response to recognizing the gesture, choosing a subset of a set of sound commands based on the recognized gesture; and
identifying a sound command using a recognition process that is limited to attempting to match the sound input to sound commands of the chosen subset and not to sound commands of the set of sound commands not in the chosen subset.
Dependent claims: 19, 20
Specification