Compound gesture-speech commands

  • US 10,534,438 B2
  • Filed: 04/28/2017
  • Issued: 01/14/2020
  • Est. Priority Date: 06/18/2010
  • Status: Active Grant
First Claim

1. An automated method of initiating a machine action based on a combination of sounds and gestures made by one or more users, the method comprising:

  • using a depth determining camera to capture a first three-dimensional body pose made and/or a first three-dimensional body action performed by a respective at least one of the one or more users;

    identifying a first pre-specified three-dimensional gesture based on the first three-dimensional body pose and/or the first three-dimensional body action of the respective at least one user, and determining a confidence level for identification of the first pre-specified three-dimensional gesture;

    assigning a weight to the first pre-specified three-dimensional gesture based on the confidence level for identification of the first pre-specified three-dimensional gesture;

    detecting a first set of one or more sounds made by the respective at least one or at least another of the one or more users, the first set of one or more sounds being made in combination with the first respective three-dimensional body pose and/or the first three-dimensional body action of the respective at least one user;

    recognizing a first voice command based on the first set of one or more sounds and determining a confidence level for recognition of the first voice command;

    assigning a weight to the first voice command based on the confidence level for recognition of the first voice command;

    automatically identifying a first command pre-associated with a compound combination of the first pre-specified three-dimensional gesture and the first voice command by determining that the weight of the first pre-specified three-dimensional gesture is greater than the weight of the first voice command, and verifying that the first voice command is within a set of voice commands associated with the first pre-specified three-dimensional gesture;

    in response to the first command, initiating performance by an instructable machine of a first machine action that has been predetermined to be commanded by the first command;

    identifying a second pre-specified three-dimensional gesture based on a second three-dimensional body pose and/or a second three-dimensional body action of the respective at least one user;

    assigning a weight to the second pre-specified three-dimensional gesture;

    recognizing a second voice command based on a second set of one or more sounds;

    assigning a weight to the second voice command;

    automatically identifying a second command pre-associated with a compound combination of the second pre-specified three-dimensional gesture and the second voice command by determining that the weight of the second voice command is greater than the weight of the second pre-specified three-dimensional gesture, and verifying that the second pre-specified three-dimensional gesture is within a set of gestures associated with the second voice command; and

    in response to the second command, initiating performance by the instructable machine of a second machine action that has been predetermined to be commanded by the second command.
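
The weight comparison and cross-verification recited in the claim can be illustrated with a minimal sketch. It is not the patented implementation. The names `Recognized`, `weight`, `resolve`, the three lookup tables, and all gesture, voice, and command labels are hypothetical. The sketch assumes the weight equals the confidence level and rejects ties, since the claim specifies neither.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Recognized:
    """A recognized gesture or voice command with its recognizer confidence."""
    name: str
    confidence: float  # assumed to lie in [0, 1]


# Hypothetical association tables; the claim only requires that such sets exist.
VOICE_FOR_GESTURE = {"point": {"select"}, "swipe_left": {"next"}}
GESTURES_FOR_VOICE = {"select": {"point"}, "open": {"point"}, "next": {"swipe_left"}}

# Hypothetical table mapping a compound (gesture, voice) pair to a command.
COMMANDS = {
    ("point", "select"): "SELECT_ITEM",
    ("point", "open"): "OPEN_ITEM",
    ("swipe_left", "next"): "NEXT_PAGE",
}


def weight(confidence: float) -> float:
    """Assumption: the weight is simply the confidence level."""
    return confidence


def resolve(gesture: Recognized, voice: Recognized) -> Optional[str]:
    """Return the compound command, or None if verification fails."""
    gesture_w = weight(gesture.confidence)
    voice_w = weight(voice.confidence)
    if gesture_w > voice_w:
        # Gesture dominates: the voice command must belong to the set of
        # voice commands associated with that gesture.
        if voice.name not in VOICE_FOR_GESTURE.get(gesture.name, set()):
            return None
    elif voice_w > gesture_w:
        # Voice dominates: the gesture must belong to the set of gestures
        # associated with that voice command.
        if gesture.name not in GESTURES_FOR_VOICE.get(voice.name, set()):
            return None
    else:
        # Equal weights are not covered by the claim; this sketch rejects them.
        return None
    return COMMANDS.get((gesture.name, voice.name))
```

In this sketch the two tables are deliberately asymmetric: the pair ("point", "open") resolves to a command only when the voice command carries the larger weight, because "open" is not listed among the voice commands associated with the "point" gesture.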
