Integrated learning for interactive synthetic characters
First Claim
1. A method for training a mechanism to perform desired actions comprising, in combination, storing state data specifying the attributes of each of a plurality of different environmental states in which said mechanism can exist, storing action data specifying the attributes of each of a plurality of different actions that said mechanism may perform, storing tuple data comprising a plurality of tuples each of which specifies a given one of said environmental states, a given one of said actions, and at least one utility value indicating the likelihood of achieving a desired outcome as a result of performing said given action when said given state exists, storing current state condition data defining the attributes of the current environmental state of said mechanism;
- accepting input stimulus data and modifying said current state condition data in response to said input stimulus data, comparing said current state condition data with said tuple data to identify matching tuples which specify an environmental state corresponding to said current state condition, selecting from said matching tuples the particular tuple having the highest utility value, performing the action specified in said particular tuple if said highest utility value is greater than a specified threshold, altering said utility value in said particular tuple to record the performance of said action, and modifying said current state condition to reflect the performance of said action.
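The claim describes a match-select-act-update loop over stored (state, action, utility) tuples. A minimal sketch of one pass of that loop, in Python; the names (`Rule`, `step`, `lr`, `reward`) and the moving-average update rule are assumptions for illustration, since the claim only requires that the utility value be altered, not how:

```python
from dataclasses import dataclass

@dataclass
class Rule:
    state: frozenset   # attribute set that must hold for this tuple to match
    action: str        # action the mechanism may perform
    utility: float     # likelihood of a desired outcome for (state, action)

def step(rules, current_state, threshold=0.4, reward=0.0, lr=0.2):
    """One pass of the claimed method: match, select, act, update."""
    # compare current state condition data with tuple data to find matching tuples
    matching = [r for r in rules if r.state <= current_state]
    if not matching:
        return None
    # select the matching tuple with the highest utility value
    best = max(matching, key=lambda r: r.utility)
    # perform the action only if that utility exceeds the specified threshold
    if best.utility <= threshold:
        return None
    # alter the utility value to record the performance of the action
    # (an exponential move toward the observed reward -- an assumed rule)
    best.utility += lr * (reward - best.utility)
    return best.action
```

For example, a state containing the attribute `hears_sit` would match both a `sit` tuple and a `bark` tuple; the one with the higher utility is selected and, after performing, has its utility nudged toward the reward received.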
Abstract
A practical approach to real-time learning for synthetic characters is presented, grounded in the techniques of reinforcement learning and informed by insights from animal training. The approach simplifies the learning task for characters by (a) enabling them to take advantage of predictable regularities in their world, (b) allowing them to make maximal use of any supervisory signals, and (c) making them easy for humans to train. An autonomous animated dog is described that can be trained with “clicker training,” a technique used to train real dogs.
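In clicker training, a sharp marker signal (the “click”) identifies which recent action earned the reward, which eases credit assignment for the learner. A minimal sketch of that idea, assuming a hypothetical `ClickerTrainee` class and a simple most-recent-action crediting rule (a fuller model would spread decayed credit over earlier actions):

```python
from collections import deque

class ClickerTrainee:
    """Credit-assignment sketch: a click reinforces the (state, action)
    pair that was active just before the click was delivered."""
    def __init__(self, lr=0.25, memory=5):
        self.lr = lr
        self.utilities = {}                  # (state, action) -> utility estimate
        self.recent = deque(maxlen=memory)   # short history of (state, action) pairs

    def act(self, state, action):
        self.recent.append((state, action))

    def click(self):
        # credit only the most recent pair -- a simplifying assumption
        if not self.recent:
            return
        key = self.recent[-1]
        u = self.utilities.get(key, 0.0)
        self.utilities[key] = u + self.lr * (1.0 - u)
```

Repeated clicks after the same behavior drive that pair's utility toward 1, mirroring how a real dog's response to a cue becomes more reliable with consistent marking and reward.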
4 Claims
Claim 1 (set forth above under “First Claim”), with dependent claims 2, 3 and 4.
Specification