Multi-agent reinforcement learning for integrated and networked adaptive traffic signal control

  • US 9,818,297 B2
  • Filed: 12/10/2012
  • Issued: 11/14/2017
  • Est. Priority Date: 12/16/2011
  • Status: Active Grant
First Claim

1. A system for adaptive traffic signal control comprising:

  • an agent comprising:

    a processor;

    a communication interface for coupling to a traffic signal array at a first intersection and to one or more other agents; and

    a memory storing computer readable instructions that, when executed by the processor, cause the processor to generate and provide to the traffic signal array a control action for the traffic signal array by continuously updating in real-time a joint control policy for causing the agent to collaborate with the one or more other agents in communication with the agent, the one or more other agents controlling selected neighbouring traffic signal arrays located at other intersections neighbouring the first intersection along two dimensions, the joint control policy comprising a traffic optimization policy simultaneously considering both of the two dimensions, determination of the joint control policy comprising:

    tracking the control action at each update of the joint control policy and, updating of a Q-value or a Q-factor of the joint control policy to improve a cumulative reward, the updating of the joint control policy being based on:

    the tracked control actions;

    respective selected control actions and individual control policies exchanged by the agent with the one or more other agents for negotiation, each individual control policy defining a mapping from a traffic state to a control action for the respective agent; and

    gain messages exchanged by the agent with the one or more other agents comprising, for the exchanged selected control actions and individual control policies, maximum gain values determined by each agent to be obtainable by the respective agent changing its selected control action to the selected actions of the other agents.
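
The claim's last three elements (a Q-value update driven by tracked actions, exchanged neighbour actions, and gain messages) can be illustrated with a minimal Python sketch. It is not the patented implementation and not a reading of the claim's legal scope. The class `Agent`, the action names in `ACTIONS`, the state labels, the single-neighbour Q-table layout, and the parameters `alpha` and `gamma` are all assumptions made for illustration. The update shown is a standard tabular Q-learning rule, and the "gain" is taken here to be the improvement in Q-value an agent could obtain by changing its own action while the neighbour's announced action stays fixed.

```python
from collections import defaultdict

# Hypothetical control actions for one signal array (not taken from the patent).
ACTIONS = ("extend_green", "switch_phase")


class Agent:
    """Illustrative agent that learns Q-values over (state, own action, neighbour action)."""

    def __init__(self, alpha=0.1, gamma=0.9):
        self.alpha = alpha  # learning rate (assumed)
        self.gamma = gamma  # discount factor (assumed)
        # q[(state, own_action, neighbour_action)] -> estimated cumulative reward
        self.q = defaultdict(float)

    def update(self, state, action, nb_action, reward, next_state, next_nb_action):
        """Standard Q-learning update, conditioned on the neighbour's exchanged action."""
        best_next = max(self.q[(next_state, a, next_nb_action)] for a in ACTIONS)
        key = (state, action, nb_action)
        self.q[key] += self.alpha * (reward + self.gamma * best_next - self.q[key])

    def max_gain(self, state, current_action, nb_action):
        """Return (best_action, gain) obtainable by changing only this agent's action."""
        current = self.q[(state, current_action, nb_action)]
        best_action = max(ACTIONS, key=lambda a: self.q[(state, a, nb_action)])
        return best_action, self.q[(state, best_action, nb_action)] - current


agent = Agent(alpha=0.5, gamma=0.0)
agent.update("queue_long", "switch_phase", "extend_green",
             reward=2.0, next_state="queue_short", next_nb_action="extend_green")
# The (best_action, gain) pair would form the content of a gain message to neighbours.
print(agent.max_gain("queue_long", "extend_green", "extend_green"))
```

In a negotiation round under these assumptions, each agent would send its `max_gain` result to its neighbours, and the agent reporting the largest gain would be the one to change its action.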
