Using Adalines to Approximate Q-functions in Reinforcement Learning

About This Presentation

Title:

Description:

Number of Views:82

Avg rating:3.0/5.0

Slides: 12

Provided by: Steven1091

Learn more at: https://www.cs.hmc.edu

Category:

more less

Transcript and Presenter's Notes

Title: Using Adalines to Approximate Q-functions in Reinforcement Learning

1
Using Adalines to Approximate Q-functions in
Reinforcement Learning

2
The Problem

Timing traffic lights for optimal traffic flow is
hard
It would be really nice if there was a good way
to have the traffic lights learn the best timing

3
Green Light District

4
Green Light District

TLController fills out a table with the gains
for each lane
SimModel picks the best legal light configuration
Cars are allowed to move (or not) and the
TLController gets to listen in on their movement
Repeat

5
Existing Algorithms

6
My Algorithm

Use a neural network instead of dynamic
programming
Good
Network can deal with continuous input
Might be able to recognize traffic patterns that
are not available using a table lookup
Bad
Hard to tell what the network will learn
Hard to figure out useful input
Hard to tell what the right output is for
training

7
Pitfalls / Solutions

Dont know if we will be red or green
Two adalines to predict reward if the light is
red or greengain is the difference
Input
(for each lane) number of cars, traffic density,
is a given lane full
Rewards
Reward for cars moving, passing through
intersections
Shared reward for other lanes in the intersection

8
Results Split