11-07 | 12-07 | 13-07 | 14-07 | 15-07 | |
MONDAY | TUESDAY | WEDNESDAY | THURSDAY | FRIDAY | |
08:30- | Welcome | ||||
09:00 | Vincent Francois-Lavet Kickoff RL week+ Generalization in RL (slides) |
Herke van Hoof Temporal difference with function approximators (slides) |
Aske Plaat Deep model-based RL (slides) |
Herke van Hoof Hierarchical RL (slides) |
Quizz (questions) |
10:30 | Coffee Break | Coffee Break | Coffee Break | Coffee Break | Coffee Break |
10:45 |
Julia Olkhovskaya Intro to bandits (slides) |
Peter Bloem MCTS (slides) |
Elise van der Pol Symmetries in RL (slides) |
Thomas Kipf World models and object-centric learning (slides) |
Practical session (notebook) |
12:15 | Lunch Break | Lunch Break | Lunch Break | Lunch Break | Lunch Break |
14:00 |
Emile van Krieken Policy gradients (slides) |
Alessandro Lazaric Exploration/exploitation (slides) |
Social activity |
Pablo Samuel Castro State-similarity metrics (slides) - A1, A2 |
Wouter Koolen-Wijkstra Pure exploration problems (slides) |
15:30 | Coffee Break | Coffee Break | Coffee Break | Coffee Break | |
15:45 |
Practical session (notebook) |
Practical session (notebook) |
Practical session (notebook) |
Marc Bellemare Distributional RL |