Learning, inference and behaviour in the perception-action loop with a specific interest in model-based RL, sparse rewards problems and action advising.
Welcome to the Reinforcement Learning reading group at the Queen Mary University of London.
Date | Paper | Presenter |
---|---|---|
4 February 2020 | Empowerment-driven Exploration using Mutual Information Estimation - Kumar (2018) | Alvaro Ovalle & Martin Balla |
28 January 2020 | MINE: Mutual Information Neural Estimation - Belghazi et al. (2018) | Alvaro Ovalle & Martin Balla |
14 January 2020 | The Journey is the Reward: Unsupervised Learning of Influential Trajectories - Binas et al. (2019) | Alvaro Ovalle & Martin Balla |
8 January 2020 | Empowerment – an introduction - Salge et al. (2013) | Alvaro Ovalle & Martin Balla |
21 November 2019 | QXplore: Q-learning Exploration by Maximizing Temporal Difference Error - Simmons-Edler et al. (2019) | Ercüment İlhan |
7 November 2019 | Grandmaster level in StarCraft II using multi-agent reinforcement learning - Vinyals et al. (2019) | Martin Balla |
31 October 2019 | Unsupervised Video Object Segmentation for Deep Reinforcement Learning - Goel et al. (2018) | Alvaro Ovalle |
24 October 2019 | Solving Rubik’s Cube with a Robot Hand - OpenAI et al. (2019) | Chris Bamford |
17 October 2019 | Diversity is all you need - Eysenbach et al. (2018) | Martin Balla |
09 October 2019 | Deep Exploration via Bootstrapped DQN - Osband et al. (2016) | Ercüment İlhan |
28 August 2019 | Learning Latent Dynamics for Planning from Pixels - Hafner et al. (2018) | Alvaro Ovalle |
16 August 2019 | Shaping belief states with generative environment models for RL - Gregor et al. (2019) | Chris Bamford |
4 July 2019 | Learning to act by predicting the future - Dosovitskiy & Koltun (2016) | Martin Balla |
27 June 2019 | Imitating latent policies from observation - Edwards et al. (2018) | Ercüment İlhan |
1 May 2019 | World Discovery Models - Azar et al. (2019) | Alvaro Ovalle |
17 April 2019 | Provably efficient RL with Rich Observations via Latent State Decoding - Du et al. (2019) | Chris Bamford |
10 April 2019 | Distilling Policy Distillation - Czarnecki et al. (2019) | Martin Balla |
3 April 2019 | Deep Reinforcement Learning that Matter - Henderson et al. (2017) | Ercüment İlhan |
21 March 2019 | Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play - Sukhbaatar et al. (2017) | Alvaro Ovalle |
6 March 2019 | Deep reinforcement learning with relational inductive biases - Zambaldi et al. (2018) | Martin Balla |
28 Feb 2019 | Show, Attend and Tell: Neural Image Caption Generation with Visual Attention - Xu et al. (2015) | Chris Bamford |
13 Feb 2019 | Universal Value Function Approximators - Schaul et al. (2015) | Ercüment İlhan |
6 Feb 2019 | Temporal Difference Variational Auto-Encoder - Gregor et al. (2018) | Alvaro Ovalle & Chris Bamford |
6 Feb 2019 | An investigation of model-free planning - Guez et al. (2019) | Alvaro Ovalle |
30 Jan 2019 | Hindsight Experience Replay - Andrychowicz et al. (2017) | Alvaro Ovalle |
24 Jan 2019 | Strategic Attentive Writer for Learning Macro-Actions - Vezhnevets et al. (2016) | Martin Balla |
16 Jan 2019 | Imagination-Augmented Agents for Deep Reinforcement Learning - Weber et al. (2017) | Chris Bamford |
12 Dec 2018 | Machine Theory of Mind - Rabinowitz et al. (2018) | Ercüment İlhan |
15 Nov 2018 | Exploration by Random Network Distillation - Burda et al. (2018) | Alvaro Ovalle |
The end is nigh