Reinforcement Learning at QMUL

Research Interest

Learning, inference and behaviour in the perception-action loop with a specific interest in model-based RL, sparse rewards problems and action advising.

Reading Group

Welcome to the Reinforcement Learning reading group at the Queen Mary University of London.

Date Paper Presenter
4 February 2020 Empowerment-driven Exploration using Mutual Information Estimation - Kumar (2018) Alvaro Ovalle & Martin Balla
28 January 2020 MINE: Mutual Information Neural Estimation - Belghazi et al. (2018) Alvaro Ovalle & Martin Balla
14 January 2020 The Journey is the Reward: Unsupervised Learning of Influential Trajectories - Binas et al. (2019) Alvaro Ovalle & Martin Balla
8 January 2020 Empowerment – an introduction - Salge et al. (2013) Alvaro Ovalle & Martin Balla
21 November 2019 QXplore: Q-learning Exploration by Maximizing Temporal Difference Error - Simmons-Edler et al. (2019) Ercüment İlhan
7 November 2019 Grandmaster level in StarCraft II using multi-agent reinforcement learning - Vinyals et al. (2019) Martin Balla
31 October 2019 Unsupervised Video Object Segmentation for Deep Reinforcement Learning - Goel et al. (2018) Alvaro Ovalle
24 October 2019 Solving Rubik’s Cube with a Robot Hand - OpenAI et al. (2019) Chris Bamford
17 October 2019 Diversity is all you need - Eysenbach et al. (2018) Martin Balla
09 October 2019 Deep Exploration via Bootstrapped DQN - Osband et al. (2016) Ercüment İlhan
28 August 2019 Learning Latent Dynamics for Planning from Pixels - Hafner et al. (2018) Alvaro Ovalle
16 August 2019 Shaping belief states with generative environment models for RL - Gregor et al. (2019) Chris Bamford
4 July 2019 Learning to act by predicting the future - Dosovitskiy & Koltun (2016) Martin Balla
27 June 2019 Imitating latent policies from observation - Edwards et al. (2018) Ercüment İlhan
1 May 2019 World Discovery Models - Azar et al. (2019) Alvaro Ovalle
17 April 2019 Provably efficient RL with Rich Observations via Latent State Decoding - Du et al. (2019) Chris Bamford
10 April 2019 Distilling Policy Distillation - Czarnecki et al. (2019) Martin Balla
3 April 2019 Deep Reinforcement Learning that Matter - Henderson et al. (2017) Ercüment İlhan
21 March 2019 Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play - Sukhbaatar et al. (2017) Alvaro Ovalle
6 March 2019 Deep reinforcement learning with relational inductive biases - Zambaldi et al. (2018) Martin Balla
28 Feb 2019 Show, Attend and Tell: Neural Image Caption Generation with Visual Attention - Xu et al. (2015) Chris Bamford
13 Feb 2019 Universal Value Function Approximators - Schaul et al. (2015) Ercüment İlhan
6 Feb 2019 Temporal Difference Variational Auto-Encoder - Gregor et al. (2018) Alvaro Ovalle & Chris Bamford
6 Feb 2019 An investigation of model-free planning - Guez et al. (2019) Alvaro Ovalle
30 Jan 2019 Hindsight Experience Replay - Andrychowicz et al. (2017) Alvaro Ovalle
24 Jan 2019 Strategic Attentive Writer for Learning Macro-Actions - Vezhnevets et al. (2016) Martin Balla
16 Jan 2019 Imagination-Augmented Agents for Deep Reinforcement Learning - Weber et al. (2017) Chris Bamford
12 Dec 2018 Machine Theory of Mind - Rabinowitz et al. (2018) Ercüment İlhan
15 Nov 2018 Exploration by Random Network Distillation - Burda et al. (2018) Alvaro Ovalle

The end is nigh