Completed STUDENTSHIP UKRI Gateway to Research

Synthetic Environment Design for Reinforcement Learning

Funder	Engineering and Physical Sciences Research Council
Recipient Organization	University of Oxford
Country	United Kingdom
Start Date	Sep 30, 2021
End Date	Sep 29, 2025
Duration	1,460 days
Number of Grantees	2
Roles	Student; Supervisor
Data Source	UKRI Gateway to Research
Grant ID	`2579054`

Grant Description

Brief description of the context of the research including the potential impact

Reinforcement learning (RL) has made significant strides over the last decade, achieving superhuman performance on a range of RL tasks. However, its success has largely been limited to simulated environments where it is possible to generate task experience indefinitely, an unrealistic assumption for real-world tasks with limited available experience and poor simulation quality.

One approach to improving performance in this domain is to learn a simulation of the environment, before training an agent on synthetic data generated from this simulation. Doing so would enable RL agents to be trained on significantly less experience and without a manually-programmed simulation, transferring these advancements to real-world tasks.

Aims and Objectives

We aim to develop various methods for generating synthetic task experience and training RL agents on synthetically generated data. These methods should allow for improvements in sample efficiency on a range of existing RL benchmarks and may enable novel applications of RL in new domains. Novelty of the research methodology

Training RL agents on synthetic experience is a little-studied topic, however, some recent works have demonstrated success from it. We are proposing a novel approach to generating synthetic experience, in which we will use insights from Unsupervised Environment Design to tailor the experience to the current capabilities of the agent, thereby improving training efficiency and asymptotic performance.

Alignment to EPSRC's strategies and research areas

This project falls under the EPSRC's Artificial Intelligence Technologies research area, within the Engineering and ICT themes. Specifically, this work will enable the deployment of RL-based systems to real-world applications such as robotics, by allowing agents to train on minimal task experience and without any need for manually-programmed simulation.

All Grantees

University of Oxford

Interested in applying for this grant?

Complete our application form to express your interest and we'll guide you through the process.

Apply for This Grant

Synthetic Environment Design for Reinforcement Learning

Grant Description

All Grantees

Interested in applying for this grant?

Quick Summary

Related Grants