Sample-efficient imitation learning via generative adversarial nets

Blondé, Lionel; Kalousis, Alexandros

2019

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Résumé

GAIL is a recent successful imitation learning architecture that exploits the adversarial training procedure introduced in GANs. Albeit successful at generating behaviours similar to those demonstrated to the agent, GAIL suffers from a high sample complexity in the number of interactions it has to carry out in the environment in order to achieve satisfactory performance. We dramatically shrink the amount of interactions with the environment necessary to learn well-behaved imitation policies, by up to several orders of magnitude. Our framework, operating in the model-free regime, exhibits a significant increase in sample-efficiency over previous methods by simultaneously a) learning a self-tuned adversarially-trained surrogate reward and b) leveraging an off-policy actor-critic architecture. We show that our approach is simple to implement and that the learned agents remain remarkably stable, as shown in our experiments that span a variety of continuous control tasks. Video visualisations available at: \url{https://youtu.be/-nCsqUJnRKU}.

Détails

Titre Sample-efficient imitation learning via generative adversarial nets

Auteur(s)/ trice(s) Blondé, Lionel (University of Geneva, Switzerland)
Kalousis, Alexandros (Haute école de gestion de Genève, HES-SO Haute Ecole Spécialisée de Suisse Occidentale)

Date 2019-04

Publié dans Proceedings of Machine Learning Research

Volume 2019, vol. 89, pp. 3138-3148

Editeur Okinawa, Japan, 16-18 April 2019

Pagination 11 p.

Présenté à The 22nd International Conference on Artificial Intelligence and Statistics, Okinawa, Japan, 2019-04-16, 2019-04-18

ISSN 2640-3498

Type de papier full paper

Domaine Economie et Services

Ecole HEG - Genève

Institut CRAG - Centre de Recherche Appliquée en Gestion

Le document apparaît dans Documents de conférences
Global

Ressource(s) externe(s) Online version

Résumé

Détails

Actions

PDF