Multi-agent reinforcement learning for hyperparameter optimization of convolutional neural networks

Iranfar, Arman; Zapater, Marina; Atienza, David

doi:10.1109/TCAD.2021.3077193

Iranfar, Arman; Zapater, Marina; Atienza, David

2021

Télécharger

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Résumé

Nowadays, Deep Convolutional Neural Networks (DCNNs) play a significant role in many application domains, such as, computer vision, medical imaging, and image processing. Nonetheless, designing a DCNN, able to defeat the state of the art, is a manual, challenging, and time-consuming task, due to the extremely large design space, as a consequence of a large number of layers and their corresponding hyperparameters. In this work, we address the challenge of performing hyperparameter optimization of DCNNs through a novel Multi-Agent Reinforcement Learning (MARL)-based approach, eliminating the human effort. In particular, we adapt Q-learning and define learning agents per layer to split the design space into independent smaller design sub-spaces such that each agent fine-tunes the hyperparameters of the assigned layer concerning a global reward. Moreover, we provide a novel formation of Q-tables along with a new update rule that facilitates agents’ communication. Our MARL-based approach is data-driven and able to consider an arbitrary set of design objectives and constraints. We apply our MARL-based solution to different well-known DCNNs, including GoogLeNet, VGG, and U-Net, and various datasets for image classification and semantic segmentation. Our results have shown that, compared to the original CNNs, the MARL-based approach can reduce the model size, training time, and inference time by up to, respectively, 83x, 52%, and 54% without any degradation in accuracy. Moreover, our approach is very competitive to state-of-the-art neural architecture search methods in terms of the designed CNN accuracy and its number of parameters while significantly reducing the optimization cost.

Détails

Titre Multi-agent reinforcement learning for hyperparameter optimization of convolutional neural networks

Auteur(s)/ trice(s) Iranfar, Arman (EPFL, Lausanne, Switzerland)
Zapater, Marina (School of Engineering and Management Vaud, HES-SO, University of Applied Sciences and Arts Western Switzerland)
Atienza, David (EPFL, Lausanne, Switzerland)

Date 2021-05

Publié dans IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

Volume 2022, vol. 41, no. 4, pp. 1034-1047

Pagination 14 p.

DOI https://doi.org/10.1109/TCAD.2021.3077193

ISSN 0278-0070

Mots-clés (libres) optimization ; computer architecture ; training ; search problems ; kernel ; convolution ; reinforcement learning ; convolutional neural network ; hyperparameter optimization ; neural architecture search

Type d'article scientifique

Domaine Ingénierie et Architecture

Ecole HEIG-VD

Institut ReDS - Reconfigurable & embedded Digital Systems

Le document apparaît dans Articles scientifiques
Global

Résumé

Détails

Actions

PDF