Abstract

We explore the roles and interactions of the hyper-parameters that govern regularization, and propose a range of values applicable to low-resource neural machine translation. We demonstrate that default or recommended values for high-resource settings are not optimal for low-resource ones, and that more aggressive regularization is needed when resources are scarce, in proportion to their scarcity. We explain our observations in terms of the generalization abilities of sharp versus flat basins in the loss landscape of a neural network. Results for four regularization factors (batch size, learning rate, dropout rate, and gradient clipping) corroborate our claim. Moreover, we show that optimal results are obtained by combining several of these factors, and that our findings generalize across datasets of different sizes and languages.
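As a rough illustration of the idea that regularization should scale with data scarcity, the sketch below picks the four factors discussed above as a function of corpus size. The parameter names and all numeric values are hypothetical placeholders chosen for illustration, not the values reported or recommended in the paper.

```python
# Minimal sketch: choose regularization hyper-parameters in proportion to
# data scarcity. Names and values are illustrative assumptions only.

def regularization_config(num_sentence_pairs: int) -> dict:
    """Return a (hypothetical) regularization setup for a given corpus size."""
    if num_sentence_pairs > 1_000_000:
        # High-resource: mild regularization, close to common defaults.
        return {"batch_size_tokens": 8192, "learning_rate": 5e-4,
                "dropout": 0.1, "grad_clip_norm": 0.0}
    elif num_sentence_pairs > 100_000:
        # Mid-resource: moderately more aggressive regularization.
        return {"batch_size_tokens": 4096, "learning_rate": 3e-4,
                "dropout": 0.3, "grad_clip_norm": 1.0}
    else:
        # Low-resource: aggressive regularization across all four factors.
        return {"batch_size_tokens": 1024, "learning_rate": 1e-4,
                "dropout": 0.5, "grad_clip_norm": 0.5}


if __name__ == "__main__":
    # Example: a 50k-sentence corpus falls into the low-resource branch.
    print(regularization_config(50_000))
```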
