On the scale invariance in state of the art CNNs trained on ImageNet

Graziani, Mara; Lompech, Thomas; Müller, Henning; Depeursinge, Adrien; Andrearczyk, Vincent

doi:10.3390/make3020019

Graziani, Mara; Lompech, Thomas; Müller, Henning; Depeursinge, Adrien; Andrearczyk, Vincent

2021

Télécharger

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Résumé

The diffused practice of pre-training Convolutional Neural Networks (CNNs) on large natural image datasets such as ImageNet causes the automatic learning of invariance to object scale variations. This, however, can be detrimental in medical imaging, where pixel spacing has a known physical correspondence and size is crucial to the diagnosis, for example, the size of lesions, tumors or cell nuclei. In this paper, we use deep learning interpretability to identify at what intermediate layers such invariance is learned. We train and evaluate different regression models on the PASCAL-VOC (Pattern Analysis, Statistical modeling and ComputAtional Learning-Visual Object Classes) annotated data to (i) separate the effects of the closely related yet different notions of image size and object scale, (ii) quantify the presence of scale information in the CNN in terms of the layer-wise correlation between input scale and feature maps in InceptionV3 and ResNet50, and (iii) develop a pruning strategy that reduces the invariance to object scale of the learned features. Results indicate that scale information peaks at central CNN layers and drops close to the softmax, where the invariance is reached. Our pruning strategy uses this to obtain features that preserve scale information. We show that the pruning significantly improves the performance on medical tasks where scale is a relevant factor, for example for the regression of breast histology image magnification. These results show that the presence of scale information at intermediate layers legitimates transfer learning in applications that require scale covariance rather than invariance and that the performance on these tasks can be improved by pruning off the layers where the invariance is learned. All experiments are performed on publicly available data and the code is available on GitHub.

Détails

Titre

On the scale invariance in state of the art CNNs trained on ImageNet

Auteur(s)/ trice(s)

Graziani, Mara (University of Applied Sciences and Arts Western Switzerland (HES-SO Valais-Wallis) ; University of Geneva, Switzerland)
Lompech, Thomas (Institute National Polytechnique de Toulouse, Ecole Nationale Supérieure d’Electrotechnique, d’Electronique, d’Informatique, d’Hydraulique et des Télécommunications (INP-ENSEEIHT), Toulouse, France)
Müller, Henning (University of Applied Sciences and Arts Western Switzerland (HES-SO Valais-Wallis) ; University of Geneva, Switzerland)
Depeursinge, Adrien (University of Applied Sciences and Arts Western Switzerland (HES-SO Valais-Wallis) ; Nuclear Medicine and Molecular Imaging Department, Centre Hospitalier Universitaire Vaudois (CHUV), Lausanne, Switzerland)
Andrearczyk, Vincent (University of Applied Sciences and Arts Western Switzerland (HES-SO Valais-Wallis))

Date

2021-06

Publié dans

Machine learning and knowledge extraction

Volume

2021, vol. 3, no. 2, pp. 374-391

Pagination

18 p.

DOI

https://doi.org/10.3390/make3020019

ISSN

2504-4990

Mots-clés (libres)

scale invariance ; deep learning ; interpretability ; medical imaging

Type d'article

scientifique

Domaine

Economie et Services

Ecole

HEG-VS

Institut

Institut Informatique de gestion

Le document apparaît dans

Articles scientifiques
Global

Résumé

Détails

Actions

PDF