Abstract

Edge devices must support computationally demanding algorithms, such as neural networks, within tight area and energy budgets. While approximate computing may alleviate these constraints, limiting the errors it induces remains an open challenge. In this paper, we propose a hardware/software co-design solution based on an inexact multiplier that reduces area and power-delay product by 73% and 43%, respectively, while still computing exact results whenever one input is a Fibonacci-encoded value. We introduce a retraining strategy that quantizes neural network weights to Fibonacci-encoded values, ensuring exact computation during inference. We benchmark our strategy on SqueezeNet 1.0, DenseNet-121, and ResNet-18, measuring accuracy degradations of only 0.4%, 1.1%, and 1.7%, respectively.
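
As a rough illustration of the weight constraint (a sketch, not the paper's implementation), the snippet below assumes that "Fibonacci encoded" denotes a Zeckendorf-style binary code with no two adjacent set bits, and rounds a non-negative integer weight to the nearest legal code; both function names are hypothetical.

```python
def is_fibonacci_encoded(x: int) -> bool:
    # Assumed reading of "Fibonacci encoded": a Zeckendorf-style binary
    # code in which no two adjacent bit positions are both set.
    return (x & (x >> 1)) == 0

def nearest_fibonacci_encoded(x: int) -> int:
    # Round a non-negative integer weight to the closest value whose
    # binary form has no two adjacent 1 bits. A brute-force outward
    # search is fine at the small bit widths typical of quantized weights.
    for delta in range(x + 1):
        if is_fibonacci_encoded(x - delta):
            return x - delta
        if is_fibonacci_encoded(x + delta):
            return x + delta
    return 0

# Example: 6 = 0b110 has adjacent 1s; the nearest legal code is 5 = 0b101.
print(nearest_fibonacci_encoded(6))  # -> 5
```

Under this assumption, a retraining step would project each quantized weight onto the nearest Fibonacci-encoded value so that every inference-time multiplication hits the multiplier's exact case.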
