ALPINE : analog in-memory acceleration with tight processor integration for deep learning

Klein, Joshua; Boybat, Irem; Qureshi, Yasir; Dazzi, Martino; Levisse, Alexandre; Ansaloni, Giovanni; Zapater, Marina; Sebastian, Abu; Atienza, David

doi:10.1109/TC.2022.3230285

Klein, Joshua; Boybat, Irem; Qureshi, Yasir; Dazzi, Martino; Levisse, Alexandre; Ansaloni, Giovanni; Zapater, Marina; Sebastian, Abu; Atienza, David

2022

Télécharger

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Résumé

Analog in-memory computing (AIMC) cores offers significant performance and energy benefits for neural network inference with respect to digital logic (e.g., CPUs). AIMCs accelerate matrix-vector multiplications, which dominate these applications' run-time. However, AIMC-centric platforms lack the flexibility of general-purpose systems, as they often have hard-coded data flows and can only support a limited set of processing functions. With the goal of bridging this gap in flexibility, we present a novel system architecture that tightly integrates analog in-memory computing accelerators into multi-core CPUs in general-purpose systems. We developed a powerful gem5-based full system-level simulation framework into the gem5-X simulator, ALPINE, which enables an in-depth characterization of the proposed architecture. ALPINE allows the simulation of the entire computer architecture stack from major hardware components to their interactions with the Linux OS. Within ALPINE, we have defined a custom ISA extension and a software library to facilitate the deployment of inference models. We showcase and analyze a variety of mappings of different neural network types, and demonstrate up to 20.5x/20.8x performance/energy gains with respect to a SIMD-enabled ARM CPU implementation for convolutional neural networks, multi-layer perceptrons, and recurrent neural networks.

Détails

Titre ALPINE : analog in-memory acceleration with tight processor integration for deep learning

Auteur(s)/ trice(s) Klein, Joshua (EPFL, Lausanne, Switzerland)
Boybat, Irem (IBM Research Europe, Ruschlikon, Switzerland)
Qureshi, Yasir (EPFL, Lausanne, Switzerland)
Dazzi, Martino (IBM Research Europe, Ruschlikon, Switzerland)
Levisse, Alexandre (EPFL, Lausanne, Switzerland)
Ansaloni, Giovanni (EPFL, Lausanne, Switzerland)
Zapater, Marina (School of Engineering and Management Vaud, HES-SO, University of Applied Sciences and Arts Western Switzerland)
Sebastian, Abu (IBM Research Europe, Ruschlikon, Switzerland)
Atienza, David (EPFL, Lausanne, Switzerland)

Date 2022-12

Publié dans IEEE Transactions on Computers

Volume 2023, vol. 72, no. 7, pp. 1985 - 1998

Pagination 14 p.

DOI https://doi.org/10.1109/TC.2022.3230285

ISSN 0018-9340

Mots-clés (libres) hardware ; computational modeling ; computer architecture ; biological system modeling ; in-memory computing ; reduced instruction set computing ; recurrent neural networks

Type d'article scientifique

Domaine Ingénierie et Architecture

Ecole HEIG-VD

Institut ReDS - Reconfigurable & embedded Digital Systems

Le document apparaît dans Articles scientifiques
Global

Résumé

Détails

Actions

PDF