Mixed-precision architecture based on computational memory for training deep neural networks

Irem Boybat Kara, Evangelos Eleftheriou, Abu Sebastian
2018
Conference paper

Abstract

Deep neural networks (DNNs) have revolutionized the field of machine learning by providing unprecedented human-like performance in solving many real-world problems such as image or speech recognition. Training large DNNs, however, is a computationally intensive task, which necessitates the development of novel computing architectures targeting this application. A computational memory unit in which resistive memory devices are organized in crossbar arrays can store the synaptic weights in the devices' conductance states. The expensive multiply-accumulate operations can then be performed in place using Kirchhoff's circuit laws in a non-von Neumann manner. A key challenge remains, however: the conductance states of the devices cannot be altered reliably during the weight update process. We propose a mixed-precision architecture that combines a computational memory unit storing the synaptic weights with a digital processing unit and an additional memory unit that stores the accumulated weight updates in high precision. The new architecture delivers classification accuracies comparable to those of floating-point implementations without being constrained by the non-ideal weight update characteristics of emerging resistive memories. A two-layer neural network in which the computational memory unit is realized using non-linear stochastic models of phase-change memory achieves a test accuracy of 97.40% on the MNIST digit classification problem.
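The division of labor described in the abstract can be illustrated with a small toy model. The sketch below is not the authors' implementation: the abstract does not say how accumulated updates are transferred to the devices, so the threshold-and-pulse rule used here is an assumption, as are the names `MixedPrecisionLayer`, `epsilon`, `chi`, and `write_noise`. The device model is also a deliberately simple one (fixed step size with optional Gaussian noise), not the non-linear stochastic phase-change memory model mentioned in the abstract.

```python
import math
import random


class MixedPrecisionLayer:
    """Toy model: coarse 'device' weights plus a high-precision update accumulator."""

    def __init__(self, n_out, n_in, epsilon=0.1, write_noise=0.0, seed=0):
        self.epsilon = epsilon          # assumed device update granularity
        self.write_noise = write_noise  # assumed relative std-dev of each write step
        self.rng = random.Random(seed)
        # Low-precision weights, standing in for device conductances.
        self.w = [[0.0] * n_in for _ in range(n_out)]
        # High-precision accumulated weight updates (the additional memory unit).
        self.chi = [[0.0] * n_in for _ in range(n_out)]

    def forward(self, x):
        # Multiply-accumulate; on a crossbar this would be performed in place.
        return [sum(wij * xj for wij, xj in zip(row, x)) for row in self.w]

    def accumulate(self, delta_w):
        """Add desired updates to chi; move whole multiples of epsilon to the devices.

        Returns the number of device pulses applied (assumed transfer rule).
        """
        pulses_applied = 0
        for i, row in enumerate(delta_w):
            for j, d in enumerate(row):
                self.chi[i][j] += d
                # Number of pulses, rounded toward zero; sign gives the direction.
                n = math.trunc(self.chi[i][j] / self.epsilon)
                if n == 0:
                    continue  # update too small for the device, keep accumulating
                for _ in range(abs(n)):
                    step = self.epsilon * (1.0 + self.rng.gauss(0.0, self.write_noise))
                    self.w[i][j] += math.copysign(step, n)
                # Subtract the ideal transferred amount; the remainder stays in chi.
                self.chi[i][j] -= n * self.epsilon
                pulses_applied += abs(n)
        return pulses_applied


layer = MixedPrecisionLayer(n_out=1, n_in=2, epsilon=0.1)
first = layer.accumulate([[0.04, 0.25]])   # only the second weight crosses epsilon
second = layer.accumulate([[0.04, 0.0]])   # 0.08 accumulated, still below epsilon
third = layer.accumulate([[0.04, 0.0]])    # 0.12 accumulated, one pulse is applied
output = layer.forward([1.0, 1.0])
```

In this sketch, small updates that a device could not represent on their own are not lost: they remain in `chi` until enough of them add up to one device step. Setting `write_noise` above zero makes each device write imprecise while the accumulator stays exact, which is the situation the architecture is meant to tolerate.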

Official source

https://infoscience.epfl.ch/record/261938?ln=fr

2D Nanosystems: Applications of 2D Semiconductors for In-Memory Computing

Deep Learning Generalization with Limited and Noisy Labels

Hardware-Software Co-design for Improved Resource Utilization in DNN Accelerators
