Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Quantized artificial neural networks implemented with spintronic stochastic computing

View through CrossRef
Abstract An artificial neural network (ANN) inference involves matrix vector multiplications that require a very large number of multiply and accumulate operations, resulting in high energy cost and large device footprint. Stochastic computing (SC) offers a less resource-intensive ANN implementation with minimal accuracy loss. Random number generators (RNG) are required to implement SC in hardware. These can be realized through stochastic-magnetic tunnel junctions (s-MTJ), where the energy barrier to switch between the ‘up’ and ‘down’ states is designed to be small, enabling thermal noise to generate a random bit stream. While s-MTJs have previously been used to implement SC-ANNs, these studies have been limited to architectures with continuously varying (i.e., analog) weights. In this work, we study the use of SC for matrix vector multiplication with quantized synaptic weights and quantized outputs. We show that a quantized SC-ANN, implemented by using experimentally obtained s-MTJ bitstreams and a limited number of discrete quantized states for both weights and hidden layer nodes in an ANN, can effectively reduce time (latency) and energy consumption in SC compared to an analog implementation, while largely preserving accuracy. We implemented quantization with 5 and 11 quantized states, along with SC configured with stochastic bitstream lengths of 100, 200, 300, 400, and 500 on neural networks with one hidden layer and three hidden layers. Inference was performed on the MNIST dataset for both training with SC and without SC. Training with SC provided better accuracy for all cases. For the shortest bitstream of 100 bits, the highest accuracies were 92% for one hidden layer and over 96% for three hidden layers. The overall system attained its peak accuracy of 96.82% using a 400 bit stochastic bitstream with three hidden layers. Our investigations demonstrate 9× improvement in latency and 2.6× improvement in energy consumption using the quantized SC approach compared to a similar s-MTJ based ANN architecture without quantization.
Title: Quantized artificial neural networks implemented with spintronic stochastic computing
Description:
Abstract An artificial neural network (ANN) inference involves matrix vector multiplications that require a very large number of multiply and accumulate operations, resulting in high energy cost and large device footprint.
Stochastic computing (SC) offers a less resource-intensive ANN implementation with minimal accuracy loss.
Random number generators (RNG) are required to implement SC in hardware.
These can be realized through stochastic-magnetic tunnel junctions (s-MTJ), where the energy barrier to switch between the ‘up’ and ‘down’ states is designed to be small, enabling thermal noise to generate a random bit stream.
While s-MTJs have previously been used to implement SC-ANNs, these studies have been limited to architectures with continuously varying (i.
e.
, analog) weights.
In this work, we study the use of SC for matrix vector multiplication with quantized synaptic weights and quantized outputs.
We show that a quantized SC-ANN, implemented by using experimentally obtained s-MTJ bitstreams and a limited number of discrete quantized states for both weights and hidden layer nodes in an ANN, can effectively reduce time (latency) and energy consumption in SC compared to an analog implementation, while largely preserving accuracy.
We implemented quantization with 5 and 11 quantized states, along with SC configured with stochastic bitstream lengths of 100, 200, 300, 400, and 500 on neural networks with one hidden layer and three hidden layers.
Inference was performed on the MNIST dataset for both training with SC and without SC.
Training with SC provided better accuracy for all cases.
For the shortest bitstream of 100 bits, the highest accuracies were 92% for one hidden layer and over 96% for three hidden layers.
The overall system attained its peak accuracy of 96.
82% using a 400 bit stochastic bitstream with three hidden layers.
Our investigations demonstrate 9× improvement in latency and 2.
6× improvement in energy consumption using the quantized SC approach compared to a similar s-MTJ based ANN architecture without quantization.

Related Results

Spintronic Materials and Devices for Artificial Intelligence Applications
Spintronic Materials and Devices for Artificial Intelligence Applications
ABSTRACT The core of spintronic technologies lies in the use of electrons’ intrinsic spin states, along with their electric charge, for information storage and pr...
Fuzzy Chaotic Neural Networks
Fuzzy Chaotic Neural Networks
An understanding of the human brain’s local function has improved in recent years. But the cognition of human brain’s working process as a whole is still obscure. Both fuzzy logic ...
La luz: de herramienta a lenguaje. Una nueva metodología de iluminación artificial en el proyecto arquitectónico.
La luz: de herramienta a lenguaje. Una nueva metodología de iluminación artificial en el proyecto arquitectónico.
The constant development of artificial lighting throughout the twentieth century helped to develop architecture to the current situation in which a new methodology is needed for ...
On the role of network dynamics for information processing in artificial and biological neural networks
On the role of network dynamics for information processing in artificial and biological neural networks
Understanding how interactions in complex systems give rise to various collective behaviours has been of interest for researchers across a wide range of fields. However, despite ma...
ACM SIGCOMM computer communication review
ACM SIGCOMM computer communication review
At some point in the future, how far out we do not exactly know, wireless access to the Internet will outstrip all other forms of access bringing the freedom of mobility to the way...
Backpropagation artificial neural network learning algorithm process impact based on hyperparameters
Backpropagation artificial neural network learning algorithm process impact based on hyperparameters
<p>The problem of computing machine passing the maze is one of theoretical computer science key tasks. This task was partially considered by the classics of computer science,...
Artificial neural network for the recognition of human emotions under a backpropagation algorithm
Artificial neural network for the recognition of human emotions under a backpropagation algorithm
The era of the technological revolution increasingly encourages the development of technologies that facilitate in one way or another people's daily activities, thus generating a g...
Memorization capacity and robustness of neural networks
Memorization capacity and robustness of neural networks
Machine learning, and deep learning in particular, has recently undergone rapid advancements. To contribute to a rigorous understanding of deep learning, this thesis explores two d...

Back to Top