Front Neurosci. 2020 Nov 12;14:598876. doi: 10.3389/fnins.2020.598876. eCollection 2020.

Unsupervised Adaptive Weight Pruning for Energy-Efficient Neuromorphic Systems

Wenzhe Guo et al. Front Neurosci. 2020.

Abstract

To tackle real-world challenges, deep and complex neural networks are generally used with a massive number of parameters, which demand large memory, extensive computation, and high energy consumption in neuromorphic hardware systems. In this work, we propose an unsupervised online adaptive weight pruning method that dynamically removes non-critical weights from a spiking neural network (SNN) to reduce network complexity and improve energy efficiency. The method exploits the neural dynamics and firing activity of the SNN to adapt the pruning threshold over time and across neurons during training, allowing the network to effectively identify the critical weights associated with each neuron. The adaptive thresholds balance each neuron's connection strength to the previous layer and prevent weak neurons from failing after pruning. We also evaluated the improvement in the energy efficiency of SNNs with our method by counting synaptic operations (SOPs). Simulation results and detailed analyses reveal that adapting the pruning threshold significantly improves network performance and reduces the number of SOPs. The pruned SNN with 800 excitatory neurons achieves a 30% reduction in SOPs during training and a 55% reduction during inference, with only 0.44% accuracy loss on the MNIST dataset. Compared with a previously reported online soft pruning method, the proposed adaptive pruning method achieves 3.33% higher classification accuracy and a 67% greater reduction in SOPs. The effectiveness of our method was confirmed on different datasets and for different network sizes. Our evaluation showed that the implementation overhead of the adaptive method in terms of speed, area, and energy is negligible. This work therefore offers a promising solution for effective network compression and for building highly energy-efficient neuromorphic systems for real-time applications.
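
As a concrete illustration, the following Python/NumPy sketch shows the idea of unsupervised adaptive weight pruning: weights below a threshold are removed permanently (hard pruning), and the threshold is adapted both over training time and per neuron from firing activity. This is a minimal sketch under assumed rules, not the authors' implementation; all names, shapes, and the specific adaptation formulas are illustrative.

    import numpy as np

    rng = np.random.default_rng(0)
    n_inputs, n_exc = 784, 100                     # MNIST pixels -> WTA layer
    w = rng.uniform(0.0, 0.3, (n_inputs, n_exc))   # input-to-excitatory weights
    mask = np.ones_like(w, dtype=bool)             # False = synapse removed

    def adaptive_threshold(wth0, t, t_total, spike_counts, si=30):
        # Assumed rule: the base threshold grows with training progress,
        # and neurons are grouped by spike count (interval si); weakly
        # firing groups get a lower threshold so they keep more synapses.
        wth_t = wth0 * (1.0 + t / t_total)         # adaptation over time
        group = spike_counts // si                 # adaptation over neurons
        scale = 0.5 + 0.5 * group / max(group.max(), 1)
        return wth_t * scale                       # one threshold per neuron

    def prune(w, mask, wth_per_neuron):
        # Hard pruning: synapses below each neuron's threshold are removed
        # for good (never updated or evaluated again).
        mask &= w >= wth_per_neuron                # broadcasts over input rows
        w *= mask
        return w, mask

    spike_counts = rng.integers(0, 200, n_exc)     # per-neuron firing activity
    wth = adaptive_threshold(0.036, t=10_000, t_total=60_000,
                             spike_counts=spike_counts)
    w, mask = prune(w, mask, wth)
    connectivity = mask.mean()                     # fraction of unpruned weights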

Keywords: STDP; neuromorphic computing; pattern recognition; pruning; spiking neural networks; unsupervised learning.

Figures

FIGURE 1
Overview of the proposed adaptive pruning process in SNNs. The SNN architecture consists of two layers: an input layer and a winner-take-all (WTA) layer in which excitatory and inhibitory neurons are connected to each other. Pruning occurs only in the synapses from the input layer to the WTA layer. wth is the weight pruning threshold.
FIGURE 2
Network dynamics in the SNN with 100 excitatory neurons, monitored every 5,000 training images during training without pruning. (A) Firing activity, averaged over the spike counts of the 100 excitatory neurons. (B) Statistics of the weight updates Δw, with the mean (black) and variance (red).
FIGURE 3
(A) Illustration of the online adaptive pruning scheme over time, where wth0 is the initial pruning threshold and wth(t) is the pruning threshold at time t. (B) Evolution of the pruning threshold over time with wth0 set to 0.036.
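
The exact forms of the adaptation functions f1, f2, and f3 compared in Figure 4 are not given on this page; the sketch below only illustrates, under assumption, the kind of monotone schedules the time-dependent threshold wth(t) could follow, starting from wth0 = 0.036 as in Figure 3B.

    import numpy as np

    wth0 = 0.036                      # initial pruning threshold (Figure 3B)
    t = np.linspace(0.0, 1.0, 101)    # normalized training progress

    # Three illustrative (assumed) growth schedules for wth(t):
    f_linear  = wth0 * (1.0 + t)                                # linear
    f_exp     = wth0 * np.exp(t)                                # exponential
    f_sigmoid = wth0 * 2.0 / (1.0 + np.exp(-6.0 * (t - 0.5)))   # saturating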
FIGURE 4
Simulation results of the online adaptive pruning over time for different adaptation functions in the SNN trained on the MNIST dataset with 100 excitatory neurons. (A,C,E) Network connectivity vs. initial threshold for the adaptation functions f1, f2, and f3, respectively. (B,D,F) Accuracy vs. network connectivity for f1, f2, and f3, respectively. Connectivity is defined as the percentage of unpruned weights among all weights.
FIGURE 5
Illustration of the online adaptive pruning scheme over neurons. wthn is the pruning threshold for the n-th group (Gn), and Nn is the number of neurons in the n-th group. SI = 50 is used as an example to demonstrate the grouping method on an example spike count distribution.
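
As a worked example of the grouping in Figure 5 (the per-group threshold rule is an assumption, not taken from the paper):

    import numpy as np

    SI = 50                                          # spike count interval
    spike_counts = np.array([12, 47, 55, 103, 149, 160])
    groups = spike_counts // SI                      # group index n per neuron
    # groups -> [0, 0, 1, 2, 2, 3]

    # Assumed rule: more active groups tolerate higher thresholds, so weakly
    # firing neurons keep more synapses and do not fail after pruning.
    wth0 = 0.036
    wth_per_group = wth0 * (1.0 + 0.1 * np.arange(groups.max() + 1))
    wth_per_neuron = wth_per_group[groups]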
FIGURE 6
Simulation results of the online adaptive pruning over neurons for different adaptation functions in the SNN trained on the MNIST dataset with 100 excitatory neurons. (A,C,E) Network connectivity vs. initial threshold for the adaptation functions f1, f2, and f3, respectively. (B,D,F) Accuracy vs. connectivity for f1, f2, and f3, respectively. Connectivity is defined as the percentage of unpruned weights. The spike count interval is set to 30.
FIGURE 7
Performance comparison among different adaptation functions in the SNN trained on the MNIST dataset with 100 neurons. (A) APT: online adaptive pruning over time. (B) APN: online adaptive pruning over neurons. The spike count interval is set to 30.
FIGURE 8
Simulation results of online adaptive pruning over neurons for different spike count intervals (SI) in the SNN trained on the MNIST dataset with 100 neurons. (A) Connectivity vs. initial threshold. (B) Accuracy vs. connectivity. SI = Infinity (Inf) means there is only one group and hence no adaptation over neurons.
FIGURE 9
Comparison among different weight pruning methods in the SNN trained on different datasets. MNIST dataset: (A) 100 excitatory neurons and (B) 800 excitatory neurons. Fashion-MNIST dataset: (C) 100 excitatory neurons and (D) 800 excitatory neurons.
FIGURE 10
Trained weight maps on (A) the MNIST dataset and (B) the Fashion-MNIST dataset in the SNN with 100 neurons without pruning. Each pattern in the maps is formed by arranging the weights associated with each neuron into a 28 × 28 matrix.
FIGURE 11
MNIST accuracy results at different connectivity values in the SNN with 100 neurons after applying the APTN method (adaptive pruning over time and neurons). The number of pre-pruning training images was varied over 1,000, 10,000, 20,000, 30,000, 50,000, and 60,000.
FIGURE 12
(A) Normalized SOPs/image and (B) a figure of merit (FOM) at different connectivity values, obtained in the SNNs with 100 and 800 excitatory neurons using the online adaptive pruning over time and neurons (APTN) method.
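
For intuition on why SOPs track connectivity, here is a hedged sketch of counting SOPs: each presynaptic spike costs one operation per surviving synapse, so removing synapses cuts operations roughly in proportion. The FOM formula shown is a hypothetical definition for illustration; the paper's exact FOM is not given on this page.

    import numpy as np

    def sops_per_image(input_spikes, mask):
        # input_spikes: spike count per input neuron for one image;
        # mask: boolean (n_inputs, n_exc) array of surviving synapses.
        fan_out = mask.sum(axis=1)        # live outgoing synapses per input
        return int(input_spikes @ fan_out)

    def fom(accuracy, sops, baseline_sops):
        # Hypothetical merit: accuracy per normalized SOP cost, so higher
        # accuracy at fewer operations scores better.
        return accuracy * baseline_sops / sops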
FIGURE 13
Comparison between online weight pruning methods and an online adaptive neuron pruning method in the SNN trained on the MNIST dataset with 100 excitatory neurons. (A) Accuracy and (B) normalized SOPs/image vs. connectivity. CWP, AWP, and ANP stand for constant weight pruning, adaptive weight pruning, and adaptive neuron pruning, respectively.
FIGURE 14
Comparison with the online soft weight pruning method adopted from Shi et al. (2019) in the SNN trained on the MNIST dataset with 100 excitatory neurons. (A) Accuracy and (B) normalized SOPs/image vs. the percentage of unpruned weights. Since the soft pruning method does not remove pruned weights, connectivity is not applicable as the x-axis here; instead, the unpruned percentage is used, defined as the percentage of unpruned weights among all weights before pruning.
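
The distinction the caption draws can be made concrete with a schematic contrast (an illustration, not either paper's implementation): soft pruning zeroes small weights but keeps the synapses, so connectivity never changes, while hard pruning removes them from the connectivity mask.

    import numpy as np

    def soft_prune(w, wth):
        # Soft pruning: weights below threshold are set to zero, but the
        # synapses remain in the network, so connectivity is unchanged.
        w = w.copy()
        w[w < wth] = 0.0
        return w

    def hard_prune(w, mask, wth):
        # Hard pruning: synapses below threshold are removed from the mask
        # and never evaluated again, which is what reduces SOPs.
        mask = mask & (w >= wth)
        return w * mask, mask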
FIGURE 15
Simulation runtime. (A) Total network simulation runtime during training at different network connectivity values after applying the proposed adaptive pruning method (APTN). (B) Pruning algorithm runtime as a percentage of the total network simulation time at different network connectivity values. Batch sizes of 100, 1,000, and 5,000 were used.

References

    1. Anwar S., Hwang K., Sung W. (2017). Structured pruning of deep convolutional neural networks. J. Emerg. Technol. Comput. Syst. 13:32. 10.1145/3005348 - DOI
    2. Azarian K., Bhalgat Y., Lee J., Blankevoort T. (2020). Learned threshold pruning. ArXiv [Preprint]. Available online at: https://arxiv.org/pdf/2003.00075.pdf (accessed October 4, 2020).
    3. Burkitt A. N. (2006). A review of the integrate-and-fire neuron model: I. Homogeneous synaptic input. Biol. Cybernet. 95, 1–19. 10.1007/s00422-006-0068-6 - DOI - PubMed
    4. Davies M., Srinivasa N., Lin T., Chinya G., Cao Y., Choday S. H., et al. (2018). Loihi: a neuromorphic manycore processor with on-chip learning. IEEE Micro 38, 82–99. 10.1109/MM.2018.112130359 - DOI
    5. Diehl P., Cook M. (2015). Unsupervised learning of digit recognition using spike-timing-dependent plasticity. Front. Comput. Neurosci. 9:99. 10.3389/fncom.2015.00099 - DOI - PMC - PubMed
