Skip to main content
QUICK REVIEW

[论文解读] Enabling Deep Spiking Neural Networks with Hybrid Conversion and Spike Timing Dependent Backpropagation

Nitin Rathi, Gopalakrishnan Srinivasan|arXiv (Cornell University)|May 4, 2020
Advanced Memory and Neural Computing参考文献 22被引用 164
一句话总结

该论文提出了一种混合训练方法:从人工神经网络 (ANN) 转换得到一个已转换的脉冲神经网络(SNN)并通过脉冲时序相关反向传播(STDB)进行微调,以在大幅减少时间步数和训练成本的同时实现类似的准确率。

ABSTRACT

Spiking Neural Networks (SNNs) operate with asynchronous discrete events (or spikes) which can potentially lead to higher energy-efficiency in neuromorphic hardware implementations. Many works have shown that an SNN for inference can be formed by copying the weights from a trained Artificial Neural Network (ANN) and setting the firing threshold for each layer as the maximum input received in that layer. These type of converted SNNs require a large number of time steps to achieve competitive accuracy which diminishes the energy savings. The number of time steps can be reduced by training SNNs with spike-based backpropagation from scratch, but that is computationally expensive and slow. To address these challenges, we present a computationally-efficient training technique for deep SNNs. We propose a hybrid training methodology: 1) take a converted SNN and use its weights and thresholds as an initialization step for spike-based backpropagation, and 2) perform incremental spike-timing dependent backpropagation (STDB) on this carefully initialized network to obtain an SNN that converges within few epochs and requires fewer time steps for input processing. STDB is performed with a novel surrogate gradient function defined using neuron's spike time. The proposed training methodology converges in less than 20 epochs of spike-based backpropagation for most standard image classification datasets, thereby greatly reducing the training complexity compared to training SNNs from scratch. We perform experiments on CIFAR-10, CIFAR-100, and ImageNet datasets for both VGG and ResNet architectures. We achieve top-1 accuracy of 65.19% for ImageNet dataset on SNN with 250 time steps, which is 10X faster compared to converted SNNs with similar accuracy.

研究动机与目标

  • Motivate and address the high latency of ANN-SNN conversion and the high cost of training SNNs from scratch.
  • Propose a hybrid training pipeline that uses converted SNNs as initialization for STDB-based fine-tuning.
  • Develop a novel surrogate gradient based on spike timing to enable efficient spike-based learning.
  • Demonstrate scalability to deep architectures (VGG, ResNet) on CIFAR and ImageNet with reduced time steps.

提出的方法

  • Use an ANN-SNN converted network as the initialization for the SNN.
  • Train the initialized SNN with spike timing dependent backpropagation (STDB) using a surrogate gradient based on neuron spike times.
  • Define a surrogate gradient where the derivative of the spike activation with respect to membrane potential decays with the time since last spike (Δt).
  • Employ a soft-reset LIF neuron model with discrete-time updates suitable for PyTorch implementation.
  • Apply threshold balancing during conversion and keep a shared threshold per layer while learning with STDB.
  • Evaluate on CIFAR-10, CIFAR-100 and ImageNet using VGG and ResNet-like architectures.

实验结果

研究问题

  • RQ1Can a converted SNN initialized from an ANN reduce the training time and latency when trained end-to-end with spike-based backpropagation?
  • RQ2Does spike timing dependent backpropagation with a spike-time surrogate gradient enable deep SNNs to achieve competitive accuracy with fewer time steps?
  • RQ3How does the hybrid approach compare to pure ANN-SNN conversion and pure spike-based training across standard datasets?
  • RQ4What are the effects of threshold handling and architectural choices (VGG, Residual nets) on performance for SNNs trained with STDB?

主要发现

  • Hybrid training yields 10×–25× fewer time steps while preserving accuracy comparable to purely converted SNNs.
  • Training converges in fewer than 20 epochs for common image datasets.
  • On ImageNet with 250 time steps, the hybrid approach achieved 65.19% top-1 accuracy for VGG16, approaching ANN performance with much lower latency.
  • Hybrid STDB training reduces spike activity by about 1.5× compared to purely converted SNNs at iso conditions, improving energy efficiency.
  • Across CIFAR-10/100 and ImageNet, VGG/ResNet variants show near-ISO accuracy relative to ANNs and converted SNNs with significantly reduced time steps.
  • For CIFAR-10, the VGG16-based hybrid model attains 91.13% at T=100, and 92.02% at T=200 in the reported comparisons.

更好的研究,从现在开始

从论文设计到论文写作,大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成,并经人工编辑审核。