QUICK REVIEW

[论文解读] Surrogate Gradient Learning in Spiking Neural Networks

Emre Neftci, Hesham Mostafa|arXiv (Cornell University)|Jan 28, 2019

Advanced Memory and Neural Computing参考文献 40被引用 149

一句话总结

本论文评审训练 SNNs，并倡导 surrogate gradient 方法以克服不可微的脉冲动力学，详细说明 SNNs 如何映射到 RNNs 以及 SGs 如何实现有效学习。

ABSTRACT

Spiking neural networks are nature's versatile solution to fault-tolerant and energy efficient signal processing. To translate these benefits into hardware, a growing number of neuromorphic spiking neural network processors attempt to emulate biological neural networks. These developments have created an imminent need for methods and tools to enable such systems to solve real-world signal processing problems. Like conventional neural networks, spiking neural networks can be trained on real, domain specific data. However, their training requires overcoming a number of challenges linked to their binary and dynamical nature. This article elucidates step-by-step the problems typically encountered when training spiking neural networks, and guides the reader through the key concepts of synaptic plasticity and data-driven learning in the spiking setting. To that end, it gives an overview of existing approaches and provides an introduction to surrogate gradient methods, specifically, as a particularly flexible and efficient method to overcome the aforementioned challenges.

研究动机与目标

解释在具有隐藏层的深度脉冲神经网络的训练挑战。
介绍并将代理梯度方法归类为对不可微的脉冲非线性性的一种灵活解决方案。
展示如何将 SNNs 表述为循环神经网络以将 RNNs 的训练技术迁移过来。
讨论在神经形态或硬件导向平台上实现训练的实际考虑。

提出的方法

将以电流为基础突触的脉冲 LIF 神经元表述为等效的 RCNN/RNN，以统一训练视角。
描述突触电流和膜电位的离散时间动力学方程（方程2–5），并将其与 RNN 状态更新相关联。
回顾并归类 RNNs 的训练方法（BP/Backpropagation Through Time, forward methods, locality），并将它们映射到 SNNs。
引入代理梯度（SG）方法以解决脉冲非线性的不微分性问题，包括 SG 如何与标准基于梯度的方法整合。
将平滑化方法（软/非线性模型、概率模型、速率编码、单脉冲时序）与基于 SG 的方法（通过代理导数的梯度）区分开来。
总结 SG 如何实现对具有各种编码方案和硬件考量的 SNN 的端到端训练。

实验结果

研究问题

RQ1在具有隐藏层的深度 SNNs 训练中，主要挑战是什么？
RQ2代理梯度方法如何解决不可微的脉冲非线性问题，以在 SNNs 中实现基于梯度的学习？
RQ3如何使用基于 RNN 的技术（如 BPTT 或前向方法）有效训练 SNNs？
RQ4代理梯度方法对硬件实现和局部学习规则的实际影响是什么？

主要发现

SNNs 可以被视为一种形式的 RNNs，从而将基于梯度的训练技术从 RNNs 转移到 SNNs。
代理梯度方法提供对脉冲非线性的连续、可微近似，从而实现有效的基于梯度的优化。
代理梯度方法可以与常规的反向传播（BPTT）或前向方法（包括eligibility traces）配合训练 SNNs。
对平滑化方法和基于 SG 的方法进行比较，强调代理的具体选择不如保留有用的梯度信号重要。
代理梯度技术适用于各种编码方案（速率、时序），并且可以适应在完全 BPTT 不实用的硬件约束下的实现。
文献综述涵盖了一系列代理导数（分段线性、快速 S 形、指数），以及它们在 RCNNs 与深度 SNNs 中的成功应用。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。