QUICK REVIEW

[论文解读] Soft-to-Hard Vector Quantization for End-to-End Learning Compressible Representations

Eirikur Agustsson, Fabian Mentzer|arXiv (Cornell University)|Apr 3, 2017

Advanced Image Processing Techniques参考文献 37被引用 262

一句话总结

论文介绍了一种端到端训练的软到硬向量量化框架，用于学习可压缩的特征表征和模型，在图像和DNN压缩方面取得具有竞争力的结果。

ABSTRACT

We present a new approach to learn compressible representations in deep architectures with an end-to-end training strategy. Our method is based on a soft (continuous) relaxation of quantization and entropy, which we anneal to their discrete counterparts throughout training. We showcase this method for two challenging applications: Image compression and neural network compression. While these tasks have typically been approached with different methods, our soft-to-hard quantization approach gives results competitive with the state-of-the-art for both.

研究动机与目标

Motivate learning compressible representations for deep networks and data types.
Jointly optimize model parameters, quantization levels, and entropy of the symbol stream.
Provide a unified framework for compression of both features in networks and whole models.

提出的方法

Introduce a soft (continuous) relaxation of quantization and entropy with an annealing schedule from soft to hard assignments.
Model encoder E as selecting nearest centers from a learned codebook; decoder D reconstructs from symbol indices.
Estimate entropy via a differentiable soft histogram and a soft entropy loss that upper-bounds the true entropy.
Anneal the soft quantization to hard assignments to enable end-to-end differentiable training.
Apply vector quantization (not just scalar) to better capture bottleneck statistics.
Demonstrate end-to-end learning for both image compression via a compressive autoencoder and DNN model compression (ResNet on CIFAR-10).

实验结果

研究问题

RQ1Can soft-to-hard vector quantization be trained end-to-end to minimize distortion plus rate (D + βR) in deep networks?
RQ2Does learning the quantization levels jointly with weights improve compressibility for both image data and model parameters?
RQ3How does vector quantization compare to scalar quantization in learned compression scenarios?
RQ4Can histogram-based entropy estimation without strong parametric assumptions yield competitive results?

主要发现

Method	Acc	Comp.
Original model	92.6	1.00
Pruning + ft. + index coding + H. Coding [12]	92.6	4.52
Pruning + ft. + k-means + ft. + I.C. + H.C. [11]	92.6	18.25
Pruning + ft. + Hessian-weighted k-means + ft. + I.C. + H.C.	92.7	20.51
Pruning + ft. + Uniform quantization + ft. + I.C. + H.C.	92.7	22.17
Pruning + ft. + Iterative ECSQ + ft. + I.C. + H.C.	92.7	21.01
Soft-to-Hard Annealing + ft. + H. Coding ( ours)	92.1	19.15
Soft-to-Hard Annealing + ft. + A. Coding ( ours)	92.1	20.15

Achieves competitive performance with state-of-the-art methods for both image compression and DNN model compression.
Vector quantization with soft-to-hard annealing improves rate-distortion trade-offs over scalar quantization.
Entropy loss based on soft histograms provides differentiable guidance for compressibility.
On CIFAR-10 with a 32-layer ResNet, achieves ~19–20× compression with minimal loss in accuracy.
For image compression, SHA outperforms JPEG/JPEG 2000 at high compression rates and is competitive with BPG on several datasets.

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。