Skip to main content
QUICK REVIEW

[Paper Review] Towards Accurate Binary Convolutional Neural Network

Xiaofan Lin, Cong Zhao|arXiv (Cornell University)|Nov 30, 2017
Neural Networks and Applications392 citations
TL;DR

This paper introduces ABC-Net, a scheme to train binary CNNs using multiple binary weight bases and multiple binary activations, achieving near full-precision accuracy on ImageNet while enabling efficient bitwise computation.

ABSTRACT

We introduce a novel scheme to train binary convolutional neural networks (CNNs) -- CNNs with weights and activations constrained to {-1,+1} at run-time. It has been known that using binary weights and activations drastically reduce memory size and accesses, and can replace arithmetic operations with more efficient bitwise operations, leading to much faster test-time inference and lower power consumption. However, previous works on binarizing CNNs usually result in severe prediction accuracy degradation. In this paper, we address this issue with two major innovations: (1) approximating full-precision weights with the linear combination of multiple binary weight bases; (2) employing multiple binary activations to alleviate information loss. The implementation of the resulting binary CNN, denoted as ABC-Net, is shown to achieve much closer performance to its full-precision counterpart, and even reach the comparable prediction accuracy on ImageNet and forest trail datasets, given adequate binary weight bases and activations.

Motivation & Objective

  • Motivate low-memory, low-power CNN deployment on mobile/hardware by binarizing weights and activations.
  • Propose a method to approximate full-precision weights with linear combinations of multiple binary bases.
  • Enhance information flow through networks by using multiple binary activations.
  • Demonstrate that with adequate bases, binary CNNs can approach full-precision accuracy on ImageNet.

Proposed method

  • Represent real-valued weights as a linear combination of M binary weight bases with B_m in {-1,+1}.
  • Compute B_m from the shifted, normalized real weights to enable backpropagation with STE.
  • Train using straight-through estimators to backpropagate through the binary bases and activations.
  • Represent real-valued activations as a linear combination of N binary activations A_n with coefficients beta_n, enabling richer information passage.
  • Perform convolution as a sum of M×N binary convolutions, enabling bitwise operations when activations are also binary.
  • Use batch normalization and learned shift parameters to stabilize and optimize multiple binary activations.

Experimental results

Research questions

  • RQ1How close can binary CNNs come to full-precision accuracy on ImageNet using multiple binary weight bases and multiple binary activations?
  • RQ2What is the impact of varying the number of weight bases M and activations N on classification accuracy?
  • RQ3Can weight approximation as a whole (or channel-wise) with binary bases effectively replace full-precision weights in CNNs?
  • RQ4How does ABC-Net compare to existing quantization/binzarization methods on large-scale datasets?

Key findings

ModelW (weights bitwidth)A (activations bitwidth)Top-1Top-5
Full-Precision Resnet-18323269.3%89.2%
BWN13260.8%83.0%
DoReFa-Net1459.2%81.5%
XNOR-Net1151.2%73.2%
BNN1142.2%67.1%
ABC-Net [5 binary weight bases, 5 binary activations]1165.0%85.9%
ABC-Net [5 binary weight bases, full-precision activations]13268.3%87.9%
  • ABC-Net with 5 binary weight bases and 5 binary activations achieves Top-1 65.0% and Top-5 85.9% on ImageNet with ResNet-18, nearing full-precision performance (69.3% Top-1, 89.2% Top-5).
  • Increasing M and N consistently improves accuracy, with larger configurations yielding smaller gaps to full-precision models (e.g., M=5, N=5 reduces the Top-1 gap to about 4.3%).
  • Compared to state-of-the-art binary/low-precision networks (BWN, XNOR-Net, BNN, DoReFa-Net), ABC-Net with multiple bases/activations delivers substantially higher Top-1/Top-5 accuracy while maintaining binary weights and activations.
  • Channel-wise weight approximation offers potentially finer-grained modeling than whole-weight approximation, though more training cost is involved.
  • Using full-precision activations with binary weights (or vice versa) also yields competitive results, illustrating flexibility in the ABC-Net design.

Better researchstarts right now

From paper design to paper writing, dramatically reduce your research time.

No credit card · Free plan available

This review was created by AI and reviewed by human editors.