[论文解读] Progressive Neural Architecture Search
PNAS 引入了对 CNN 单元结构的渐进式、代理引导的搜索,在计算速度比以前的基于强化学习的 NAS 方法快多达8倍的情况下达到最先进的准确度。
We propose a new method for learning the structure of convolutional neural networks (CNNs) that is more efficient than recent state-of-the-art methods based on reinforcement learning and evolutionary algorithms. Our approach uses a sequential model-based optimization (SMBO) strategy, in which we search for structures in order of increasing complexity, while simultaneously learning a surrogate model to guide the search through structure space. Direct comparison under the same search space shows that our method is up to 5 times more efficient than the RL method of Zoph et al. (2018) in terms of number of models evaluated, and 8 times faster in terms of total compute. The structures we discover in this way achieve state of the art classification accuracies on CIFAR-10 and ImageNet.
研究动机与目标
- Motivate and reduce the computational cost of neural architecture search (NAS) for CNNs compared to RL and EA approaches.
- Propose a progressive, block-wise search over CNN cells combined with a surrogate predictor to guide expansion.
- Demonstrate that the discovered architectures reach state-of-the-art accuracy on CIFAR-10 and ImageNet with lower compute.
- Show that sharing a single cell type and progressively increasing complexity improves search efficiency and transferability.
提出的方法
- Define a hierarchical cell-based search space with B blocks per cell and a fixed set of block operations and inputs.
- Perform progressive search from simple (1-block) cells to deeper (B-block) cells by expanding candidates and predicting performance with a surrogate model.
- Train a population of proxy CNNs built from candidate cells to obtain validation accuracy as supervision.
- Train an ensemble surrogate predictor (MLP or RNN) to rank expanded cells and select top-K for the next generation, updating the predictor with new observed data.
- Construct final CNNs by stacking the best cell type with specified repeats and stride patterns, then train on target datasets (CIFAR-10 and ImageNet).
- Compare efficiency to NAS (RL-based) and random search, reporting model counts, total compute, and top validation accuracy.
实验结果
研究问题
- RQ1Can progressive, surrogate-guided search reduce the number of evaluated models in NAS while maintaining or improving accuracy?
- RQ2Does using a cell-based, progressively complex search space improve efficiency and transferability to larger datasets like ImageNet?
- RQ3How well do surrogate predictors (MLP vs RNN ensembles) rank promising architectures and generalize to larger, unseen cells?
- RQ4What is the practical speedup and accuracy trade-off of PNAS compared with NAS, random search, and Hierarchical EA on CIFAR-10 and ImageNet?
主要发现
- PNAS is up to 5x more efficient than the RL NAS method of Zoph et al. (2018) in the number of models evaluated.
- PNAS is up to 8x faster in total compute than the RL NAS method for the same search space.
- The architectures discovered by PNAS achieve state-of-the-art or competitive classification accuracies on CIFAR-10 and ImageNet.
- Using a progressive search with surrogate guidance enables exploring larger, more complex cells than direct full-CNN searches.
- An ML-based surrogate ensemble (especially MLP-ensemble) effectively ranks candidate cells for unseen, larger block counts, improving search efficiency.
- PNASNet-5 outperforms several NAS variants in ImageNet (Mobile and Large settings) while using substantially less compute than AmoebaNet implementations.
更好的研究,从现在开始
从论文设计到论文写作,大幅缩短您的研究时间。
无需绑定信用卡
本解读由 AI 生成,并经人工编辑审核。