QUICK REVIEW

[Paper Review] Motivation is Something You Need

Mehdi Acheli, Qi Yu|arXiv (Cornell University)|2026. 02. 24.
Neural and Behavioral Psychology Studies|Citations: 0
One-Line Summary

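This paper presents a neuroscience-inspired dual-model training framework that alternates between a base model and a larger motivated model, which is activated under predefined motivation conditions, to improve training efficiency and model performance.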

ABSTRACT

This work introduces a novel training paradigm that draws from affective neuroscience. Inspired by the interplay of emotions and cognition in the human brain and more specifically the SEEKING motivational state, we design a dual-model framework where a smaller base model is trained continuously, while a larger motivated model is activated intermittently during predefined "motivation conditions". The framework mimics the emotional state of high curiosity and anticipation of reward in which broader brain regions are recruited to enhance cognitive performance. Exploiting scalable architectures where larger models extend smaller ones, our method enables shared weight updates and selective expansion of network capacity during noteworthy training steps. Empirical evaluation on the image classification task demonstrates that, not only does the alternating training scheme efficiently and effectively enhance the base model compared to a traditional scheme, in some cases, the motivational model also surpasses its standalone counterpart despite seeing less data per epoch. This opens the possibility of simultaneously training two models tailored to different deployment constraints with competitive or superior performance while keeping training cost lower than when training the larger model.

Research Motivation and Objectives

  • Motivation: Reproduce a SEEKING-like motivational state to improve neural network training.
  • Propose a task-agnostic framework using a base model and a bigger motivated model within scalable architectures.
  • Show that alternating training improves the base model and can outperform standalone larger models under certain conditions.
  • Demonstrate efficiency gains and a train-once, deploy-twice paradigm for resource-constrained settings.

Proposed Method

  • Define four core elements: base model, motivated model, motivation condition, and weights map.
  • Implement a weights map to align base and motivated models within scalable architectures (ResNet, ViT, EfficientNet).
  • Trigger motivation when the training loss decreases for k consecutive batches, switching to training the motivated model.
  • Copy weights and optimizer state when switching between states to maintain training continuity.
  • Evaluate on image classification tasks across CIFAR, ImageNet, Flowers, Pets using ResNet, ViT, and EfficientNet architectures.
  • Use ACC/FLOPs and ACC/F_Ratio as efficiency metrics to compare against baseline and next-level models.
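
The switching rule and the weights map described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `MotivationTrigger`, `base_to_motivated`, and `motivated_to_base` are hypothetical, parameters are plain Python lists, each base tensor is assumed to occupy the leading entries of its motivated counterpart, the streak counter is assumed to reset after firing, and the optimizer-state copy is omitted.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional

Params = Dict[str, List[float]]


@dataclass
class MotivationTrigger:
    """Fires when the training loss has decreased for k consecutive batches."""
    k: int
    prev_loss: Optional[float] = None
    streak: int = 0

    def update(self, loss: float) -> bool:
        # A batch extends the streak only if its loss is strictly lower
        # than the loss of the previous batch.
        if self.prev_loss is not None and loss < self.prev_loss:
            self.streak += 1
        else:
            self.streak = 0
        self.prev_loss = loss
        if self.streak >= self.k:
            self.streak = 0  # assumption: start counting afresh after firing
            return True
        return False


def base_to_motivated(base: Params, motivated: Params,
                      weights_map: Dict[str, str]) -> None:
    """Copy each base tensor into the leading slice of its mapped tensor.

    Assumes every motivated tensor is at least as long as its base tensor.
    """
    for base_name, motivated_name in weights_map.items():
        src = base[base_name]
        motivated[motivated_name][:len(src)] = src


def motivated_to_base(motivated: Params, base: Params,
                      weights_map: Dict[str, str]) -> None:
    """Copy the shared leading slice back into the base model."""
    for base_name, motivated_name in weights_map.items():
        dst = base[base_name]
        dst[:] = motivated[motivated_name][:len(dst)]


# Toy usage with made-up numbers (not taken from the paper).
trigger = MotivationTrigger(k=3)
fired = [trigger.update(loss) for loss in [1.0, 0.9, 0.8, 0.7, 0.75, 0.6]]

base = {"fc": [0.1, 0.2]}
motivated = {"fc_wide": [0.0, 0.0, 0.5, 0.5]}
weights_map = {"fc": "fc_wide"}  # hypothetical base-to-motivated name mapping

base_to_motivated(base, motivated, weights_map)  # entering the motivated state
motivated["fc_wide"][0] = 0.3                    # stand-in for a training step
motivated_to_base(motivated, base, weights_map)  # returning to the base state
```

In this toy run the trigger fires only on the fourth batch, after three consecutive decreases, and the update made while motivated is carried back into the base model through the shared slice.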

Experimental Results

Research Questions

  • RQ1: Does motivation-inspired alternating training improve the base model performance compared to standard training across multiple architectures and datasets?
  • RQ2: Can the motivated model achieve competitive or superior performance to standalone larger models while reducing training cost?
  • RQ3: How does the choice of motivation condition and weight mapping affect learning and transfer performance?
  • RQ4: Is there a practical train-once, deploy-twice workflow enabled by this approach for resource-constrained scenarios?

Key Results

  • Motivation-inspired training improves base model accuracy on CIFAR-10, CIFAR-100, and ImageNet, and across ViT and EfficientNet variants.
  • On CIFAR datasets, base models often achieve better accuracy per FLOP than their next-level counterparts, with instances of larger motivated models surpassing standalone counterparts.
  • On ImageNet, the motivation-inspired scheme is up to 18x more efficient than the next-level model while delivering performance gains.
  • Transfer learning with motivated weights yields 4% to 29% improvements on downstream tasks like CIFAR-10, CIFAR-100, Flowers, and Pets.
  • EfficientNet experiments show the motivated model can outperform some classically trained larger models, with notable FLOP efficiency gains (up to 14x).
  • Ablation studies confirm the importance of a well-timed motivation condition and show that random activation of the motivated model degrades or, for some architectures, marginally affects performance.
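
An "N times more efficient" comparison of the kind reported above can be read as a ratio of accuracy-per-FLOP values. The sketch below shows that arithmetic under the assumption that efficiency is simply accuracy divided by FLOPs; the function names are hypothetical and the numbers are made-up placeholders, not results from the paper.

```python
import math


def accuracy_per_flop(accuracy: float, flops: float) -> float:
    """Accuracy divided by FLOPs (assumed reading of the ACC/FLOPs metric)."""
    if flops <= 0:
        raise ValueError("flops must be positive")
    return accuracy / flops


def relative_efficiency(acc_a: float, flops_a: float,
                        acc_b: float, flops_b: float) -> float:
    """How many times more accuracy per FLOP model A delivers than model B."""
    return accuracy_per_flop(acc_a, flops_a) / accuracy_per_flop(acc_b, flops_b)


# Placeholder values for illustration only.
ratio = relative_efficiency(acc_a=0.80, flops_a=1e9, acc_b=0.90, flops_b=4e9)
print(f"Model A is {ratio:.2f}x more efficient than model B")
```

With these placeholder inputs, the smaller model comes out about 3.56 times more efficient even though its accuracy is lower, which is why the review reports accuracy and efficiency side by side.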

This review was generated by AI and reviewed by a human editor.