QUICK REVIEW

[Paper Review] Multimodal Model-Agnostic Meta-Learning via Task-Aware Modulation

Risto Vuorio, Shaohua Sun|arXiv (Cornell University)|Oct 30, 2019

Domain Adaptation and Few-Shot Learning | Cited by 72

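One-line summary

MMAML extends MAML with a modulation network that identifies the mode of a task drawn from a multimodal task distribution and modulates the meta-learned prior accordingly, enabling rapid adaptation across regression, classification, and reinforcement learning.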

ABSTRACT

Model-agnostic meta-learners aim to acquire meta-learned parameters from similar tasks to adapt to novel tasks from the same distribution with few gradient updates. With the flexibility in the choice of models, those frameworks demonstrate appealing performance on a variety of domains such as few-shot image classification and reinforcement learning. However, one important limitation of such frameworks is that they seek a common initialization shared across the entire task distribution, substantially limiting the diversity of the task distributions that they are able to learn from. In this paper, we augment MAML with the capability to identify the mode of tasks sampled from a multimodal task distribution and adapt quickly through gradient updates. Specifically, we propose a multimodal MAML (MMAML) framework, which is able to modulate its meta-learned prior parameters according to the identified mode, allowing more efficient fast adaptation. We evaluate the proposed model on a diverse set of few-shot learning tasks, including regression, image classification, and reinforcement learning. The results not only demonstrate the effectiveness of our model in modulating the meta-learned prior in response to the characteristics of tasks but also show that training on a multimodal distribution can produce an improvement over unimodal training.

Motivation and objectives

Identify the limitation of a single shared initialization in standard model-agnostic meta-learners under multimodal task distributions.
Propose Multimodal Model-Agnostic Meta-Learning (MMAML) to identify task modes and modulate meta-learned priors.
Enable rapid adaptation to new tasks via gradient updates after modulation.
Demonstrate generalization benefits from multimodal training across regression, image classification, and reinforcement learning.

Proposed method

A modulation network h analyzes a few (K) examples from the task to produce a task embedding v = h({(x_k, y_k)}; ω_h).
Task-specific parameters τ_i are generated as τ_i = g_i(v; ω_g) for each network block i.
Modulate each block of the task network via φ_i = θ_i ⊙ τ_i, where ⊙ denotes a general modulation operator (e.g., FiLM or attention-based modulation).
Use the modulated initialization to perform a few gradient steps that adapt to the target task, keeping τ_i fixed during adaptation.
Train with a meta-training procedure that optimizes both the meta-learner parameters θ and the modulation network parameters (ω_h, ω_g) as in Algorithm 1.
Evaluate multimodal task adaptation in three domains: regression, image classification, and reinforcement learning.
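
The modulate-then-adapt steps listed above can be sketched on a toy regression task. This is a minimal NumPy illustration, not the authors' implementation: the network sizes, the mean-pooled embedding, the `1 + Δγ` form of the FiLM scale, the learning rate, and every function name (`embed_task`, `film_params`, `inner_step`) are assumptions made for the example, and the randomly initialised weights stand in for meta-learned ones. The outer meta-training loop of Algorithm 1 is omitted.

```python
import numpy as np

rng = np.random.default_rng(0)
K, H, D_V = 5, 16, 8  # shots, hidden width, embedding size (all hypothetical)

def embed_task(x, y, w_h):
    """Modulation network h(.; ω_h): pool the K examples into an embedding v."""
    return np.tanh(np.concatenate([x, y], axis=1) @ w_h).mean(axis=0)

def film_params(v, w_g):
    """g(v; ω_g): produce a FiLM scale γ and shift β for the hidden block.

    Writing γ as 1 + Δγ is a choice of this sketch, not taken from the paper.
    """
    delta_gamma, beta = np.split(v @ w_g, 2)
    return 1.0 + delta_gamma, beta

def forward(theta, tau, x):
    """Task network whose hidden block is modulated as γ * h + β."""
    w1, b1, w2, b2 = theta
    gamma, beta = tau
    h = np.tanh(x @ w1 + b1)
    m = gamma * h + beta
    return m @ w2 + b2, (h, m)

def mse(theta, tau, x, y):
    pred, _ = forward(theta, tau, x)
    return float(np.mean((pred - y) ** 2))

def inner_step(theta, tau, x, y, lr):
    """One gradient step on θ only; τ stays fixed during adaptation."""
    w1, b1, w2, b2 = theta
    gamma, _ = tau
    pred, (h, m) = forward(theta, tau, x)
    d_pred = 2.0 * (pred - y) / len(x)              # dL/dpred for the MSE loss
    d_z = (d_pred @ w2.T) * gamma * (1.0 - h ** 2)  # back through FiLM and tanh
    grads = (x.T @ d_z, d_z.sum(axis=0), m.T @ d_pred, d_pred.sum(axis=0))
    return tuple(p - lr * g for p, g in zip(theta, grads))

# Hypothetical few-shot task: K points from one sinusoid.
x = rng.uniform(-2.0, 2.0, size=(K, 1))
y = np.sin(x + 0.5)

# Randomly initialised stand-ins for the meta-learned ω_h, ω_g and θ.
w_h = rng.normal(scale=0.5, size=(2, D_V))
w_g = rng.normal(scale=0.05, size=(D_V, 2 * H))
theta = (rng.normal(size=(1, H)), np.zeros(H),
         rng.normal(scale=0.1, size=(H, 1)), np.zeros(1))

v = embed_task(x, y, w_h)      # step 1: task embedding
tau = film_params(v, w_g)      # step 2: modulation parameters
loss_before = mse(theta, tau, x, y)

adapted = theta
for _ in range(5):             # steps 3-4: adapt the modulated network
    adapted = inner_step(adapted, tau, x, y, lr=0.01)
loss_after = mse(adapted, tau, x, y)
print(loss_before, loss_after)
```

In this sketch the modulation is applied to the hidden activations, which is how FiLM is usually realised; the embedding and τ are computed once from the K examples and only θ changes in the inner loop.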

Experimental results

Research questions

RQ1: Can MMAML identify task modes from few-shot task data and modulate the meta-learned prior accordingly?
RQ2: Does training on multimodal task distributions improve generalization relative to unimodal or single-initialization meta-learners across regression, classification, and RL?
RQ3: How does FiLM-based modulation compare to softmax-based attention for modulating network parameters?
RQ4: What is the impact of MMAML on fast adaptation performance versus MAML and Multi-MAML across different modes or datasets?

Key findings

MMAML with FiLM modulation outperforms standard MAML and achieves competitive results with Multi-MAML across multimodal regression and image classification benchmarks.
Task embeddings learned by the modulation network cluster according to task modes, indicating successful mode identification.
FiLM modulation yields more stable and stronger performance than softmax attention in the domains evaluated.
MMAML benefits from multimodal training, showing better generalization than unimodal-trained baselines in several tasks.
In reinforcement learning, MMAML consistently surpasses unmodulated ProMP baselines, with performance improving as task modes increase.
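
To make the FiLM versus softmax-attention comparison concrete, the two operators can be written side by side. This is a sketch under assumptions: the function names are hypothetical, the inputs are made-up numbers, and the review does not spell out the exact formulas, so the definitions below follow the common forms (FiLM as a feature-wise scale and shift, softmax attention as a feature-wise reweighting). It only shows what each operator can express and makes no claim about why one performed better in the experiments.

```python
import numpy as np

def film(features, tau_gamma, tau_beta):
    """FiLM: feature-wise affine transform with unconstrained scale and shift."""
    return features * tau_gamma + tau_beta

def softmax_attention(features, tau):
    """Softmax attention: reweight features with positive weights summing to 1."""
    weights = np.exp(tau - tau.max())  # subtract the max for numerical stability
    weights = weights / weights.sum()
    return features * weights, weights

features = np.array([1.0, -2.0, 0.5, 3.0])  # hypothetical block activations
tau = np.array([0.0, 1.0, -1.0, 2.0])       # hypothetical modulation vector

attended, weights = softmax_attention(features, tau)
flipped = film(features, tau_gamma=-np.ones(4), tau_beta=np.zeros(4))

print(weights.sum())  # attention weights form a probability distribution
print(attended)       # each feature is shrunk toward zero, sign preserved
print(flipped)        # FiLM can also flip signs or shift features
```

Softmax weights lie strictly between 0 and 1, so that operator can only attenuate features, whereas FiLM can amplify, negate, or offset them.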


This review was generated by AI and checked by a human editor.