QUICK REVIEW

[論文レビュー] Domain Separation Networks

Konstantinos Bousmalis, George Trigeorgis|arXiv (Cornell University)|Aug 22, 2016

Domain Adaptation and Few-Shot Learning参考文献 31被引用数 588

ひとこと要約

DSN は、教師なしドメイン適応のためにプライベート（ドメイン特有）および共有表現を明示的にモデル化し、再構成損失と直交性損失を類似性ガイダンスと組み合わせて、従来の手法を上回る。

ABSTRACT

The cost of large scale data collection and annotation often makes the application of machine learning algorithms to new tasks or datasets prohibitively expensive. One approach circumventing this cost is training models on synthetic data where annotations are provided automatically. Despite their appeal, such models often fail to generalize from synthetic to real images, necessitating domain adaptation algorithms to manipulate these models before they can be successfully applied. Existing approaches focus either on mapping representations from one domain to the other, or on learning to extract features that are invariant to the domain from which they were extracted. However, by focusing only on creating a mapping or shared representation between the two domains, they ignore the individual characteristics of each domain. We suggest that explicitly modeling what is unique to each domain can improve a model's ability to extract domain-invariant features. Inspired by work on private-shared component analysis, we explicitly learn to extract image representations that are partitioned into two subspaces: one component which is private to each domain and one which is shared across domains. Our model is trained not only to perform the task we care about in the source domain, but also to use the partitioned representation to reconstruct the images from both domains. Our novel architecture results in a model that outperforms the state-of-the-art on a range of unsupervised domain adaptation scenarios and additionally produces visualizations of the private and shared representations enabling interpretation of the domain adaptation process.

研究の動機と目的

合成データとドメイン適応を活用して注釈コストを削減する動機付け。
2つのサブスペース表現を提案：private（ドメイン特異）とshared（ドメイン不変）。
ソースドメインのタスク損失と再構成・直交損失を用いて表現を分離して訓練する。
適応中のプライベートおよび共有コンポーネントの可視化・解釈性を可能にする。

提案手法

各ドメインに対して共有エンコーダEcとプライベートエンコーダEpを導入する。
EcとEp表現から入力を再構成する共有デコーダDを使用する。
プライベートと共有コンポーネントの直交性を促進する差分損失（L_difference）を課す。
DANN（GRL付き）またはMMDを用いて、ドメイン間で共有表現を整合させる類似性損失（L_similarity）を課す。
ソースラベルのタスク損失と両ドメインの再構成損失で訓練し、ウォームアップ期間後にドメイン適応損失を適用する。
複数のドメイン適応シナリオにわたってアーキテクチャと訓練手順を共有する。

実験結果

リサーチクエスチョン

RQ1ドメインプライベートおよびドメイン共有表現を明示的にモデル化することは、教師なしドメイン適応をどのように改善できるか。
RQ2プライベート-共有の分割と再構成損失は、従来のマッピングまたは共有表現メソッドよりもクロスドメイン一般化を高めるか。
RQ3どの類似性目的（DANN対MMD）がDSN内のクロスドメイン整合性を最も促進するか。
RQ4適応中にプライベートと共有表現の解釈可能な可視化をDSNは提供できるか。

主な発見

モデル	MNIST-M	SVHN	MNIST	GTSRB
ソースのみ	56.6 (52.2)	86.7 (86.7)	59.2 (54.9)	85.1 (79.0)
CORAL [27]	57.7	85.2	63.1	86.9
MMD [30,18]	76.9	88.0	71.1	91.1
DANN [8]	77.4 (76.6)	90.3 (91.0)	70.7 (73.8)	92.9 (88.6)
DSN w/ MMD (ours)	80.5	88.5	72.2	92.6
DSN w/ DANN (ours)	83.2	91.2	82.7	93.1
ターゲットのみ	98.7	92.4	99.5	99.8

DSN with DANN similarity loss outperforms CORAL, MMD, and standard DANN baselines across multiple unsupervised domain adaptation tasks.
In ablations, removing the difference (orthogonality) loss degrades performance across tasks.
Replacing scale-invariant reconstruction with standard L2 MSE reduces performance, validating the proposed reconstruction loss.
DSN achieves strong accuracy on MNIST→MNIST-M, Synth Digits→SVHN, SVHN→MNIST, and Synth Signs→GTSRB tasks.
DSN also yields competitive pose estimation results in the Synth Objects→LINEMOD scenario.

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。