QUICK REVIEW

[論文レビュー] Triple Generative Adversarial Nets

Chongxuan Li, Kun Xu|arXiv (Cornell University)|Mar 7, 2017

Generative Adversarial Networks and Image Synthesis参考文献 25被引用数 230

ひとこと要約

Triple-GAN は半教師付き学習において p(x,y) を共同でモデル化する3つのプレイヤー（生成器、識別器、分類器）を導入し、最先端の分類性能を達成するとともに、クラス条件付き生成と潜在空間の補間を可能にします。

ABSTRACT

Generative Adversarial Nets (GANs) have shown promise in image generation and semi-supervised learning (SSL). However, existing GANs in SSL have two problems: (1) the generator and the discriminator (i.e. the classifier) may not be optimal at the same time; and (2) the generator cannot control the semantics of the generated samples. The problems essentially arise from the two-player formulation, where a single discriminator shares incompatible roles of identifying fake samples and predicting labels and it only estimates the data without considering the labels. To address the problems, we present triple generative adversarial net (Triple-GAN), which consists of three players---a generator, a discriminator and a classifier. The generator and the classifier characterize the conditional distributions between images and labels, and the discriminator solely focuses on identifying fake image-label pairs. We design compatible utilities to ensure that the distributions characterized by the classifier and the generator both converge to the data distribution. Our results on various datasets demonstrate that Triple-GAN as a unified model can simultaneously (1) achieve the state-of-the-art classification results among deep generative models, and (2) disentangle the classes and styles of the input and transfer smoothly in the data space via interpolation in the latent space class-conditionally.

研究の動機と目的

2つの主要なSSL GANの課題に対処する: (1) 生成器と識別器が同時に最適でない可能性がある、(2) 生成器が生成サンプルの意味を制御できない。
生成器、分類器、識別器の3プレイヤーからなるフレームワークを提案し、p(x,y) の結合分布をモデル化する。
分類器と生成器の分布が互換的な目的関数を通じてデータ分布へ収束することを保証する。

提案手法

3つのネットワーク: p(y|x)を近似する分類器C、p(x|y)を近似するクラス条件付き生成器G、実データと偽データラベルのペアを識別する識別器D。
A minimax game with adversarial losses: U(C,G,D) = E_{(x,y)~p(x,y)}[log D(x,y)] + α E_{(x,y)~p_c(x,y)}[log(1−D(x,y))] + (1−α)E_{(x,y)~p_g(x,y)}[log(1−D(G(y,z),y))], (α ∈ (0,1)).
A supervised regularization term R_L = E_{(x,y)~p(x,y)}[−log p_c(y|x)] to drive unique equilibrium.
Addition of pseudo discriminative loss R_P = E_{p_g}[−log p_c(y|x)] to leverage generated pairs for C.
Training procedure includes updating D, C, and G with minibatch SGD and unbiased estimators, plus practical SSL regularizations.

実験結果

リサーチクエスチョン

RQ1Triple-GAN は分類器とクラス条件付き生成器を共同で最適化し、両方が真のデータ分布へ収束することができるか。
RQ2単一の識別器を持つ三者間ゲームの導入が、二者間SSL GAN で見られる非互換性を防ぐことができるか。
RQ3モデルは限定ラベルの下で、クラスとスタイル因子を分離学習し、意味のあるクラス条件付き潜在空間補間を実現できるか。
RQ4偽識別損失が半教師付き分類とサンプル生成に与える影響は何か。

主な発見

Triple-GAN はラベルが限定された状態で MNIST、SVHN、CIFAR-10 の深屍生成モデルの中で最先端の半教師付き分類結果を達成する。
モデルはクラスとスタイルを分離し、潜在空間でクラス条件付きの滑らかな補間を可能にする。
提案されたゲームの平衡点は p(x,y) = p_g(x,y) = p_c(x,y) となり、設計されたユーティリティ下で C と G がデータ分布へ収束することを保証する。
偽識別損失と他のSSL正則化は分類精度とサンプル品質を改善する。
実証的結果は Triple-GAN が複数の SSL GAN ベースライン（例: Improved-GAN）をデータセット全体で上回り、特に SVHN と CIFAR-10 で有意な向上を示す。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。