QUICK REVIEW

[논문 리뷰] Self-supervised GAN: Analysis and Improvement with Multi-class Minimax Game

Ngoc-Trung Tran, Viet-Hung Tran|arXiv (Cornell University)|2019. 11. 16.

Human Pose and Action Recognition인용 수 43

한 줄 요약

논문은 기존의 자기지도(self-supervised) 작업이 GAN 학습과 어떻게 상호작용하는지 분석하고, 모드 붕괴를 일으킬 수 있는 허점을 드러내며, 다중 클래스 미니맥스(Self-Supervised) 게임을 제안해 GAN 수렴을 개선하고 여러 데이터셋에서 강한 FID 점수를 달성한다.

ABSTRACT

Self-supervised (SS) learning is a powerful approach for representation learning using unlabeled data. Recently, it has been applied to Generative Adversarial Networks (GAN) training. Specifically, SS tasks were proposed to address the catastrophic forgetting issue in the GAN discriminator. In this work, we perform an in-depth analysis to understand how SS tasks interact with learning of generator. From the analysis, we identify issues of SS tasks which allow a severely mode-collapsed generator to excel the SS tasks. To address the issues, we propose new SS tasks based on a multi-class minimax game. The competition between our proposed SS tasks in the game encourages the generator to learn the data distribution and generate diverse samples. We provide both theoretical and empirical analysis to support that our proposed SS tasks have better convergence property. We conduct experiments to incorporate our proposed SS tasks into two different GAN baseline models. Our approach establishes state-of-the-art FID scores on CIFAR-10, CIFAR-100, STL-10, CelebA, Imagenet $32 imes32$ and Stacked-MNIST datasets, outperforming existing works by considerable margins in some cases. Our unconditional GAN model approaches performance of conditional GAN without using labeled data. Our code: https://github.com/tntrung/msgan

연구 동기 및 목표

GAN에서 생성기 학습에 기존 자기지도 작업이 어떤 영향을 미치는지 이해한다.
현재 SS-task 설계의 취약점을 식별하여 모드 붕괴를 가능하게 하는 요인을 파악한다.
GAN 학습을 안정화하고 분포 매칭을 개선하는 강건한 자기지도 프레임워크를 개발한다.
다양한 무조건적 GAN 기반선 및 데이터셋에서 실증적 이득을 입증한다.]
method:[
GAN 목표와 SS 작업 Psi(G,C) 및 Phi(G,C)의 최적 판별기와 분류기에서의 상호작용을 분석한다.
기존의 SS 작업이 모드 붕괴된 G에 의해 Phi(G,C)를 학습 없이 최대화하도록 악용될 수 있음을 보인다.
다중 클래스 미니맥스 게임에 기반한 Psi^{+}(G,C) 및 Phi^{+}(G,C)를 도입하여 GAN과 SS 작업을 결합한다.
회전된(변환된) 샘플을 이용한 수렴 특성과 분포 매칭이 향상됨을 이론적으로 도출한다.
새로운 목적 함수의 KL 기반 항이 P_g를 P_d와 일치시키는 업데이트를 정렬함을 증명한다.
CIFAR-10/100, STL-10, CelebA, ImageNet32×32, Stacked-MNIST에서 실험적으로 검증한다.]
research_questions:[
기존의 자기지도 작업이 생성기 학습 중 GAN 목표와 어떻게 상호작용하는가?
현 SS-task 설계가 모드 붕괴를 방지하고 데이터 분포 커버리지를 확보하는 데 어떤 한계가 있는가?
다중 클래스 미니맥스 자기지도 프레임워크가 무조건적 GAN의 수렴과 샘플 다양성을 개선할 수 있는가?
회전/변환된 샘플이 P_g를 P_d에 더 잘 정렬하기 위한 피드백 신호를 강화하는가?]
key_findings:[
기존의 SS 작업은 생성기가 데이터 분포를 학습하지 않고 SS 손실을 최대화하도록 악용될 수 있어 모드 붕괴의 위험이 있다.
제안된 다중 클래스 미니맥스 SS 작업은 더 강한 피드백을 제공하여 P_g와 P_d 간의 매칭을 개선한다.
이론적 분석은 회전된 샘플을 SS 게임에 활용함으로써 수렴 이득과 분포 매칭이 향상됨을 보여준다.
실험 결과 CIFAR-10, CIFAR-100, STL-10, CelebA, ImageNet32×32, Stacked-MNIST에서 여러 기반선에 대해 최첨단 FID 점수를 달성한다.
MS 접근을 이용한 무조건적 GAN은 레이블을 사용하지 않고도 일부 조건부 GAN의 성능에 근접한다.
table_headers:[]
table_rows:[]

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.