Skip to main content
QUICK REVIEW

[논문 리뷰] Isolating Sources of Disentanglement in Variational Autoencoders

Ricky T. Q. Chen, Xuechen Li|arXiv (Cornell University)|2018. 02. 14.
Generative Adversarial Networks and Image Synthesis참고 문헌 46인용 수 145
한 줄 요약

이 논문은 ELBO를 분해하여 총상관 항을 분리하고, 추가 하이퍼파라미터 없이 beta-VAE를 보완하는 plug-in으로 beta-TCVAE를 도입하며, classifier-free disentanglement 지표 MIG를 제안한다. 또한 총상관과 데이터셋 전반의 disentanglement를 경험적으로 연결한다.

ABSTRACT

We decompose the evidence lower bound to show the existence of a term measuring the total correlation between latent variables. We use this to motivate our $β$-TCVAE (Total Correlation Variational Autoencoder), a refinement of the state-of-the-art $β$-VAE objective for learning disentangled representations, requiring no additional hyperparameters during training. We further propose a principled classifier-free measure of disentanglement called the mutual information gap (MIG). We perform extensive quantitative and qualitative experiments, in both restricted and non-restricted settings, and show a strong relation between total correlation and disentanglement, when the latent variables model is trained using our framework.

연구 동기 및 목표

  • Motivate and quantify disentanglement in VAEs by decomposing the ELBO to identify the total correlation term.
  • Propose a training method that weights decomposition terms without introducing new hyperparameters.
  • Introduce beta-TCVAE as a plug-in replacement for beta-VAE with automatic disentanglement benefits.
  • Propose a classifier-free, information-theoretic metric (MIG) to evaluate disentanglement across latent distributions.

제안 방법

  • Derive an ELBO decomposition revealing index-code MI, total correlation, and dimension-wise KL terms.
  • Propose minibatch-weighted sampling to estimate decomposition terms without extra hyperparameters.
  • Define beta-TCVAE as a special case with alpha=gamma=1 and beta controlling TC penalty.
  • Provide an alternative training approach to estimate TC without a discriminator.

실험 결과

연구 질문

  • RQ1Does penalizing the total correlation term in the ELBO promote disentanglement in VAEs?
  • RQ2Can beta-TCVAE achieve better disentanglement than beta-VAE without adding training complexity?
  • RQ3Is there a robust, classifier-free metric to quantify disentanglement across latent distributions?
  • RQ4How does total correlation correlate with disentanglement across datasets and sampling biases?

주요 결과

  • beta-TCVAE yields more interpretable disentangled representations than beta-VAE in several datasets.
  • Total correlation correlates negatively with disentanglement under beta-TCVAE, supporting the TC penalty's role.
  • MIG provides a classifier-free, axis-aligned, generalizable disentanglement measure applicable to various latent distributions.
  • The proposed minibatch weighting allows training with TC weighting without additional hyperparameters.
  • FactorVAE, which is similar in objective, can be outperformed when density ratio tricks are hard to train, highlighting beta-TCVAE robustness.
  • beta-TCVAE remains effective even under non-uniform or dependent factor sampling, improving interpretability over baselines.]
  • table_headers: [],
  • table_rows: []} } - This JSON got malformed. Let me fix. The
  • inserted incorrectly. I'll produce correct JSON. I'll remove stray quotes and ensure proper fields. Also the title field originally should be

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.