QUICK REVIEW

[논문 리뷰] Riemannian Manifold Hamiltonian Monte Carlo

Mark Girolami, Ben Calderhead|arXiv (Cornell University)|2009. 07. 06.

Markov Chains and Monte Carlo Methods참고 문헌 31인용 수 27

한 줄 요약

이 논문은 매개변수 공간의 리만 기하학을 활용하여 제어밀도를 자동으로 조정하는 완전 자동화된 MCMC 샘플러인 리만 다양체 해밀턴 몽테카를로(Riemannian Manifold Hamiltonian Monte Carlo, RMHMC)를 소개한다. 비분리 해밀토니안에 대해 반명시적 2차 비틀림 보존 적분기를 사용함으로써 RMHMC는 고차원이고 상관관계가 강한 사후 분포에서 우수한 수렴성과 탐색 효율성을 달성하여 로지스틱 회귀, 포인트 프로세스, 스토하스틱 볼atility, 그리고 동역학 시스템 모델에서 시간 정규화된 효과적 표본 크기(Effective Sample Size)를 크게 향상시킨다.

ABSTRACT

The paper proposes a Riemannian Manifold Hamiltonian Monte Carlo sampler to resolve the shortcomings of existing Monte Carlo algorithms when sampling from target densities that may be high dimensional and exhibit strong correlations. The method provides a fully automated adaptation mechanism that circumvents the costly pilot runs required to tune proposal densities for Metropolis-Hastings or indeed Hybrid Monte Carlo and Metropolis Adjusted Langevin Algorithms. This allows for highly efficient sampling even in very high dimensions where different scalings may be required for the transient and stationary phases of the Markov chain. The proposed method exploits the Riemannian structure of the parameter space of statistical models and thus automatically adapts to the local manifold structure at each step based on the metric tensor. A semi-explicit second order symplectic integrator for non-separable Hamiltonians is derived for simulating paths across this manifold which provides highly efficient convergence and exploration of the target density. The performance of the Riemannian Manifold Hamiltonian Monte Carlo method is assessed by performing posterior inference on logistic regression models, log-Gaussian Cox point processes, stochastic volatility models, and Bayesian estimation of parameter posteriors of dynamical systems described by nonlinear differential equations. Substantial improvements in the time normalised Effective Sample Size are reported when compared to alternative sampling approaches. Matlab code at \url{http://www.dcs.gla.ac.uk/inference/rmhmc} allows replication of all results.

연구 동기 및 목표

기존 몽테카를로 방법이 고차원이고 상관관계가 강한 사후 분포에서 효율성이 떨어지는 문제를 해결하기 위해.
메트로폴리스-해스팅스와 하이브리드 몽테카를로의 제어밀도 조정을 위한 비용이 많이 드는 프리런(pilot runs)이 필요 없도록 하기 위해.
매개변수 공간의 국소 기하학에 동적으로 적응할 수 있는 완전 자동화된 적응 메커니즘을 개발하기 위해.
고차원에서 마코프 체인의 일시적 및 정적 상태 단계 모두에서 효율적인 샘플링을 가능하게 하기 위해.
다양한 통계 모델에서 시간 정규화된 효과적 표본 크기(Effective Sample Size, ESS)를 향상시키기 위해.

제안 방법

매개변수 공간의 리만 다양체 구조를 사용하며, 국소 기하학은 피셔 정보 계량을 통해 정의된다.
각 단계에서 계량 텐서를 사용하여 제어밀도를 적응적으로 조정하는 해밀턴 몽테카를로 샘플러를 구축함으로써 국소 곡률에 최적의 스케일링을 보장한다.
비분리 해밀토니안에 대해서도 적용 가능한 반명시적 2차 비틀림 보존 적분기를 유도하여 다양체 위의 궤적을 시뮬레이션한다.
적분기는 비틀림 보존 구조를 유지하여 장기적인 안정성과 정확한 샘플링을 보장한다.
알고리즘은 마코프 체인의 일시적 및 정적 상태 단계에서 서로 다른 스케일링 요구 사항에 자동으로 적응한다.
이 방법은 MATLAB로 구현되었으며, 전체 재현 가능성을 확보하기 위해 http://www.dcs.gla.ac.uk/inference/rmhmc 에서 공개되어 있다.

실험 결과

연구 질문

RQ1사후 분포의 국소 리만 기하학에 자동으로 적응할 수 있는 해밀턴 몽테카를로 샘플러를 설계할 수 있는가?
RQ2RMHMC는 고차원이고 상관관계가 강한 모델에서 표준 HMC와 MALA에 비해 어떻게 성능을 발휘하는가?
RQ3비분리 해밀토니안을 위한 제안된 비틀림 보존 적분기는 복잡한 사후 기하학에서 정확성과 효율성을 유지할 수 있는가?
RQ4다양한 통계 모델에서 RMHMC는 시간 정규화된 효과적 표본 크기(Effective Sample Size)를 어느 정도 향상시키는가?
RQ5자동 적응 메커니즘이 MCMC 샘플링에서 수동 조정이나 프리런이 필요 없도록 하는가?

주요 결과

모든 테스트된 모델에서 RMHMC는 다른 샘플링 방법에 비해 시간 정규화된 효과적 표본 크기(Effective Sample Size)에서 상당한 향상을 달성한다.
이 방법은 특히 로지스틱 회귀나 스토하스틱 볼atility 모델에서 강한 상관관계가 있는 고차원 사후 분포를 효율적으로 탐색할 수 있도록 한다.
반명시적 비틀림 보존 적분기는 곡면 위에서 해밀토니안 역학을 안정적이고 정확하게 시뮬레이션한다.
계량 텐서를 통한 자동 적응 덕분에 프리런이나 제어밀도의 수동 조정이 필요 없어진다.
비선형 미분 방정식으로 정의된 비선형적이고 복잡한 매개변수 의존성 구조를 가진 모델에서 성능 향상이 특히 두드러진다.
제공된 MATLAB 구현을 통해 모든 결과가 재현 가능하다. http://www.dcs.gla.ac.uk/inference/rmhmc

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.