QUICK REVIEW

[논문 리뷰] A robust anomaly finder based on autoencoders

Tuhin S. Roy, Aravind H. Vijay|arXiv (Cornell University)|2019. 03. 05.

Particle physics theoretical and experimental studies참고 문헌 80인용 수 67

한 줄 요약

이 논문은 공통 프레임에서 제트 질량과 에너지를 고정하는 새로운 제트 전처리와 자동인코더를 결합한 강건한 anti-QCD 제트 태거를 제시하며, 질량에 강건한 이상 탐지를 서로 다른 위상공간 영역에서 가능하게 한다.

ABSTRACT

We propose a robust method to identify anomalous jets by vetoing QCD-jets. The robustness of this method ensures that the distribution of the proposed discriminating variable (which allows us to veto QCD-jets) remains unaffected by the phase space of QCD-jets, even if they were different from the region on which the model was trained. This suggests that our method can be used to look for anomalous jets in high m/p T bins by simply training on jets from low m/p T bins, where sufficient background-enriched data is available. The robustness follows from combining an autoencoder with a novel way of pre-processing jets. We use momentum rescaling followed by a Lorentz boost to find the frame of reference where any given jet is characterized by predetermined mass and energy. In this frame we generate jet images by constructing a set of orthonormal basis vectors using the Gram-Schmidt method to span the plane transverse to the jet axis. Due to our preprocessing, the autoencoder loss function does not depend on the initial jet mass, momentum or orientation while still offering remarkable performance. We also explore the application of this loss function combined (using supervised learning techniques like boosted decision trees) with few other jet observables like the mass and Nsubjettiness for the purpose of top tagging. This exercise shows that our method performs almost as well as existing top taggers which use a large amount of physics information associated with top decays while also reinforcing the fact that the loss function is mostly independent of the additional jet observables.

연구 동기 및 목표

표준모형을 넘는 물리 현상을 찾기 위한 제트 데이터에서의 하향식 이상 탐색을 동기화한다.
QCD 제트 위상공간의 변동에 강건한 자동인코더 기반 이상 탐지기를 개발한다.
맞춤형 제트 전처리가 자동인코더 손실을 제트 질량과 운동량에 거의 독립적으로 만든다는 것을 보여준다.

제안 방법

제트를 고정 질량 m0으로 재스케일링하고 고정 에너지 E0로 로렌츠 변환(부스트)하여 전처리한다.
회전 대칭을 제거하기 위해 그래드-슈미트 직교 기저를 사용하여 횡단면 평면에 제트 이미지를 구성한다.
완전 연결(fully-connected) 및 컨볼루션(convolutional) 자동인코더를 모두 학습시켜 제트 이미지를 재구성하고 재구성 손실을 이상 탐지 점수로 사용한다.
에너지 정규화를 보존하기 위해 SoftMax 최종층을 사용하고 입력과 재구성 간의 L2 거리를 손실로 계산한다.
탐색 강건성을 평가하기 위해 하나의 rho 구간에서 훈련하고 다른 구간에서 테스트하며, 검출기 효과 여부에 관계없이 13 TeV 및 100 TeV 시뮬레이션에서 평가한다.
선택적으로 자동인코더 손실을 질량 및 N-subjettiness와 같은 제트 관측치와 결합하여 톱 태깅에 활용한다.

실험 결과

연구 질문

RQ1한 kinematic 지역에서 훈련된 자동인코더가 다른 지역의 QCD 제트를 효과적으로 배제하도록 만들 수 있는가? 제트 질량 분포를 조각내지 않고도 가능한가?
RQ2제안된 제트 전처리(질량/에너지 고정 및 Gram-Schmidt 기반 횡단면 기저)가 자동인코더 손실을 제트 질량 및 rho(mJ/pT R)로부터 분리하는가?
RQ3dense(밀집) vs convolutional(컨볼루션) 자동인코더의 성능은 강건성과 이상 식별 측면에서 어떻게 비교되는가? 톱-및 W-제트에 대해?
RQ4물리 기반 태깅기와 비교해 단독으로 또는 제트 관측치와 결합했을 때 방법이 경쟁력이 있는가?
RQ5Di-W hadronic decays와 같은 NP 시나리오에서 나타나는 이상을 방법으로 식별할 수 있는가?

주요 결과

전처리는 강건성을 크게 강화한다: 제트 질량에 대한 자동인코더 손실이 rho 구간 전반에 걸쳐 거의 독립적으로 나타난다.
컨볼루션 자동인코더가 테스트된 아키텍처 중에서 가장 강건한 성능을 제공한다.
단독으로 사용할 때 기존 접근법과 유사한 이상 탐지 성능을 달성하고, 톱 태깅에서 N-subjettiness와 결합하면 성능이 크게 향상된다.
저 rho QCD 제트에서 훈련하면 배경 순도가 제한적일 때도 높은 rho 영역에서 이상 탐색이 효과적으로 가능하다.
제트 질량이나 N-subjettiness와 결합했을 때 특정 구성에서 전용 톱 태거와 동등하거나 더 우수한 성능을 보여준다.
방법은 detector effects 및 MPI 변동 하에서도 효과를 유지하고, 다양한 시나리오에서 손실 함수의 강건성을 유지한다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.