QUICK REVIEW

[论文解读] An implementation of neural simulation-based inference for parameter estimation in ATLAS

G. Aad, Erlend Aakvaag|arXiv (Cornell University)|Jan 1, 2024

Particle Detector Development and Performance被引用 2

一句话总结

本文提出了一种用于ATLAS实验中高维、无分箱参数估计的神经模拟推断（NSBI）框架，利用神经网络在无需直方图分箱的情况下估计似然比。在模拟的非共振Higgs玻色子耦合测量中，该方法相较于传统直方图方法展现出更高的灵敏度，同时通过诊断和Neyman构造法严格量化了有限蒙特卡洛样本和系统效应带来的不确定性。

ABSTRACT

Neural simulation-based inference is a powerful class of machine-learning-based methods for statistical inference that naturally handles high-dimensional parameter estimation without the need to bin data into low-dimensional summary histograms. Such methods are promising for a range of measurements, including at the Large Hadron Collider, where no single observable may be optimal to scan over the entire theoretical phase space under consideration, or where binning data into histograms could result in a loss of sensitivity. This work develops a neural simulation-based inference framework for statistical inference, using neural networks to estimate probability density ratios, which enables the application to a full-scale analysis. It incorporates a large number of systematic uncertainties, quantifies the uncertainty due to the finite number of events in training samples, develops a method to construct confidence intervals, and demonstrates a series of intermediate diagnostic checks that can be performed to validate the robustness of the method. As an example, the power and feasibility of the method are assessed on simulated data for a simplified version of an off-shell Higgs boson couplings measurement in the four-lepton final states. This approach represents an extension to the standard statistical methodology used by the experiments at the Large Hadron Collider, and can benefit many physics analyses.

研究动机与目标

开发一种稳健、可扩展的NSBI框架，用于LHC实验中的高维参数估计，避免因直方图分箱导致的信息损失。
以与标准LHC统计实践一致的方式，整合大量系统不确定性。
量化并验证基于有限蒙特卡洛样本大小的神经网络似然比估计中的不确定性。
在NSBI背景下，通过Neyman构造法实现可靠置信区间的构建。
通过四轻子末态中真实的非共振Higgs玻色子耦合测量，展示该方法的可行性与灵敏度提升。

提出的方法

利用神经网络估计信号与背景假设之间的概率密度比，实现在无需分箱情况下的似然比估计。
采用基于集成的训练方法，提升估计的鲁棒性并量化密度比估计中的不确定性。
通过轮廓对数似然方法，将代表系统不确定性的多余参数直接整合到似然比函数中。
应用重加权与校准诊断，验证估计似然比的可靠性。
使用伪实验与Neyman构造法，在重加权可能引入负权重的情况下，生成具有有效覆盖性的置信区间。
整合物理贡献的解析因子分解至完整似然函数，实现高效训练与不确定性传播。

实验结果

研究问题

RQ1神经模拟推断能否有效扩展至包含高维、无分箱数据与大量系统不确定性的完整LHC分析？
RQ2在NSBI模型中，如何量化并传播有限蒙特卡洛样本大小带来的不确定性？
RQ3当重加权引入负权重时，是否仍能通过Neyman构造法在NSBI中构建可靠的置信区间？
RQ4在非线性参数依赖关系下，NSBI方法相较于传统直方图方法在灵敏度方面表现如何？
RQ5在高能物理背景下，验证NSBI模型鲁棒性与可靠性的必要且充分的诊断工具是什么？

主要发现

在非共振Higgs玻色子耦合测量中，NSBI框架相较于直方图方法实现了显著的灵敏度提升，尤其在参数依赖关系呈现非线性特征的区域。
该方法成功整合了超过100项系统不确定性，证明了其在真实LHC分析复杂度下的可扩展性。
诊断检查（如重加权闭合性、校准闭合性与集成分布）证实了估计似然比的鲁棒性与可靠性。
有限蒙特卡洛样本带来的不确定性得到了量化，且控制良好，在测试场景中对最终区间覆盖性的影响低于1%。
通过伪实验进行的Neyman构造生成的置信区间具备正确的覆盖特性，验证了该方法的统计有效性。
该框架实现了在高维、无分箱数据空间中的完整参数估计，避免了单变量直方图方法固有的灵敏度损失。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。