QUICK REVIEW

[论文解读] Jet Flavour Tagging at FCC-ee with a Transformer-based Neural Network: DeepJetTransformer

F. Blekman, F. Canelli|arXiv (Cornell University)|Jan 1, 2024

Meteorological Phenomena and Simulations被引用 1

一句话总结

本文提出 DeepJetTransformer，一种基于 Transformer 的神经网络，用于 FCC-ee 的喷注味鉴别，通过使用粒子流对象、次级顶点和 V0 顶点，以及 K±/π± 分离，实现了对 b-、c- 和 s- 喷注的高性能识别。在 √s = 91.2 GeV 下，仅需 60 nb⁻¹ 的积分亮度，即可实现对 Z → s¯s 的 5σ 发现，证明了未来轻子对撞机上精确奇异夸克物理的可行性。

ABSTRACT

Jet flavour tagging is crucial in experimental high-energy physics. A tagging algorithm, DeepJetTransformer, is presented, which exploits a transformer-based neural network that is substantially faster to train than state-of-the-art graph neural networks. The DeepJetTransformer algorithm uses information from particle flow-style objects and secondary vertex reconstruction for $b$- and $c$-jet identification, supplemented by additional information that is not always included in tagging algorithms at the LHC, such as reconstructed $K_{S}^{0}$ and $Λ^{0}$ and $K^{\pm}/π^{\pm}$ discrimination. The model is trained as a multiclassifier to identify all quark flavours separately and performs excellently in identifying $b$- and $c$-jets. An $s$-tagging efficiency of $40\%$ can be achieved with a $10\%$ $ud$-jet background efficiency. The performance improvement achieved by including $K_{S}^{0}$ and $Λ^{0}$ reconstruction and $K^{\pm}/π^{\pm}$ discrimination is presented. The algorithm is applied on exclusive $Z o q\bar{q}$ samples to examine the physics potential and is shown to isolate $Z o s\bar{s}$ events. Assuming all non-$Z o q\bar{q}$ backgrounds can be efficiently rejected, a $5σ$ discovery significance for $Z o s\bar{s}$ can be achieved with an integrated luminosity of $60~ ext{nb}^{-1}$ of $e^{+}e^{-}$ collisions at $\sqrt{s}=91.2~\mathrm{GeV}$, corresponding to less than a second of the FCC-ee run plan at the $Z$ boson resonance.

研究动机与目标

开发一种适用于 FCC-ee 的 Z 玻色子共振环境的快速、精确的喷注味鉴别算法。
提升对奇异（s）夸克喷注的识别能力，因其低多重性和短寿命衰变产物而难以鉴别。
评估包含稀有但信息丰富的物理对象（如 K⁰ₛ、Λ⁰ 和 K±/π± 分离）对鉴别性能的影响。
通过使用真实探测器模拟，展示 DeepJetTransformer 的物理潜力，以高显著性隔离 Z → s¯s 衰变。
建立一种可扩展、可泛化的鉴别框架，适用于 FCC-ee 之外的未来轻子对撞机。

提出的方法

采用基于 Transformer 的神经网络架构，利用缩放点积注意力机制，建模喷注组分之间的复杂非局部依赖关系。
处理输入特征，包括粒子流对象、重建的次级顶点（SVs）、V0 顶点（K⁰ₛ、Λ⁰），以及 K±/π± 分离的粒子识别（PID）。
在重味 Transformer 模块中采用多头自注意力机制，动态加权喷注组分的重要性。
将模型训练为多类别分类器，使用蒙特卡洛模拟事件区分 b、c、s、u、d 和胶子喷注。
通过架构设计优化训练效率，实现比当前最先进的图神经网络更快的收敛速度。
将模型应用于独态 Z → q̄q 样本，以在真实探测器条件下评估其在隔离 Z → s̄s 衰变中的性能。

实验结果

研究问题

RQ1基于 Transformer 的模型是否能在训练速度更快的同时，优于现有的图神经网络（GNN）和深度神经网络（DNN）方法，在 FCC-ee 的喷注味鉴别中实现相当或更优的准确率？
RQ2与标准输入相比，包含 K⁰ₛ、Λ⁰ 和 K±/π± 分离是否能显著提升 s-喷注的识别性能？
RQ3在真实背景抑制条件下，实现 Z → s̄s 的 5σ 发现所需的最小积分亮度是多少？
RQ4鉴别器的性能在多大程度上依赖于顶点重建质量和粒子识别质量？
RQ5事件级鉴别策略（如要求对向半球具有高动量的奇异强子）在多大程度上能增强 s-夸克喷注的区分能力？

主要发现

DeepJetTransformer 在 u/d-喷注背景效率为 10% 的条件下，实现了 40% 的 s-喷注鉴别效率，表现出强大的区分能力。
引入 K⁰ₛ 和 Λ⁰ 重建后，在固定背景率下，s-喷注鉴别性能提升了 15–20% 的信号效率。
K±/π± 分离对 s-夸克识别具有显著贡献，尤其在高动量区域，此时奇异强子占主导。
在 √s = 91.2 GeV 下，仅需 60 nb⁻¹ 的积分亮度，DeepJetTransformer 即可实现对 Z → s̄s 的 5σ 发现，前提是所有非 Z → q̄q 背景均被有效抑制。
该模型在区分 b- 和 c- 喷注方面表现优异，AUC 值均超过 0.98，即使在复杂的喷注环境中亦然。
DeepJetTransformer 的训练时间显著短于当前最先进的图神经网络，使其成为探测器研发中快速原型设计的理想选择。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。