CMS-PFT-25-001

CMS-PFT-25-001 ; CERN-EP-2025-300
Full event interpretation with machine-learning-based particle-flow reconstruction in the CMS detector
CMS Collaboration
24 January 2026
Submitted to Eur. Phys. J. C
Abstract: The particle-flow (PF) algorithm constructs a global description of each particle collision by producing a comprehensive list of final-state particles, and is central to event reconstruction in the CMS experiment at the CERN LHC. The existing PF implementation relies on physics-motivated heuristics and assumptions that can be replaced by machine-learning (ML) models trained directly on simulated data and naturally suited to modern graphics processing units (GPUs). A state-of-the-art ML-based PF (MLPF) reconstruction algorithm, implemented within the CMS software framework, is presented. The MLPF algorithm performs a learnable full-event reconstruction on GPUs, generalizes across detector conditions and collision energies, and replaces multiple modular reconstruction steps with a single unified model. Physics performance comparable to standard PF reconstruction is achieved in both simulation and data, with improved jet energy resolution and inference time. In simulated top quark-antiquark events under LHC Run-3 (2023-2024) conditions, the jet energy resolution improves by 10-20% for jets with transverse momentum between 30-100 GeV. Inference time is evaluated using simulated multijet events, with a median of 20 ms per event on an Nvidia L4 GPU, compared to approximately 110 ms for the standard CMS PF reconstruction.
Links: e-print arXiv:2601.17554 [hep-ex] (PDF) ; CDS record ; inSPIRE record ; CADI line (restricted) ;

Figures & Tables	Summary	References	CMS Publications

Figures
png pdf	Figure 1: Validation of the MLPF particle-level target, after removing the contributions from the pileup particles, using $ \mathrm{t} \overline{\mathrm{t}} $ simulation with pileup. The $ p_{\mathrm{T}} $ distribution of target particles and truth particles derived from PYTHIA stable particles (upper left). The $ p_{\mathrm{T}} $ distribution of jets clustered from each particle set (upper right). The jet response relative to particle-level jets (lower left). The target $ p_{\mathrm{T}}^\text{miss} $ as a function of the PYTHIA truth $ p_{\mathrm{T}}^\text{miss} $ (lower right). The target particle set, after removing the contribution from pileup, generally aligns well with the pileup-free PYTHIA truth.
png pdf	Figure 1-a: Validation of the MLPF particle-level target, after removing the contributions from the pileup particles, using $ \mathrm{t} \overline{\mathrm{t}} $ simulation with pileup. The $ p_{\mathrm{T}} $ distribution of target particles and truth particles derived from PYTHIA stable particles (upper left). The $ p_{\mathrm{T}} $ distribution of jets clustered from each particle set (upper right). The jet response relative to particle-level jets (lower left). The target $ p_{\mathrm{T}}^\text{miss} $ as a function of the PYTHIA truth $ p_{\mathrm{T}}^\text{miss} $ (lower right). The target particle set, after removing the contribution from pileup, generally aligns well with the pileup-free PYTHIA truth.
png pdf	Figure 1-b: Validation of the MLPF particle-level target, after removing the contributions from the pileup particles, using $ \mathrm{t} \overline{\mathrm{t}} $ simulation with pileup. The $ p_{\mathrm{T}} $ distribution of target particles and truth particles derived from PYTHIA stable particles (upper left). The $ p_{\mathrm{T}} $ distribution of jets clustered from each particle set (upper right). The jet response relative to particle-level jets (lower left). The target $ p_{\mathrm{T}}^\text{miss} $ as a function of the PYTHIA truth $ p_{\mathrm{T}}^\text{miss} $ (lower right). The target particle set, after removing the contribution from pileup, generally aligns well with the pileup-free PYTHIA truth.
png pdf	Figure 1-c: Validation of the MLPF particle-level target, after removing the contributions from the pileup particles, using $ \mathrm{t} \overline{\mathrm{t}} $ simulation with pileup. The $ p_{\mathrm{T}} $ distribution of target particles and truth particles derived from PYTHIA stable particles (upper left). The $ p_{\mathrm{T}} $ distribution of jets clustered from each particle set (upper right). The jet response relative to particle-level jets (lower left). The target $ p_{\mathrm{T}}^\text{miss} $ as a function of the PYTHIA truth $ p_{\mathrm{T}}^\text{miss} $ (lower right). The target particle set, after removing the contribution from pileup, generally aligns well with the pileup-free PYTHIA truth.
png pdf	Figure 1-d: Validation of the MLPF particle-level target, after removing the contributions from the pileup particles, using $ \mathrm{t} \overline{\mathrm{t}} $ simulation with pileup. The $ p_{\mathrm{T}} $ distribution of target particles and truth particles derived from PYTHIA stable particles (upper left). The $ p_{\mathrm{T}} $ distribution of jets clustered from each particle set (upper right). The jet response relative to particle-level jets (lower left). The target $ p_{\mathrm{T}}^\text{miss} $ as a function of the PYTHIA truth $ p_{\mathrm{T}}^\text{miss} $ (lower right). The target particle set, after removing the contribution from pileup, generally aligns well with the pileup-free PYTHIA truth.
png pdf	Figure 2: The neural network structure of the model. Input and output features are visualized in purple. Pointwise FFNs are visualized in green, and layers with full-event correlations in red. The last dimension contains a binary output to predict the presence of a particle (Valid), multi-class outputs for the PID, and the predicted momentum values.
png pdf	Figure 2-a: The neural network structure of the model. Input and output features are visualized in purple. Pointwise FFNs are visualized in green, and layers with full-event correlations in red. The last dimension contains a binary output to predict the presence of a particle (Valid), multi-class outputs for the PID, and the predicted momentum values.
png pdf	Figure 2-b: The neural network structure of the model. Input and output features are visualized in purple. Pointwise FFNs are visualized in green, and layers with full-event correlations in red. The last dimension contains a binary output to predict the presence of a particle (Valid), multi-class outputs for the PID, and the predicted momentum values.
png pdf	Figure 3: The PID, binary classification, and regression losses for the training and validation data sets, as a function of the training epoch. We observe both the training and validation losses to be smoothly converging. The training loss is evaluated with a partially-trained model throughout the training epoch on the training events (80%), while the validation loss is computed with the model weights at the end of the epoch on the validation events (20%). The losses are measured in arbitrary units, averaged across the events.
png pdf	Figure 3-a: The PID, binary classification, and regression losses for the training and validation data sets, as a function of the training epoch. We observe both the training and validation losses to be smoothly converging. The training loss is evaluated with a partially-trained model throughout the training epoch on the training events (80%), while the validation loss is computed with the model weights at the end of the epoch on the validation events (20%). The losses are measured in arbitrary units, averaged across the events.
png pdf	Figure 3-b: The PID, binary classification, and regression losses for the training and validation data sets, as a function of the training epoch. We observe both the training and validation losses to be smoothly converging. The training loss is evaluated with a partially-trained model throughout the training epoch on the training events (80%), while the validation loss is computed with the model weights at the end of the epoch on the validation events (20%). The losses are measured in arbitrary units, averaged across the events.
png pdf	Figure 3-c: The PID, binary classification, and regression losses for the training and validation data sets, as a function of the training epoch. We observe both the training and validation losses to be smoothly converging. The training loss is evaluated with a partially-trained model throughout the training epoch on the training events (80%), while the validation loss is computed with the model weights at the end of the epoch on the validation events (20%). The losses are measured in arbitrary units, averaged across the events.
png pdf	Figure 4: The particle $ p_{\mathrm{T}} $ (left) and $ \eta $ (right) distributions split by PID for MC particles (Gen), and PF and MLPF candidates in $ \mathrm{t} \overline{\mathrm{t}} $ events. In the forward region, the PF and MLPF algorithms do not attempt to identify particles, but instead reconstruct the deposits as either electromagnetic or hadronic.
png pdf	Figure 4-a: The particle $ p_{\mathrm{T}} $ (left) and $ \eta $ (right) distributions split by PID for MC particles (Gen), and PF and MLPF candidates in $ \mathrm{t} \overline{\mathrm{t}} $ events. In the forward region, the PF and MLPF algorithms do not attempt to identify particles, but instead reconstruct the deposits as either electromagnetic or hadronic.
png pdf	Figure 4-b: The particle $ p_{\mathrm{T}} $ (left) and $ \eta $ (right) distributions split by PID for MC particles (Gen), and PF and MLPF candidates in $ \mathrm{t} \overline{\mathrm{t}} $ events. In the forward region, the PF and MLPF algorithms do not attempt to identify particles, but instead reconstruct the deposits as either electromagnetic or hadronic.
png pdf	Figure 5: Neutral hadron efficiency (left) and misidentification rate (right) as functions of $ p_{\mathrm{T}} $ for PF (blue) and MLPF (red) in $ \mathrm{t} \overline{\mathrm{t}} $ events based on $ \Delta R $ matching to generator-level particles. The MLPF algorithm generally has higher efficiency at a lower misidentification rate. The vertical bars indicate the statistical uncertainties on the respective distributions.
png pdf	Figure 5-a: Neutral hadron efficiency (left) and misidentification rate (right) as functions of $ p_{\mathrm{T}} $ for PF (blue) and MLPF (red) in $ \mathrm{t} \overline{\mathrm{t}} $ events based on $ \Delta R $ matching to generator-level particles. The MLPF algorithm generally has higher efficiency at a lower misidentification rate. The vertical bars indicate the statistical uncertainties on the respective distributions.
png pdf	Figure 5-b: Neutral hadron efficiency (left) and misidentification rate (right) as functions of $ p_{\mathrm{T}} $ for PF (blue) and MLPF (red) in $ \mathrm{t} \overline{\mathrm{t}} $ events based on $ \Delta R $ matching to generator-level particles. The MLPF algorithm generally has higher efficiency at a lower misidentification rate. The vertical bars indicate the statistical uncertainties on the respective distributions.
png pdf	Figure 6: The corrected jet $ p_{\mathrm{T}} $ distribution in $ \mathrm{t} \overline{\mathrm{t}} $ (left) and QCD multijet (right) validation samples with 55--75 pileup interactions. MLPF is only trained to reconstruct particles, and is never explicitly optimized to reconstruct jets. The PUPPI algorithm is applied to both PF and MLPF jets to perform pileup subtraction. We show the jet $ p_{\mathrm{T}} $ distribution after simulated response calibration. The shaded bands indicate the statistical uncertainties on Gen.
png pdf	Figure 6-a: The corrected jet $ p_{\mathrm{T}} $ distribution in $ \mathrm{t} \overline{\mathrm{t}} $ (left) and QCD multijet (right) validation samples with 55--75 pileup interactions. MLPF is only trained to reconstruct particles, and is never explicitly optimized to reconstruct jets. The PUPPI algorithm is applied to both PF and MLPF jets to perform pileup subtraction. We show the jet $ p_{\mathrm{T}} $ distribution after simulated response calibration. The shaded bands indicate the statistical uncertainties on Gen.
png pdf	Figure 6-b: The corrected jet $ p_{\mathrm{T}} $ distribution in $ \mathrm{t} \overline{\mathrm{t}} $ (left) and QCD multijet (right) validation samples with 55--75 pileup interactions. MLPF is only trained to reconstruct particles, and is never explicitly optimized to reconstruct jets. The PUPPI algorithm is applied to both PF and MLPF jets to perform pileup subtraction. We show the jet $ p_{\mathrm{T}} $ distribution after simulated response calibration. The shaded bands indicate the statistical uncertainties on Gen.
png pdf	Figure 6-c: The corrected jet $ p_{\mathrm{T}} $ distribution in $ \mathrm{t} \overline{\mathrm{t}} $ (left) and QCD multijet (right) validation samples with 55--75 pileup interactions. MLPF is only trained to reconstruct particles, and is never explicitly optimized to reconstruct jets. The PUPPI algorithm is applied to both PF and MLPF jets to perform pileup subtraction. We show the jet $ p_{\mathrm{T}} $ distribution after simulated response calibration. The shaded bands indicate the statistical uncertainties on Gen.
png pdf	Figure 6-d: The corrected jet $ p_{\mathrm{T}} $ distribution in $ \mathrm{t} \overline{\mathrm{t}} $ (left) and QCD multijet (right) validation samples with 55--75 pileup interactions. MLPF is only trained to reconstruct particles, and is never explicitly optimized to reconstruct jets. The PUPPI algorithm is applied to both PF and MLPF jets to perform pileup subtraction. We show the jet $ p_{\mathrm{T}} $ distribution after simulated response calibration. The shaded bands indicate the statistical uncertainties on Gen.
png pdf	Figure 7: The jet resolution in $ \mathrm{t} \overline{\mathrm{t}} $ (left) and QCD multijet (right) validation samples with 55--75 pileup interactions, after simulated response calibration. The MLPF algorithm is only trained to reconstruct particles, and is never explicitly optimized to reconstruct jets. The PUPPI algorithm is applied to both PF and MLPF jets to perform pileup subtraction. The jet resolution is parameterized by fitting a Gaussian distribution and dividing the best-fit standard deviation by the mean, showing the bin midpoints on the horizontal axis.
png pdf	Figure 7-a: The jet resolution in $ \mathrm{t} \overline{\mathrm{t}} $ (left) and QCD multijet (right) validation samples with 55--75 pileup interactions, after simulated response calibration. The MLPF algorithm is only trained to reconstruct particles, and is never explicitly optimized to reconstruct jets. The PUPPI algorithm is applied to both PF and MLPF jets to perform pileup subtraction. The jet resolution is parameterized by fitting a Gaussian distribution and dividing the best-fit standard deviation by the mean, showing the bin midpoints on the horizontal axis.
png pdf	Figure 7-b: The jet resolution in $ \mathrm{t} \overline{\mathrm{t}} $ (left) and QCD multijet (right) validation samples with 55--75 pileup interactions, after simulated response calibration. The MLPF algorithm is only trained to reconstruct particles, and is never explicitly optimized to reconstruct jets. The PUPPI algorithm is applied to both PF and MLPF jets to perform pileup subtraction. The jet resolution is parameterized by fitting a Gaussian distribution and dividing the best-fit standard deviation by the mean, showing the bin midpoints on the horizontal axis.
png pdf	Figure 8: Uncalibrated $ p_{\mathrm{T}}^\text{miss} $ in $ \mathrm{t} \overline{\mathrm{t}} $ (left) and QCD multijet (right) samples with pileup from offline reconstruction using PF and MLPF. The MLPF algorithm is only trained to reconstruct particles, and is never explicitly optimized to reconstruct $ p_{\mathrm{T}}^\text{miss} $. The $ p_{\mathrm{T}}^\text{miss} $ distributions from PF and MLPF are found to be generally consistent, with MLPF reconstructing a somewhat harder $ p_{\mathrm{T}}^\text{miss} $ spectrum in the tails. The vertical bars indicate the statistical uncertainties on the respective distributions, and the shaded bands in the ratio panel indicate the statistical uncertainties on PF.
png pdf	Figure 8-a: Uncalibrated $ p_{\mathrm{T}}^\text{miss} $ in $ \mathrm{t} \overline{\mathrm{t}} $ (left) and QCD multijet (right) samples with pileup from offline reconstruction using PF and MLPF. The MLPF algorithm is only trained to reconstruct particles, and is never explicitly optimized to reconstruct $ p_{\mathrm{T}}^\text{miss} $. The $ p_{\mathrm{T}}^\text{miss} $ distributions from PF and MLPF are found to be generally consistent, with MLPF reconstructing a somewhat harder $ p_{\mathrm{T}}^\text{miss} $ spectrum in the tails. The vertical bars indicate the statistical uncertainties on the respective distributions, and the shaded bands in the ratio panel indicate the statistical uncertainties on PF.
png pdf	Figure 8-b: Uncalibrated $ p_{\mathrm{T}}^\text{miss} $ in $ \mathrm{t} \overline{\mathrm{t}} $ (left) and QCD multijet (right) samples with pileup from offline reconstruction using PF and MLPF. The MLPF algorithm is only trained to reconstruct particles, and is never explicitly optimized to reconstruct $ p_{\mathrm{T}}^\text{miss} $. The $ p_{\mathrm{T}}^\text{miss} $ distributions from PF and MLPF are found to be generally consistent, with MLPF reconstructing a somewhat harder $ p_{\mathrm{T}}^\text{miss} $ spectrum in the tails. The vertical bars indicate the statistical uncertainties on the respective distributions, and the shaded bands in the ratio panel indicate the statistical uncertainties on PF.
png pdf	Figure 9: Distributions of the leading jet $ p_{\text{T,corr}} $ (left), the dijet asymmetry (center), and $ p_{\mathrm{T}}^\text{miss} $ (right) in a small sample of dijet data collected in Run 3. The distributions are largely compatible. The vertical bars indicate the statistical uncertainties on the respective distributions, and the shaded bands in the ratio panel indicate the statistical uncertainties on PF.
png pdf	Figure 9-a: Distributions of the leading jet $ p_{\text{T,corr}} $ (left), the dijet asymmetry (center), and $ p_{\mathrm{T}}^\text{miss} $ (right) in a small sample of dijet data collected in Run 3. The distributions are largely compatible. The vertical bars indicate the statistical uncertainties on the respective distributions, and the shaded bands in the ratio panel indicate the statistical uncertainties on PF.
png pdf	Figure 9-b: Distributions of the leading jet $ p_{\text{T,corr}} $ (left), the dijet asymmetry (center), and $ p_{\mathrm{T}}^\text{miss} $ (right) in a small sample of dijet data collected in Run 3. The distributions are largely compatible. The vertical bars indicate the statistical uncertainties on the respective distributions, and the shaded bands in the ratio panel indicate the statistical uncertainties on PF.
png pdf	Figure 9-c: Distributions of the leading jet $ p_{\text{T,corr}} $ (left), the dijet asymmetry (center), and $ p_{\mathrm{T}}^\text{miss} $ (right) in a small sample of dijet data collected in Run 3. The distributions are largely compatible. The vertical bars indicate the statistical uncertainties on the respective distributions, and the shaded bands in the ratio panel indicate the statistical uncertainties on PF.
png pdf	Figure 10: The runtime of the baseline particle-flow reconstruction compared to the MLPF inference, directly within the CMS offline software using the native ONNXRUNTIME on GPU.
png pdf	Figure 10-a: The runtime of the baseline particle-flow reconstruction compared to the MLPF inference, directly within the CMS offline software using the native ONNXRUNTIME on GPU.
png pdf	Figure 10-b: The runtime of the baseline particle-flow reconstruction compared to the MLPF inference, directly within the CMS offline software using the native ONNXRUNTIME on GPU.
png pdf	Figure 10-c: The runtime of the baseline particle-flow reconstruction compared to the MLPF inference, directly within the CMS offline software using the native ONNXRUNTIME on GPU.
png pdf	Figure 10-d: The runtime of the baseline particle-flow reconstruction compared to the MLPF inference, directly within the CMS offline software using the native ONNXRUNTIME on GPU.

Tables
png pdf	Table 1: Monte Carlo simulation samples used for optimizing, validating, and testing the model. The training and validation samples are divided using an 80%/20% split, respectively. The symbols $ \text{---} $ in the 3rd column represent samples generated without pileup.

Summary

A machine-learning (ML)-based algorithm for particle-flow (PF) reconstruction in the CMS experiment has been presented. The algorithm correlates tracks and calorimeter clusters using successive transformer layers, and outputs a list of reconstructed particles with four-momenta and particle identification values. The algorithm is functionally equivalent to the standard PF algorithm currently used in CMS. The model is trained on CMS Run-3 (2023--2024) full simulation data sets that include realistic pileup conditions, and its performance is evaluated in simulated top quark-antiquark ( $ \mathrm{t} \overline{\mathrm{t}} $) and quantum chromodynamics multijet testing samples across different experimental conditions, as well as on data collected using a dijet trigger. To ensure computational efficiency, the model avoids quadratic scaling in runtime and memory by efficiently employing modern graphics processing units (GPUs). The ML-based PF (MLPF) algorithm achieves physics performance comparable to the standard PF algorithm, while enabling GPU acceleration for full event reconstruction within the CMS offline software framework. Specifically, we observe improved neutral hadron efficiency with MLPF while maintaining the same misidentification rate as standard PF. For jets with $ p_{\mathrm{T}} $ between 30--100 GeV in $ \mathrm{t} \overline{\mathrm{t}} $ events, we observe a 10--20% improvement in jet energy resolution compared to standard PF. The algorithm delivers a per-event runtime of approximately 20\unitms on an Nvidia L4 inference GPU, with 64 inference streams running in parallel per GPU. We also observe improved runtime scaling with event size compared to the standard PF algorithm, highlighting the computational robustness of the approach. While the MLPF algorithm has been successfully benchmarked under Run-3 conditions, its primary motivation lies in deployment at the high-luminosity upgrade of the LHC (HL-LHC), where the increased event complexity and pileup conditions demand both high physics performance and computational scalability. To deploy MLPF in production at the HL-LHC, several developments are foreseen. First, the model must be retrained for the upgraded detector and higher pileup conditions. Second, ensuring the portability of the inference across different GPU generations and hardware vendors is essential. Third, the model inference can be further optimized through techniques such as sparse or block attention, quantization, pruning, and asynchronous or batched execution. Finally, the physics performance and robustness can be improved through additional training and extensive cross-checks on a wide variety of simulated and data samples.

References
1	CMS Collaboration	Particle-flow reconstruction and global event description with the CMS detector	JINST 12 (2017) P10003	CMS-PRF-14-001 1706.04965
2	ALEPH Collaboration	Performance of the ALEPH detector at LEP	NIM A 360 (1995) 481
3	ATLAS Collaboration	Jet reconstruction and performance using particle flow with the ATLAS detector	EPJC 77 (2017) 466	1703.10485
4	A. Bocci, S. Lami, S. Kuhlmann, and G. Latino	Study of jet energy resolution at CDF	Int. J. Mod. Phys. A 16 (2001) 255
5	CDF Collaboration	A search for supersymmetric Higgs bosons in the di-tau decay mode in $ \mathrm{p}\overline{\mathrm{p}} $ collisions at 1.8 TeV	PRD 72 (2005) 112008	hep-ex/0506042
6	CDF Collaboration	Measurement of $ \sigma(\mathrm{p}\overline{\mathrm{p}} \to \mathrm{Z})\mathcal{B}(\mathrm{Z} \to \tau\tau) $ in $ \mathrm{p}\overline{\mathrm{p}} $ collisions at $ \sqrt{s}= $ 1.96 TeV	PRD 75 (2007) 092004
7	CELLO Collaboration	An analysis of the charged and neutral energy flow in $ \mathrm{e}^+\mathrm{e}^- $ hadronic annihilation at 34 GeV, and a determination of the QCD effective coupling constant	PLB 113 (1982) 427
8	D0 Collaboration	Measurement of $ \sigma(\mathrm{p}\overline{\mathrm{p}} \to \mathrm{Z} + \mathrm{X})\mathcal{B}(\mathrm{Z} \to \tau^+ \tau^-) $ at $ \sqrt{s} = $ 1.96 TeV	PLB 670 (2009) 292	0808.1306
9	DELPHI Collaboration	Performance of the DELPHI detector	NIM A 378 (1996) 57
10	H1 Collaboration	Measurement of charged particle multiplicity distributions in DIS at HERA and its implication to entanglement entropy of partons	EPJC 81 (2021) 212	2011.01812
11	ZEUS Collaboration	Measurement of the diffractive structure function $ F_2^{D(4)} $ at HERA	EPJC 1 (1998) 81	hep-ex/9709021
12	ZEUS Collaboration	Measurement of the diffractive cross-section in deep inelastic scattering using ZEUS 1994 data	EPJC 6 (1999) 43	hep-ex/9807010
13	FCC Collaboration	FCC-ee: The lepton collider. Future Circular Collider conceptual design report volume 2	Eur. Phys. J. ST 228 (2019) 261
14	CMSnoop	The International Linear Collider technical design report - volume 1: executive summary	\hrefT. Behnke et al.,, 2013	1306.6327
15	CEPC Study Group	CEPC conceptual design report: volume 2 - physics \& detector		1811.10545
16	F. Mokhtar et al.	Fine-tuning machine-learned particle-flow reconstruction for new detector geometries in future colliders	PRD 111 (2025) 092015	2503.00131
17	F. A. Di Bello et al.	Towards a computer vision particle flow	EPJC 81 (2021) 107	2003.08863
18	J. Kieseler	Object condensation: one-stage grid-free multi-object reconstruction in physics detectors, graph and image data	EPJC 80 (2020) 886	2002.03605
19	J. Shlomi, P. Battaglia, and J.-R. Vlimant	Graph neural networks in particle physics	Mach. Learn. Sci. Tech. 2 (2021) 021001	2007.13681
20	J. Pata et al.	MLPF: Efficient machine-learned particle-flow reconstruction using graph neural networks	EPJC 81 (2021) 381	2101.08578
21	CMS Collaboration	GNN-based end-to-end reconstruction in the CMS Phase 2 High-Granularity Calorimeter	J. Phys. Conf. Ser. 2438 (2023) 012090	2203.01189
22	CMS Collaboration	Machine learning for particle flow reconstruction at CMS	J. Phys. Conf. Ser. 2438 (2023) 012100	2203.00330
23	F. Mokhtar et al.	Progress towards an improved particle flow algorithm at CMS with machine learning	in st Int. Workshop on Advanced Computing and Analysis Techniques in Physics Research, 2023 Proc. 2 (2023) 1	2303.17657
24	F. A. Di Bello et al.	Reconstructing particles in jets using set transformer and hypergraph prediction networks	EPJC 83 (2023) 596	2212.01328
25	J. Pata et al.	Improved particle-flow event reconstruction with scalable neural networks for current and future particle detectors	Commun. Phys. 7 (2024) 124	2309.06782
26	P. Wahlen and T. Suehara	Particle-flow reconstruction with transformer	Eur. Phys. J. Web Conf. 315 (2024) 03010
27	N. Kakati et al.	HGPflow: Extending hypergraph particle flow to collider event reconstruction	EPJC 85 (2025) 847	2410.23236
28	D. Kobylianskii et al.	GLOW: A unified particle flow transformer	in Proc. 8th th Conference on Neural Information Processing Systems, 2025 Machine Learning and the Physical Sciences Workshop at the 3 (2025) 9	2508.20092
29	CMS Collaboration	Portable acceleration of CMS computing workflows with coprocessors as a service	Comput. Softw. Big Sci. 8 (2024) 17	CMS-MLG-23-001 2402.15366
30	I. Bejar Alonso et al.	High-Luminosity Large Hadron Collider (HL-LHC): technical design report	CERN Yellow Rep. Monogr. 10 (2020)
31	T. Dorigo et al.	Toward the end-to-end optimization of particle physics instruments with differentiable programming	Rev. Phys. 10 (2023) 100085	2203.13818
32	M. Aehle et al.	Progress in end-to-end optimization of fundamental physics experimental apparata with differentiable programming	Rev. Phys. 13 (2025) 100120	2310.05673
33	CMS Collaboration	High-precision measurement of the W boson mass with the CMS experiment at the LHC	Accepted by \emphNature, 2024	CMS-SMP-23-002 2412.13872
34	H. Qu and L. Gouskos	Jet tagging via particle clouds	PRD 101 (2020) 056019	1902.08570
35	H. Qu, C. Li, and S. Qian	Particle transformer for jet tagging	in th Int. Conference on Machine Learning, volume 162, 2022 Proc. 3 (2022) 18281	2202.03772
36	E. A. Moreno et al.	JEDI-net: a jet identification algorithm based on interaction networks	EPJC 80 (2020) 58	1908.05318
37	E. A. Moreno et al.	Interaction networks for the identification of boosted $ \mathrm{H} \to \mathrm{b}\overline{\mathrm{b}} $ decays	PRD 102 (2020) 012010	1909.12285
38	V. Mikuni and F. Canelli	ABCNet: An attention-based method for particle tagging	Eur. Phys. J. Plus 135 (2020) 463	2001.05311
39	S. Farrell et al.	Novel deep learning methods for track reconstruction	in Proc. 4th Int. Workshop Connecting the Dots, 2018	1810.06111
40	X. Ju et al.	Graph neural networks for particle reconstruction in high energy physics detectors	in Proc. 2nd rd Conference on Neural Information Processing Systems, 2020 Machine Learning and the Physical Sciences Workshop at the 3 (2020) 3	2003.11603
41	S. Amrouche et al.	The tracking machine learning challenge: Accuracy phase	in The NeurIPS '18 Competition, 2020 link	1904.06778
42	S. Amrouche et al.	Similarity hashing for charged particle tracking	in, 2019 Proc. IEEE International Conference on Big Data 201 (2019) 1595
43	N. Choma et al.	Track seeding and labelling with embedded-space graph neural networks	in Proc. 6th Int. Workshop Connecting the Dots. 202 (1900) 0	2007.00149
44	G. DeZoort et al.	Charged particle tracking via edge-classifying interaction networks	Comput. Softw. Big Sci. 5 (2021) 26	2103.16701
45	S. Van Stroud et al.	Transformers for charged particle track reconstruction in high energy physics		2411.07149
46	S. Farrell et al.	The HEP.TrkX Project: Deep neural networks for HL-LHC online and offline tracking	in Eur. Phys. J. Web Conf., volume 150, 2017 link
47	X. Ju et al.	Performance of a geometric deep learning pipeline for HL-LHC particle tracking	EPJC 81 (2021) 876	2103.06995
48	S. Miao et al.	Locality-sensitive hashing-based efficient point transformer with applications in high-energy physics	in st Int. Conference on Machine Learning, volume 235, 2024 Proc. 4 (2024) 35546	2402.12535
49	J. Shlomi et al.	Secondary vertex finding in jets with neural networks	EPJC 81 (2021) 540	2008.02831
50	S. Van Stroud et al.	Secondary vertex reconstruction with MaskFormers	EPJC 84 (2024) 1020	2312.12272
51	X. Ju and B. Nachman	Supervised jet clustering with graph neural networks for Lorentz boosted bosons	PRD 102 (2020) 075014	2008.06064
52	J. Li, T. Li, and F.-Z. Xu	Reconstructing boosted Higgs jets from event image segmentation	JHEP 04 (2021) 156	2008.13529
53	J. Guo, J. Li, T. Li, and R. Zhang	Boosted Higgs boson jet reconstruction via a graph neural network	PRD 103 (2021) 116025	2010.05464
54	M. J. Fenton et al.	Permutationless many-jet event reconstruction with symmetry preserving attention networks	PRD 105 (2022) 112008	2010.09206
55	A. Shmakov et al.	SPANet: Generalized permutationless set assignment for particle physics using symmetry preserving attention	SciPost Phys. 12 (2022) 178	2106.03898
56	M. J. Fenton et al.	Reconstruction of unstable heavy particles using deep symmetry-preserving attention networks	Commun. Phys. 7 (2024) 139	2309.01886
57	H. Li et al.	Reconstruction of boosted and resolved multi-Higgs-boson events with symmetry-preserving attention networks	JHEP 11 (2025) 119	2412.03819
58	S. R. Qasim, J. Kieseler, Y. Iiyama, and M. Pierini	Learning representations of irregular particle-detector geometry with distance-weighted graph networks	EPJC 79 (2019) 608	1902.07987
59	P. T. Komiske, E. M. Metodiev, B. Nachman, and M. D. Schwartz	Pileup mitigation with machine learning (PUMML)	JHEP 12 (2017) 051	1707.08600
60	J. Arjona Mart \'\i nez et al.	Pileup mitigation at the Large Hadron Collider with graph neural networks	Eur. Phys. J. Plus 134 (2019) 333	1810.07988
61	T. Li et al.	Semi-supervised graph neural networks for pileup noise removal	EPJC 83 (2023) 99	2203.15823
62	CMS Collaboration	The phase-2 upgrade of the CMS tracker	CMS Technical Design Report CERN-LHCC-2017-009, CMS-TDR-014, 2017 link
63	CMS Collaboration	A MIP timing detector for the CMS phase-2 upgrade	CMS Technical Design Report CERN-LHCC-2019-003, CMS-TDR-020, 2019 CDS
64	CMS Collaboration	The phase-2 upgrade of the CMS level-1 trigger	CMS Technical Design Report CERN-LHCC-2020-004, CMS-TDR-021, 2020 CDS
65	CMS Collaboration	The phase-2 upgrade of the CMS data acquisition and high level trigger	CMS Technical Design Report CERN-LHCC-2021-007, CMS-TDR-022, 2020 CDS
66	CMS Collaboration	The phase-2 upgrade of the CMS beam radiation instrumentation and luminosity detectors	CMS Technical Design Report CERN-LHCC-2021-008, CMS-TDR-023, 2021 CDS
67	CMS Collaboration	The phase-2 upgrade of the CMS barrel calorimeters	CMS Technical Design Report CERN-LHCC-2017-011, CMS-TDR-015, 2017 CDS
68	CMS Collaboration	The phase-2 upgrade of the CMS endcap calorimeter	CMS Technical Design Report CERN-LHCC-2017-023, CMS-TDR-019, 2017 CDS
69	CMS Collaboration	The phase-2 upgrade of the CMS muon detectors	CMS Technical Design Report CERN-LHCC-2017-012, CMS-TDR-016, 2017 CDS
70	CMS Collaboration	The CMS experiment at the CERN LHC	JINST 3 (2008) S08004
71	CMS Collaboration	Development of the CMS detector for the CERN LHC Run 3	JINST 19 (2024) P05064	CMS-PRF-21-001 2309.05466
72	CMS Collaboration	Performance of the CMS Level-1 trigger in proton-proton collisions at $ \sqrt{s} = $ 13 TeV	JINST 15 (2020) P10017	CMS-TRG-17-001 2006.10165
73	CMS Collaboration	The CMS trigger system	JINST 12 (2017) P01020	CMS-TRG-12-001 1609.02366
74	CMS Collaboration	Performance of the CMS high-level trigger during LHC Run 2	JINST 19 (2024) P11021	CMS-TRG-19-001 2410.17038
75	CMS Collaboration	Electron and photon reconstruction and identification with the CMS experiment at the CERN LHC	JINST 16 (2021) P05014	CMS-EGM-17-001 2012.06888
76	CMS Collaboration	Performance of the CMS muon detector and muon reconstruction with proton-proton collisions at $ \sqrt{s}= $ 13 TeV	JINST 13 (2018) P06015	CMS-MUO-16-001 1804.04528
77	CMS Collaboration	Description and performance of track and primary-vertex reconstruction with the CMS tracker	JINST 9 (2014) P10009	CMS-TRK-11-001 1405.6569
78	P. Billoir and S. Qian	Simultaneous pattern recognition and track fitting by the kalman filtering method	NIM A 294 (1990) 219
79	CMS Collaboration	Heterogeneous reconstruction of hadronic particle flow clusters with the Alpaka portability library	Eur. Phys. J. Web Conf. 337 (2025) 01171
80	M. Cacciari, G. P. Salam, and G. Soyez	The anti-$ k_{\mathrm{T}} $ jet clustering algorithm	JHEP 04 (2008) 063	0802.1189
81	M. Cacciari, G. P. Salam, and G. Soyez	FastJet	EPJC 72 (2012) 1896	1111.6097
82	CMS Collaboration	Jet energy scale and resolution in the CMS experiment in pp collisions at 8 TeV	JINST 12 (2017) P02014	CMS-JME-13-004 1607.03663
83	CMS Collaboration	Pileup mitigation at CMS in 13 TeV data	JINST 15 (2020) P09018	CMS-JME-18-001 2003.00503
84	D. Bertolini, P. Harris, M. Low, and N. Tran	Pileup per particle identification	JHEP 10 (2014) 059	1407.6013
85	CMS Collaboration	Technical proposal for the Phase-II upgrade of the Compact Muon Solenoid	CMS Technical Proposal CERN-LHCC-2015-010, CMS-TDR-15-02, 2015 CDS
86	CMS Collaboration	Performance of missing transverse momentum reconstruction in proton-proton collisions at $ \sqrt{s} = $ 13 TeV using the CMS detector	JINST 14 (2019) P07004	CMS-JME-17-001 1903.06078
87	C. Bierlich et al.	A comprehensive guide to the physics and usage of PYTHIA8.3	SciPost Phys. Codeb, 2022 link	2203.11601
88	CMS Collaboration	Extraction and validation of a new set of CMS PYTHIA8 tunes from underlying-event measurements	EPJC 80 (2020) 4	CMS-GEN-17-001 1903.12179
89	GEANT4 Collaboration	GEANT 4---a simulation toolkit	NIM A 506 (2003) 250
90	GEANT4 Collaboration	GEANT 4 developments and applications	IEEE Trans. Nucl. Sci. 53 (2006) 270
91	GEANT4 Collaboration	Recent developments in GEANT 4	NIM A 835 (2016) 186
92	A. Vaswani et al.	Attention is all you need	in Advances in Neural Information Processing Systems, I. Guyon et al., eds., volume 30, Curran Associates, Inc, 2017 link	1706.03762
93	T.-Y. Lin et al.	Focal loss for dense object detection	IEEE Trans. Pattern Anal. Mach. Intell. 42 (2020) 318	1708.02002
94	D. Holmberg	Jet energy corrections with graph neural network regression	PhD thesis, Master's thesis, University of Helsinki, 2022 link
95	T. Dao et al.	FLASHATTENTION: Fast and memory-efficient exact attention with IO-awareness	in Advances in Neural Information Processing Systems, S. Koyejo et al., eds., volume 35, Curran Associates, Inc, 2022 link	2205.14135
96	R. Xiong et al.	On layer normalization in the transformer architecture	in Int. Conference on Machine Learning, H. Daumé III and A. Singh, eds., volume 119, PMLR, 1052 Xiong in Proc. 3 (1052) 4	2002.04745
97	J. Pata et al.	jpata/particleflow: v2.5.0	link
98	A. Paszke et al.	PYTORCH: An imperative style, high-performance deep learning library	in Advances in Neural Information Processing Systems, H. Wallach et al., eds., volume 32, Curran Associates, Inc, 2019 link	1912.01703
99	CMS Collaboration	A unified approach for jet tagging in Run 3 at $ \sqrt{s}= $ 13.6 TeV in CMS	CMS Detector Performance Note CMS-DP-2024-066, 2024 CDS
100	CMS Collaboration	Flavour tagging performance of the updated unified particle transformer algorithm with the CMS experiment at $ \sqrt{s}= $ 13.6 TeV	CMS Detector Performance Note CMS-DP-2025-081, 2025 CDS
101	CMS Collaboration	Jet energy scale and resolution of jets with ParticleNet $ p_{\mathrm{T}} $ regression using Run 3 data collected by the CMS experiment in 2022 and 2023 at 13.6 TeV	CMS Detector Performance Note CMS-DP-2024-064, 2024 CDS
102	CMS Collaboration	DeepMET: Improving missing transverse momentum estimation with a deep neural network	Submitted to Phys. Rev. D, 2025	CMS-JME-24-001 2509.12012
103	CMS Collaboration	Jet algorithms performance in 13 TeV data	CMS Physics Analysis Summary, 2017 CMS-PAS-JME-16-003	CMS-PAS-JME-16-003
104	CMS Collaboration	Open neural network exchange ( ONNX )	\href{ONNX}\url{https://github.com/onnx/onnx. Accessed: -11-02, 2025}