CMS logoCMS event Hgg
Compact Muon Solenoid
LHC, CERN

CMS-PAS-TAU-24-001
Identification of tau leptons using a convolutional neural network with domain adaptation in the CMS experiment
Abstract: The DeepTau identification algorithm, based on deep neural network techniques, has been developed to reduce the fraction of jets, muons, and electrons misidentified as hadronically decaying tau leptons ($ \tau_\mathrm{h} $) in the CMS experiment. The latest version of this algorithm includes domain adaptation by backpropagation, a technique that reduces data-to-simulation discrepancies in the region with the highest purity of genuine $ \tau_\mathrm{h} $ candidates. Additionally, a refined training workflow improves classification performance, with a reduction of 30-50% in the probability for jets to be misidentified as a $ \tau_\mathrm{h} $ for a given reconstruction and identification efficiency. This note presents the main novelties introduced to the DeepTau algorithm and evaluates its performance in LHC proton-proton collision data at $ \sqrt{s}= $ 13 and 13.6 TeV collected in 2018 and 2022, respectively, with integrated luminosities of 60 and 35 fb$ ^{-1} $. The techniques to determine data-to-simulation scale factors are presented with a subset of results among the ones deployed centrally for CMS physics analyses.
Figures & Tables Summary References CMS Publications
Figures

png pdf
Figure 1:
Schematic illustration of the signatures of the $ \mathrm{h}^{\pm} $, $ \mathrm{h}^{\pm}\pi^{0} $, $ \mathrm{h}^{\pm}\mathrm{h}^{\mp}\mathrm{h}^{\pm} $, and $ \mathrm{h}^{\pm}\mathrm{h}^{\mp}\mathrm{h}^{\pm}\pi^{0} $ decay modes of the tau lepton in the CMS detector. Charged hadrons are reconstructed by the PF algorithm by matching tracks with energy deposits in the ECAL and HCAL, while the HPS algorithm aims to reconstruct each $ \pi^{0}\to\gamma\gamma $ decay as a single ``strips'' of energy clusters in ECAL.

png pdf
Figure 2:
Inner and outer grid layout in $ \eta $-$ \phi $ space [21]. The inner grid encapsulates the signal cone of maximal radius 0.1, which contains the $ \mathrm{h}^{\pm} $ and $ \pi^{0} $ candidates, and consists of 11 $ \times $ 11 cells with a size of 0.02 $ \times $ 0.02 each. The outer grid contains the isolation cone of radius 0.5, and consists of 21 $ \times $ 21 cells with a size of 0.05 $ \times $ 0.05 each.

png pdf
Figure 3:
The DEEPTAU architecture with the domain adaptation configuration [66]. A set of final domain adaptation layers was introduced for data-simulation discrimination, consisting of several dense layers followed by a softmax layer that yields an output $ y_\text{adv} $ between zero and one. The backpropagation is modified to include the ``adversarial loss'', as described in the text.

png pdf
Figure 4:
Distribution of the DEEPTAU discriminator against jets before (left) and after (right) domain adaptation, for the 2018 dataset used for domain adaptation training. There is significant improvement in data-simulation agreement in the signal region, with the discrepancies in the final bin reduced to 0.9%.

png pdf
Figure 4-a:
Distribution of the DEEPTAU discriminator against jets before (left) and after (right) domain adaptation, for the 2018 dataset used for domain adaptation training. There is significant improvement in data-simulation agreement in the signal region, with the discrepancies in the final bin reduced to 0.9%.

png pdf
Figure 4-b:
Distribution of the DEEPTAU discriminator against jets before (left) and after (right) domain adaptation, for the 2018 dataset used for domain adaptation training. There is significant improvement in data-simulation agreement in the signal region, with the discrepancies in the final bin reduced to 0.9%.

png pdf
Figure 5:
Distribution of the DEEPTAU discriminator against jets before (left) and after (right) domain adaptation, for the early 2022 dataset. While data-to-simulation differences remain, there is an appreciable improvement with the inclusion of domain adaptation.

png pdf
Figure 5-a:
Distribution of the DEEPTAU discriminator against jets before (left) and after (right) domain adaptation, for the early 2022 dataset. While data-to-simulation differences remain, there is an appreciable improvement with the inclusion of domain adaptation.

png pdf
Figure 5-b:
Distribution of the DEEPTAU discriminator against jets before (left) and after (right) domain adaptation, for the early 2022 dataset. While data-to-simulation differences remain, there is an appreciable improvement with the inclusion of domain adaptation.

png pdf
Figure 6:
Jet misidentification probability versus $ \tau_\mathrm{h} $ identification efficiency for low-$ p_{\mathrm{T}} $ (left) and high-$ p_{\mathrm{T}} $ (right) $ \tau_\mathrm{h} $ candidates, evaluated on 2018 simulated datasets. The $ \tau_\mathrm{h} $ identification efficiency is estimated from $ \mathrm{H}\to \tau \tau $ simulations using reconstructed $ \tau_\mathrm{h} $ candidates that match generator level $ \tau_\mathrm{h} $ objects. Jet misidentification probability is estimated from $ {\mathrm{t}\overline{\mathrm{t}}} $ simulations using reconstructed $ \tau_\mathrm{h} $ candidates that do not match prompt electrons, muons or products of $ \tau_\mathrm{h} $ decays at the generator level. The defined working points of the discriminator are indicated as filled circles.

png pdf
Figure 6-a:
Jet misidentification probability versus $ \tau_\mathrm{h} $ identification efficiency for low-$ p_{\mathrm{T}} $ (left) and high-$ p_{\mathrm{T}} $ (right) $ \tau_\mathrm{h} $ candidates, evaluated on 2018 simulated datasets. The $ \tau_\mathrm{h} $ identification efficiency is estimated from $ \mathrm{H}\to \tau \tau $ simulations using reconstructed $ \tau_\mathrm{h} $ candidates that match generator level $ \tau_\mathrm{h} $ objects. Jet misidentification probability is estimated from $ {\mathrm{t}\overline{\mathrm{t}}} $ simulations using reconstructed $ \tau_\mathrm{h} $ candidates that do not match prompt electrons, muons or products of $ \tau_\mathrm{h} $ decays at the generator level. The defined working points of the discriminator are indicated as filled circles.

png pdf
Figure 6-b:
Jet misidentification probability versus $ \tau_\mathrm{h} $ identification efficiency for low-$ p_{\mathrm{T}} $ (left) and high-$ p_{\mathrm{T}} $ (right) $ \tau_\mathrm{h} $ candidates, evaluated on 2018 simulated datasets. The $ \tau_\mathrm{h} $ identification efficiency is estimated from $ \mathrm{H}\to \tau \tau $ simulations using reconstructed $ \tau_\mathrm{h} $ candidates that match generator level $ \tau_\mathrm{h} $ objects. Jet misidentification probability is estimated from $ {\mathrm{t}\overline{\mathrm{t}}} $ simulations using reconstructed $ \tau_\mathrm{h} $ candidates that do not match prompt electrons, muons or products of $ \tau_\mathrm{h} $ decays at the generator level. The defined working points of the discriminator are indicated as filled circles.

png pdf
Figure 7:
Electron misidentification probability versus $ \tau_\mathrm{h} $ identification efficiency for low-$ p_{\mathrm{T}} $ (left) and high-$ p_{\mathrm{T}} $ (right) $ \tau_\mathrm{h} $ candidates, evaluated on 2018 simulated datasets. The $ \tau_\mathrm{h} $ identification efficiency is estimated from $ \mathrm{H} \to \tau\tau $ simulations using reconstructed $ \tau_\mathrm{h} $ candidates that match generator level $ \tau_\mathrm{h} $ objects. Electron misidentification probability is estimated from Drell--Yan simulation using reconstructed $ \tau_\mathrm{h} $ candidates that match electrons at the generator level. The defined working points of the discriminator are indicated as filled circles.

png pdf
Figure 7-a:
Electron misidentification probability versus $ \tau_\mathrm{h} $ identification efficiency for low-$ p_{\mathrm{T}} $ (left) and high-$ p_{\mathrm{T}} $ (right) $ \tau_\mathrm{h} $ candidates, evaluated on 2018 simulated datasets. The $ \tau_\mathrm{h} $ identification efficiency is estimated from $ \mathrm{H} \to \tau\tau $ simulations using reconstructed $ \tau_\mathrm{h} $ candidates that match generator level $ \tau_\mathrm{h} $ objects. Electron misidentification probability is estimated from Drell--Yan simulation using reconstructed $ \tau_\mathrm{h} $ candidates that match electrons at the generator level. The defined working points of the discriminator are indicated as filled circles.

png pdf
Figure 7-b:
Electron misidentification probability versus $ \tau_\mathrm{h} $ identification efficiency for low-$ p_{\mathrm{T}} $ (left) and high-$ p_{\mathrm{T}} $ (right) $ \tau_\mathrm{h} $ candidates, evaluated on 2018 simulated datasets. The $ \tau_\mathrm{h} $ identification efficiency is estimated from $ \mathrm{H} \to \tau\tau $ simulations using reconstructed $ \tau_\mathrm{h} $ candidates that match generator level $ \tau_\mathrm{h} $ objects. Electron misidentification probability is estimated from Drell--Yan simulation using reconstructed $ \tau_\mathrm{h} $ candidates that match electrons at the generator level. The defined working points of the discriminator are indicated as filled circles.

png pdf
Figure 8:
Muon misidentification probability versus $ \tau_\mathrm{h} $ identification efficiency for low-$ p_{\mathrm{T}} $ (left) and high-$ p_{\mathrm{T}} $ (right) $ \tau_\mathrm{h} $ candidates, evaluated 2018 simulated datasets. The $ \tau_\mathrm{h} $ identification efficiency is estimated from $ \mathrm{H} \to \tau\tau $ simulations using reconstructed $ \tau_\mathrm{h} $ candidates that match generator level $ \tau_\mathrm{h} $ objects. Muon misidentification probability is estimated from Drell--Yan simulation using reconstructed $ \tau_\mathrm{h} $ candidates that match muons at the generator level. The defined working points of the discriminator are indicated as filled circles.

png pdf
Figure 8-a:
Muon misidentification probability versus $ \tau_\mathrm{h} $ identification efficiency for low-$ p_{\mathrm{T}} $ (left) and high-$ p_{\mathrm{T}} $ (right) $ \tau_\mathrm{h} $ candidates, evaluated 2018 simulated datasets. The $ \tau_\mathrm{h} $ identification efficiency is estimated from $ \mathrm{H} \to \tau\tau $ simulations using reconstructed $ \tau_\mathrm{h} $ candidates that match generator level $ \tau_\mathrm{h} $ objects. Muon misidentification probability is estimated from Drell--Yan simulation using reconstructed $ \tau_\mathrm{h} $ candidates that match muons at the generator level. The defined working points of the discriminator are indicated as filled circles.

png pdf
Figure 8-b:
Muon misidentification probability versus $ \tau_\mathrm{h} $ identification efficiency for low-$ p_{\mathrm{T}} $ (left) and high-$ p_{\mathrm{T}} $ (right) $ \tau_\mathrm{h} $ candidates, evaluated 2018 simulated datasets. The $ \tau_\mathrm{h} $ identification efficiency is estimated from $ \mathrm{H} \to \tau\tau $ simulations using reconstructed $ \tau_\mathrm{h} $ candidates that match generator level $ \tau_\mathrm{h} $ objects. Muon misidentification probability is estimated from Drell--Yan simulation using reconstructed $ \tau_\mathrm{h} $ candidates that match muons at the generator level. The defined working points of the discriminator are indicated as filled circles.

png pdf
Figure 9:
Distribution of the visible mass in the $ \mu\tau_\mathrm{h} $ channel when using DEEPTAU v2.1 (left) and v2.5 (right) for discrimination in the 2018 dataset.

png pdf
Figure 9-a:
Distribution of the visible mass in the $ \mu\tau_\mathrm{h} $ channel when using DEEPTAU v2.1 (left) and v2.5 (right) for discrimination in the 2018 dataset.

png pdf
Figure 9-b:
Distribution of the visible mass in the $ \mu\tau_\mathrm{h} $ channel when using DEEPTAU v2.1 (left) and v2.5 (right) for discrimination in the 2018 dataset.

png pdf
Figure 10:
The data-to-simulation scale factors of the $ \tau_\mathrm{h} $ identification efficiency as a function of $ p_{\mathrm{T}} $ in the 2018 (left) and 2022 (right) data-taking periods, including all $ \tau_\mathrm{h} $ decay modes, and requiring the $ D_\text{jet} $ Medium working point and an $ m_{\mathrm{T}}(\mu,p_{\mathrm{T}}^\text{miss}) < $ 65 GeV cut. The vertical bars correspond to the combined statistical and systematic uncertainties in the individual scale factors.

png pdf
Figure 10-a:
The data-to-simulation scale factors of the $ \tau_\mathrm{h} $ identification efficiency as a function of $ p_{\mathrm{T}} $ in the 2018 (left) and 2022 (right) data-taking periods, including all $ \tau_\mathrm{h} $ decay modes, and requiring the $ D_\text{jet} $ Medium working point and an $ m_{\mathrm{T}}(\mu,p_{\mathrm{T}}^\text{miss}) < $ 65 GeV cut. The vertical bars correspond to the combined statistical and systematic uncertainties in the individual scale factors.

png pdf
Figure 10-b:
The data-to-simulation scale factors of the $ \tau_\mathrm{h} $ identification efficiency as a function of $ p_{\mathrm{T}} $ in the 2018 (left) and 2022 (right) data-taking periods, including all $ \tau_\mathrm{h} $ decay modes, and requiring the $ D_\text{jet} $ Medium working point and an $ m_{\mathrm{T}}(\mu,p_{\mathrm{T}}^\text{miss}) < $ 65 GeV cut. The vertical bars correspond to the combined statistical and systematic uncertainties in the individual scale factors.

png pdf
Figure 11:
The $ m_\text{vis} $ distribution in the $ \mathrm{Z}\to\tau_\mu\tau_\mathrm{h} $ channel for year 2022 before (left) and after (right) the full calibration. The DEEPTAU working points used are: Medium for $ D_\text{jet} $, VVLoose for $ D_\mathrm{e} $ and, Tight for $ D_\mu $. The application of correction factors improves the agreement between data and simulation.

png pdf
Figure 11-a:
The $ m_\text{vis} $ distribution in the $ \mathrm{Z}\to\tau_\mu\tau_\mathrm{h} $ channel for year 2022 before (left) and after (right) the full calibration. The DEEPTAU working points used are: Medium for $ D_\text{jet} $, VVLoose for $ D_\mathrm{e} $ and, Tight for $ D_\mu $. The application of correction factors improves the agreement between data and simulation.

png pdf
Figure 11-b:
The $ m_\text{vis} $ distribution in the $ \mathrm{Z}\to\tau_\mu\tau_\mathrm{h} $ channel for year 2022 before (left) and after (right) the full calibration. The DEEPTAU working points used are: Medium for $ D_\text{jet} $, VVLoose for $ D_\mathrm{e} $ and, Tight for $ D_\mu $. The application of correction factors improves the agreement between data and simulation.

png pdf
Figure 12:
The $ m_\text{vis} $ distribution in the $ \mathrm{Z}\to\tau_\mathrm{e}\tau_\mathrm{h} $ channel for year 2022 before (left) and after (right) the full calibration. The DEEPTAU working points used are: Medium for $ D_\text{jet} $, Tight for $ D_\mathrm{e} $ and, Tight for $ D_\mu $. Specific 2022 detector conditions that affected electron reconstruction are not perfectly modelled in the simulation. As a result, the amount of electrons misidentified as $ \tau_\mathrm{h} $ is enhanced in data with respect to simulated events. The application of correction factors improves the agreement between data and simulation.

png pdf
Figure 12-a:
The $ m_\text{vis} $ distribution in the $ \mathrm{Z}\to\tau_\mathrm{e}\tau_\mathrm{h} $ channel for year 2022 before (left) and after (right) the full calibration. The DEEPTAU working points used are: Medium for $ D_\text{jet} $, Tight for $ D_\mathrm{e} $ and, Tight for $ D_\mu $. Specific 2022 detector conditions that affected electron reconstruction are not perfectly modelled in the simulation. As a result, the amount of electrons misidentified as $ \tau_\mathrm{h} $ is enhanced in data with respect to simulated events. The application of correction factors improves the agreement between data and simulation.

png pdf
Figure 12-b:
The $ m_\text{vis} $ distribution in the $ \mathrm{Z}\to\tau_\mathrm{e}\tau_\mathrm{h} $ channel for year 2022 before (left) and after (right) the full calibration. The DEEPTAU working points used are: Medium for $ D_\text{jet} $, Tight for $ D_\mathrm{e} $ and, Tight for $ D_\mu $. Specific 2022 detector conditions that affected electron reconstruction are not perfectly modelled in the simulation. As a result, the amount of electrons misidentified as $ \tau_\mathrm{h} $ is enhanced in data with respect to simulated events. The application of correction factors improves the agreement between data and simulation.

png pdf
Figure 13:
Summary of $ \tau_\mathrm{h} $ energy scales (right) and $ \tau_\mathrm{h} $ identification efficiency (left) across $ \tau_\mathrm{h} $ decay modes and $ p_{\mathrm{T}} $ regions for 2018 with $ m_{\mathrm{T}}(\mu,p_{\mathrm{T}}^\text{miss}) < $ 65 GeV and the $ D_\text{jet} $ Medium working point.

png pdf
Figure 13-a:
Summary of $ \tau_\mathrm{h} $ energy scales (right) and $ \tau_\mathrm{h} $ identification efficiency (left) across $ \tau_\mathrm{h} $ decay modes and $ p_{\mathrm{T}} $ regions for 2018 with $ m_{\mathrm{T}}(\mu,p_{\mathrm{T}}^\text{miss}) < $ 65 GeV and the $ D_\text{jet} $ Medium working point.

png pdf
Figure 13-b:
Summary of $ \tau_\mathrm{h} $ energy scales (right) and $ \tau_\mathrm{h} $ identification efficiency (left) across $ \tau_\mathrm{h} $ decay modes and $ p_{\mathrm{T}} $ regions for 2018 with $ m_{\mathrm{T}}(\mu,p_{\mathrm{T}}^\text{miss}) < $ 65 GeV and the $ D_\text{jet} $ Medium working point.

png pdf
Figure 14:
Summary of $ \tau_\mathrm{h} $ energy scales across $ \tau_\mathrm{h} $ decay modes for 2022 with $ m_{\mathrm{T}}(\mu,p_{\mathrm{T}}^\text{miss}) < $ 40 GeV and $ D_\text{jet} $ Medium working point.

png pdf
Figure 15:
Muon misidentification rate scale factors binned by $ |\eta| $ for the Medium $ D_\mu $ working point. Measurement for the 2018 dataset is shown on the left and for the 2022 dataset on the right. The dashed lines indicate the boundaries of $ \tau_\mathrm{h} |\eta| $ bins.

png pdf
Figure 15-a:
Muon misidentification rate scale factors binned by $ |\eta| $ for the Medium $ D_\mu $ working point. Measurement for the 2018 dataset is shown on the left and for the 2022 dataset on the right. The dashed lines indicate the boundaries of $ \tau_\mathrm{h} |\eta| $ bins.

png pdf
Figure 15-b:
Muon misidentification rate scale factors binned by $ |\eta| $ for the Medium $ D_\mu $ working point. Measurement for the 2018 dataset is shown on the left and for the 2022 dataset on the right. The dashed lines indicate the boundaries of $ \tau_\mathrm{h} |\eta| $ bins.

png pdf
Figure 16:
Summary plots of results for electron misidentification rate scale factors divided in decay modes and $ \eta $ regions for the VVLoose $ D_\mathrm{e} $ working point. The corrections are shown on the left for 2018 and on the right for 2022 datasets.

png pdf
Figure 16-a:
Summary plots of results for electron misidentification rate scale factors divided in decay modes and $ \eta $ regions for the VVLoose $ D_\mathrm{e} $ working point. The corrections are shown on the left for 2018 and on the right for 2022 datasets.

png pdf
Figure 16-b:
Summary plots of results for electron misidentification rate scale factors divided in decay modes and $ \eta $ regions for the VVLoose $ D_\mathrm{e} $ working point. The corrections are shown on the left for 2018 and on the right for 2022 datasets.

png pdf
Figure 17:
The high-$ p_{\mathrm{T}} \tau_\mathrm{h} $ identification efficiency scale factors as a function of $ \tau_\mathrm{h} p_{\mathrm{T}} $ for $ D_\text{jet} $ Medium, $ D_\mu $ Tight and $ D_\mathrm{e} $ VVLoose (left) and tight (right) discriminators. The scale factors are measured for the 2018 (top) and 2022 (bottom) dataset.

png pdf
Figure 17-a:
The high-$ p_{\mathrm{T}} \tau_\mathrm{h} $ identification efficiency scale factors as a function of $ \tau_\mathrm{h} p_{\mathrm{T}} $ for $ D_\text{jet} $ Medium, $ D_\mu $ Tight and $ D_\mathrm{e} $ VVLoose (left) and tight (right) discriminators. The scale factors are measured for the 2018 (top) and 2022 (bottom) dataset.

png pdf
Figure 17-b:
The high-$ p_{\mathrm{T}} \tau_\mathrm{h} $ identification efficiency scale factors as a function of $ \tau_\mathrm{h} p_{\mathrm{T}} $ for $ D_\text{jet} $ Medium, $ D_\mu $ Tight and $ D_\mathrm{e} $ VVLoose (left) and tight (right) discriminators. The scale factors are measured for the 2018 (top) and 2022 (bottom) dataset.

png pdf
Figure 17-c:
The high-$ p_{\mathrm{T}} \tau_\mathrm{h} $ identification efficiency scale factors as a function of $ \tau_\mathrm{h} p_{\mathrm{T}} $ for $ D_\text{jet} $ Medium, $ D_\mu $ Tight and $ D_\mathrm{e} $ VVLoose (left) and tight (right) discriminators. The scale factors are measured for the 2018 (top) and 2022 (bottom) dataset.

png pdf
Figure 17-d:
The high-$ p_{\mathrm{T}} \tau_\mathrm{h} $ identification efficiency scale factors as a function of $ \tau_\mathrm{h} p_{\mathrm{T}} $ for $ D_\text{jet} $ Medium, $ D_\mu $ Tight and $ D_\mathrm{e} $ VVLoose (left) and tight (right) discriminators. The scale factors are measured for the 2018 (top) and 2022 (bottom) dataset.

png pdf
Figure 18:
Prefit (left plots) and postfit (right plots) distribution of $ m_{\mathrm{T}}(\tau_\mathrm{h}, p_{\mathrm{T}}^\text{miss}) $ for $ p_{\mathrm{T}} $ bins of 100 $ < p_{\mathrm{T}} < $ 200 GeV (upper plots) and $ p_{\mathrm{T}} > $ 200 GeV (lower plots) measurement regions of 2022 dataset. Distributions are obtained for a combination of $ D_\text{jet} $ Medium, $ D_\mu $ Tight and $ D_\mathrm{e} $ Tight discriminators.

png pdf
Figure 18-a:
Prefit (left plots) and postfit (right plots) distribution of $ m_{\mathrm{T}}(\tau_\mathrm{h}, p_{\mathrm{T}}^\text{miss}) $ for $ p_{\mathrm{T}} $ bins of 100 $ < p_{\mathrm{T}} < $ 200 GeV (upper plots) and $ p_{\mathrm{T}} > $ 200 GeV (lower plots) measurement regions of 2022 dataset. Distributions are obtained for a combination of $ D_\text{jet} $ Medium, $ D_\mu $ Tight and $ D_\mathrm{e} $ Tight discriminators.

png pdf
Figure 18-b:
Prefit (left plots) and postfit (right plots) distribution of $ m_{\mathrm{T}}(\tau_\mathrm{h}, p_{\mathrm{T}}^\text{miss}) $ for $ p_{\mathrm{T}} $ bins of 100 $ < p_{\mathrm{T}} < $ 200 GeV (upper plots) and $ p_{\mathrm{T}} > $ 200 GeV (lower plots) measurement regions of 2022 dataset. Distributions are obtained for a combination of $ D_\text{jet} $ Medium, $ D_\mu $ Tight and $ D_\mathrm{e} $ Tight discriminators.

png pdf
Figure 18-c:
Prefit (left plots) and postfit (right plots) distribution of $ m_{\mathrm{T}}(\tau_\mathrm{h}, p_{\mathrm{T}}^\text{miss}) $ for $ p_{\mathrm{T}} $ bins of 100 $ < p_{\mathrm{T}} < $ 200 GeV (upper plots) and $ p_{\mathrm{T}} > $ 200 GeV (lower plots) measurement regions of 2022 dataset. Distributions are obtained for a combination of $ D_\text{jet} $ Medium, $ D_\mu $ Tight and $ D_\mathrm{e} $ Tight discriminators.

png pdf
Figure 18-d:
Prefit (left plots) and postfit (right plots) distribution of $ m_{\mathrm{T}}(\tau_\mathrm{h}, p_{\mathrm{T}}^\text{miss}) $ for $ p_{\mathrm{T}} $ bins of 100 $ < p_{\mathrm{T}} < $ 200 GeV (upper plots) and $ p_{\mathrm{T}} > $ 200 GeV (lower plots) measurement regions of 2022 dataset. Distributions are obtained for a combination of $ D_\text{jet} $ Medium, $ D_\mu $ Tight and $ D_\mathrm{e} $ Tight discriminators.
Tables

png pdf
Table 1:
Selection requirements for the domain adaptation dataset. The impact parameters, $ d_z $ and $ d_{xy} $, are defined as the distances between the muon track (or leading charged hadron track) and the PV for the muon (or $ \tau_\mathrm{h} $ candidate respectively). The medium muon identification is defined in Ref. [27]. The previous DEEPTAU discriminator scores described in Ref. [21] against quark and gluon jets, electrons, and muons, are denoted $ D_\text{jet}^\text{v2.1} $, $ D_\mathrm{e}^\text{v2.1} $, and $ D_\mu^\text{v2.1} $. The transverse mass of the muon and the missing transverse energy is denoted as $ m_{\mathrm{T}}(\mu,p_{\mathrm{T}}^\text{miss}) $.

png pdf
Table 2:
Target $ \tau_\mathrm{h} $ identification efficiencies for the different working points defined for the three discriminators. The target efficiencies are evaluated with the $ \mathrm{H}\to\tau\tau $ event sample for $ \tau_\mathrm{h} $ with $ p_{\mathrm{T}} \in $ [30, 70] GeV.

png pdf
Table A1:
Default values of the parameters used in the classification loss function for DEEPTAU training.
Summary
In this note, the newly deployed version of the DEEPTAU algorithm, v2.5, used to discriminate $ \tau_\mathrm{h} $ candidates against jets, electrons, and muons has been introduced. This deep convolutional neural network exhibits improved performance with respect to its predecessor, reducing the jet misidentification rate by 30-50% for a given reconstruction and identification $ \tau_\mathrm{h} $ efficiency. The implementation of domain adaptation by backpropagation, has reduced performance discrepancies between collision data and simulation, decreasing the necessary residual corrections. The domain adaptation was introduced by modifying gradient calculation in the neural network, via the addition of an adversarial subnetwork, designed to discriminate between collision data and simulations, in parallel to the tau classification task. The DEEPTAU algorithm, using both collision data and simulated samples, was therefore able to maximize the $ \tau_\mathrm{h} $ classification performance, while minimizing the data-simulation discrepancies. The algorithm was trained on simulated proton-proton collision data corresponding to the 2018 data-taking conditions, as well as data collected during the same year, which was used for domain adaptation. The DEEPTAU v2.5 discriminants were introduced in 2022 to be used by several CMS physics analyses. The algorithm has therefore been validated using both 2018 and 2022 collision data. The observed $ \tau_\mathrm{h} $ efficiencies were found to agree with the expected efficiencies from simulated events within 10% for 2018 and 15% for 2022. This agreement is improved with respect to the previous iteration of the algorithm and confirms the effectiveness of domain adaptation.
References
1 CMS Collaboration Observation of the Higgs boson decay to a pair of $ \tau $ leptons with the CMS detector PLB 779 (2018) 283 CMS-HIG-16-043
1708.00373
2 CMS Collaboration Search for Higgs boson pair production in events with two bottom quarks and two tau leptons in proton--proton collisions at $ \sqrt{s}= $ 13 TeV PLB 778 (2018) 101 CMS-HIG-17-002
1707.02909
3 ATLAS Collaboration Cross-section measurements of the Higgs boson decaying into a pair of $ \tau $-leptons in proton-proton collisions at $ \sqrt{s}= $ 13 TeV with the ATLAS detector PRD 99 (2019) 072001 1811.08856
4 ATLAS Collaboration Search for resonant and nonresonant Higgs boson pair production in the $ {\text{b}\bar{\text{b}}\tau^+\tau^-} $ decay channel in pp collisions at $ \sqrt{s}= $ 13 TeV with the ATLAS detector PRL 121 (2018) 191801 1808.00336
5 ATLAS Collaboration Test of CP invariance in vector-boson fusion production of the Higgs boson in the $ \text{H}\rightarrow\tau\tau $ channel in proton-proton collisions at $ \sqrt{s} = $ 13 TeV with the ATLAS detector PLB 805 (2020) 135426 2002.05315
6 CMS Collaboration Search for additional neutral MSSM Higgs bosons in the $ \tau\tau $ final state in proton-proton collisions at $ \sqrt{s}= $ 13 TeV JHEP 09 (2018) 007 CMS-HIG-17-020
1803.06553
7 ATLAS Collaboration Search for charged Higgs bosons decaying via H$ ^{\pm} \to \tau^{\pm}\nu_{\tau} $ in the $ \tau $+jets and $ \tau $+lepton final states with 36 fb$ ^{-1} $ of pp collision data recorded at $ \sqrt{s} = $ 13 TeV with the ATLAS experiment JHEP 09 (2018) 139 1807.07915
8 CMS Collaboration Search for an exotic decay of the Higgs boson to a pair of light pseudoscalars in the final state with two b quarks and two $ \tau $ leptons in proton-proton collisions at $ \sqrt{s}= $ 13 TeV PLB 785 (2018) 462 CMS-HIG-17-024
1805.10191
9 CMS Collaboration Search for a heavy pseudoscalar Higgs boson decaying into a 125 GeV Higgs boson and a Z boson in final states with two tau and two light leptons at $ \sqrt{s}= $ 13 TeV JHEP 03 (2020) 065 CMS-HIG-18-023
1910.11634
10 CMS Collaboration Search for lepton flavour violating decays of a neutral heavy Higgs boson to $ \mu\tau $ and e$ \tau $ in proton-proton collisions at $ \sqrt{s}= $ 13 TeV JHEP 03 (2020) 103 CMS-HIG-18-017
1911.10267
11 CMS Collaboration Search for a low-mass $ \tau^+\tau^- $ resonance in association with a bottom quark in proton-proton collisions at $ \sqrt{s}= $ 13 TeV JHEP 05 (2019) 210 CMS-HIG-17-014
1903.10228
12 CMS Collaboration Search for charged Higgs bosons in the H$ ^{\pm} \to \tau^{\pm}\nu_\tau $ decay channel in proton-proton collisions at $ \sqrt{s} = $ 13 TeV JHEP 07 (2019) 142 CMS-HIG-18-014
1903.04560
13 ATLAS Collaboration Search for heavy Higgs bosons decaying into two tau leptons with the ATLAS detector using pp collisions at $ \sqrt{s}= $ 13 TeV PRL 125 (2020) 051801 2002.12223
14 CMS Collaboration Search for direct pair production of supersymmetric partners to the $ \tau $ lepton in proton-proton collisions at $ \sqrt{s}= $ 13 TeV EPJC 80 (2020) 189 CMS-SUS-18-006
1907.13179
15 CMS Collaboration Search for heavy neutrinos and third-generation leptoquarks in hadronic states of two $ \tau $ leptons and two jets in proton-proton collisions at $ \sqrt{s} = $ 13 TeV JHEP 03 (2019) 170 CMS-EXO-17-016
1811.00806
16 ATLAS Collaboration Searches for third-generation scalar leptoquarks in $ \sqrt{s} = $ 13 TeV pp collisions with the ATLAS detector JHEP 06 (2019) 144 1902.08103
17 CMS Collaboration Analysis of the CP structure of the Yukawa coupling between the Higgs boson and $ \tau $ leptons in proton-proton collisions at $ \sqrt{s}= $ 13 TeV JHEP 06 (2022) 012 CMS-HIG-20-006
2110.04836
18 CMS Collaboration The CMS experiment at the CERN LHC JINST 3 (2008) S08004
19 CMS Collaboration Development of the CMS detector for the CERN LHC Run 3 JINST 19 (2024) P05064 CMS-PRF-21-001
2309.05466
20 CMS Collaboration Performance of $ \tau $-lepton reconstruction and identification in CMS JINST 7 (2012) P01001 CMS-TAU-11-001
1109.6034
21 CMS Collaboration Identification of hadronic tau lepton decays using a deep neural network JINST 17 (2022) P07023 CMS-TAU-20-001
2201.08458
22 Y. Ganin and V. Lempitsky Unsupervised domain adaptation by backpropagation 1409.7495
23 CMS Collaboration Performance of the CMS Level-1 trigger in proton-proton collisions at $ \sqrt{s} = $ 13 TeV JINST 15 (2020) P10017 CMS-TRG-17-001
2006.10165
24 CMS Collaboration The CMS trigger system JINST 12 (2017) P01020 CMS-TRG-12-001
1609.02366
25 CMS Collaboration Performance of the CMS high-level trigger during LHC run 2 JINST 19 (2024) P11021 CMS-TRG-19-001
2410.17038
26 CMS Collaboration Electron and photon reconstruction and identification with the CMS experiment at the CERN LHC JINST 16 (2021) P05014 CMS-EGM-17-001
2012.06888
27 CMS Collaboration Performance of the CMS muon detector and muon reconstruction with proton-proton collisions at $ \sqrt{s}= $ 13 TeV JINST 13 (2018) P06015 CMS-MUO-16-001
1804.04528
28 CMS Collaboration Description and performance of track and primary-vertex reconstruction with the cms tracker JINST 9 (2014) P10009 CMS-TRK-11-001
1405.6569
29 J. Alwall et al. The automated computation of tree-level and next-to-leading order differential cross sections, and their matching to parton shower simulations JHEP 07 (2014) 079 1405.0301
30 J. Alwall et al. Comparative study of various algorithms for the merging of parton showers and matrix elements in hadronic collisions EPJC 53 (2008) 473 0706.2569
31 R. Frederix and S. Frixione Merging meets matching in MC@NLO JHEP 12 (2012) 061 1209.6215
32 P. Nason A new method for combining NLO QCD with shower Monte Carlo algorithms JHEP 11 (2004) 040 hep-ph/0409146
33 S. Frixione, P. Nason, and C. Oleari Matching NLO QCD computations with parton shower simulations: the POWHEG method JHEP 11 (2007) 070 0709.2092
34 S. Alioli, P. Nason, C. Oleari, and E. Re A general framework for implementing NLO calculations in shower Monte Carlo programs: the POWHEG BOX JHEP 06 (2010) 043 1002.2581
35 S. Frixione, P. Nason, and G. Ridolfi A positive-weight next-to-leading-order Monte Carlo for heavy flavour hadroproduction JHEP 09 (2007) 126 0707.3088
36 J. M. Campbell, R. K. Ellis, P. Nason, and E. Re Top-pair production and decay at NLO matched with parton showers JHEP 04 (2015) 114 1412.1828
37 S. Alioli, P. Nason, C. Oleari, and E. Re NLO single-top production matched with shower in POWHEG: $ s $- and $ t $-channel contributions JHEP 09 (2009) 111 0907.4076
38 E. Re Single-top Wt-channel production matched with parton showers using the POWHEG method EPJC 71 (2011) 1547 1009.2450
39 T. Sjöstrand et al. An introduction to PYTHIA 8.2 Comput. Phys. Commun. 191 (2015) 159 1410.3012
40 Y. Li and F. Petriello Combining QCD and electroweak corrections to production in FEWZ PRD 86 (2012) 094034 1208.5967
41 M. Czakon and A. Mitov Top++: A program for the calculation of the top-pair cross-section at hadron colliders Comput. Phys. Commun. 185 (2014) 2930 1112.5675
42 P. Kant et al. HatHor for single top-quark production: Updated predictions and uncertainty estimates for single top-quark production in hadronic collisions Comput. Phys. Commun. 191 (2015) 74 1406.4403
43 CMS Collaboration Event generator tunes obtained from underlying event and multiparton scattering measurements EPJC 76 (2016) 155 CMS-GEN-14-001
1512.00815
44 N. Davidson et al. Universal interface of TAUOLA: Technical and physics documentation Comput. Phys. Commun. 183 (2012) 821 1002.0543
45 GEANT4 Collaboration GEANT4 --- a simulation toolkit NIM A 506 (2003) 250
46 CMS Collaboration Particle-flow reconstruction and global event description with the CMS detector JINST 12 (2017) P10003 CMS-PRF-14-001
1706.04965
47 M. Cacciari, G. P. Salam, and G. Soyez The $ \text{anti-k}_\text{t} $ jet clustering algorithm JHEP 04 (2008) 063 0802.1189
48 M. Cacciari, G. P. Salam, and G. Soyez FastJet user manual EPJC 72 (2012) 1896 1111.6097
49 M. Cacciari and G. P. Salam Dispelling the $ N^{3} $ myth for the $ k_{\mathrm{T}} $ jet-finder PLB 641 (2006) 57 hep-ph/0512210
50 CMS Collaboration Pileup mitigation at CMS in 13 TeV data JINST 15 (2020) P09018 CMS-JME-18-001
2003.00503
51 CMS Collaboration Jet energy scale and resolution in the CMS experiment in pp collisions at 8 TeV JINST 12 (2017) P02014 CMS-JME-13-004
1607.03663
52 CMS Collaboration Performance of missing transverse momentum reconstruction in proton-proton collisions at $ \sqrt{s} = $ 13 TeV using the CMS detector JINST 14 (2019) P07004 CMS-JME-17-001
1903.06078
53 CMS Collaboration ECAL 2016 refined calibration and Run2 summary plots CMS Detector Performance Summary, CMS-DP-2020-021, 2020
CDS
54 CMS Collaboration Performance of reconstruction and identification of $ \tau $ leptons decaying to hadrons and $ \nu_\tau $ in pp collisions at $ \sqrt{s}= $ 13 TeV JINST 13 (2018) P10005 CMS-TAU-16-003
1809.02816
55 CMS Collaboration Performance of electron reconstruction and selection with the CMS detector in proton-proton collisions at $ \sqrt{s} = $ 8 TeV JINST 10 (2015) P06005 CMS-EGM-13-001
1502.02701
56 D. Bertolini, P. Harris, M. Low, and N. Tran Pileup per particle identification JHEP 10 (2014) 059 1407.6013
57 S. Ioffe and C. Szegedy Batch normalization: accelerating deep network training by reducing internal covariate shift in Proceedings of the 32nd International Conference on International Conference on Machine Learning, ICML'15, 2015 1502.03167
58 K. He, X. Zhang, S. Ren, and J. Sun Delving deep into rectifiers: Surpassing human-level performance on ImageNet classification link 1502.01852
59 I. J. Goodfellow, Y. Bengio, and A. Courville Deep Learning MIT Press, Cambridge, MA, USA, 2016
link
60 T.-Y. Lin et al. Focal loss for dense object detection TPAMI 42 (2020) 318 1708.02002
61 D. P. Kingma and J. Ba Adam: A method for stochastic optimization in Conference Track Proceedings, 3rd International Conference on Learning Representations, ICLR 2015 1412.6980
62 T. Dozat Incorporating Nesterov momentum into ADAM in Conference Track Proceedings, 4th International Conference on Learning Representations, ICLR 2016
link
63 M. Abadi et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems Software available from
https://www.tensorflow.org/
64 F. Chollet et al. Keras Software available from
https://keras.io
65 CMS Collaboration A deep neural network to search for new long-lived particles decaying to jets MLST 1 (2020) 035012 CMS-EXO-19-011
1912.12238
66 L. Russell Identification of Hadronic Tau Lepton Decays with Domain Adaptation using Adversarial Machine Learning at CMS Master's thesis, Imperial College London, École Polytechnique Fédérale de Lausanne, 2022
67 CMS Collaboration The CMS statistical analysis and combination tool: Combine CSBS 8 (2024) 19 CMS-CAT-23-001
2404.06614
68 M. Baak, S. Gadatsch, R. Harrington, and W. Verkerke Interpolation between multi-dimensional histograms using a new non-linear moment morphing method NIM A 771 (2015) 39 1410.7388
69 CMS Collaboration Search for W' decaying to tau lepton and neutrino in proton-proton collisions at $ \sqrt{s} = $ 8 TeV PLB 755 (2016) 196 CMS-EXO-12-011
1508.04308
Compact Muon Solenoid
LHC, CERN