Search for a vector-like quark T' → tH in the diphoton final state
Abstract: A search for the production of a vector-like quark, T', is presented. The search is based on proton-proton collision events collected at a center-of-mass energy of 13 TeV by the CMS detector at the CERN LHC. The data sample corresponds to an integrated luminosity of 138 fb⁻¹, collected between 2016 and 2018. The search targets the production of a T' quark that decays to a top quark and a Higgs boson (T' → tH), with the Higgs boson subsequently decaying into a pair of photons (H → γγ). The top quark can decay either hadronically (t → bqq̄) or leptonically (t → blν). The search sets an upper limit on the T' production cross section for the mass range 600-1200 GeV. No significant excess over the standard model background is observed; accordingly, T' masses up to 730 GeV are excluded at 95% confidence level. This search is the first to use the reconstruction of the H → γγ invariant mass, leveraging an experimental resolution of 1-2%. This technique increases the sensitivity to T' mass values up to 1 TeV with respect to previous searches.
Figure 1:
The BDT output distributions for data, background, and signal events in the leptonic and hadronic categories: (a) leptonic BDT trained against the SM Higgs boson backgrounds, (b) hadronic BDT trained against the SM Higgs boson backgrounds, and (c) hadronic BDT trained against the non-resonant background processes. For the leptonic category, the MC-estimated non-resonant background is normalized to the number of observed data events. For the hadronic category, a data-driven estimate is adopted for the γ+jets background, while all other MC samples are normalized to an integrated luminosity of 138 fb⁻¹.

Figure 1-a:
Leptonic BDT trained against the SM Higgs boson backgrounds: output distributions for data, background, and signal events. The MC-estimated non-resonant background is normalized to the number of observed data events.

Figure 1-b:
Hadronic BDT trained against the SM Higgs boson backgrounds: output distributions for data, background, and signal events. A data-driven estimate is adopted for the γ+jets background, while all other MC samples are normalized to an integrated luminosity of 138 fb⁻¹.

Figure 1-c:
Hadronic BDT trained against the non-resonant background processes: output distributions for data, background, and signal events. A data-driven estimate is adopted for the γ+jets background, while all other MC samples are normalized to an integrated luminosity of 138 fb⁻¹.

Figure 2:
The combined distributions for data and mγγ signal-plus-background model fits for VLQ signal with MT of 600 GeV (left), 900 GeV (middle) and 1200 GeV (right).

Figure 2-a:
The combined distributions for data and mγγ signal-plus-background model fits for VLQ signal with MT of 600 GeV.

Figure 2-b:
The combined distributions for data and mγγ signal-plus-background model fits for VLQ signal with MT of 900 GeV.

Figure 2-c:
The combined distributions for data and mγγ signal-plus-background model fits for VLQ signal with MT of 1200 GeV.

Figure 3:
Expected and observed upper limits at 95% CL on σ × B(H → γγ) after combining the leptonic and the hadronic channels, for mT ∈ [600, 1200] GeV.

Figure 4:
Observed and expected upper limits at 95% CL on the signal strength after combining the leptonic and the hadronic channels, for mT ∈ [600, 1200] GeV. The sensitivities of other CMS [18,49] and ATLAS [19] searches are also displayed for comparison.

Figure 5:
Observed and expected upper limits at 95% CL on σ × B(H → γγ) in the leptonic channel, for mT ∈ [600, 1200] GeV.

Figure 6:
Observed and expected upper limits at 95% CL on σ × B(H → γγ) in the hadronic channel, for mT ∈ [600, 1200] GeV.

Figure 7:
Observed and expected upper limits at 95% CL on the T' coupling to the SM particles, κT, under the narrow width approximation (NWA), for mT ∈ [600, 1200] GeV.

Table 1:
Selection of the signal region for different T' mass hypotheses and number of events in the mγγ sideband region, defined by mγγ < 115 GeV or mγγ > 135 GeV.
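The sideband definition in the Table 1 caption can be illustrated with a short sketch. The function names and the mass values below are hypothetical and chosen only for illustration; they are not CMS data or analysis code. The boundaries 115 and 135 GeV are the ones quoted in the caption, and events exactly on a boundary are treated as inside the window, following the strict inequalities.

```python
def in_sideband(m_gg, low=115.0, high=135.0):
    """Return True if the diphoton mass (in GeV) lies in the sideband,
    i.e. strictly below `low` or strictly above `high`."""
    return m_gg < low or m_gg > high


def count_sideband(masses, low=115.0, high=135.0):
    """Count the events whose diphoton mass falls in the sideband."""
    return sum(1 for m in masses if in_sideband(m, low, high))


# Illustrative diphoton masses in GeV (made-up numbers, not CMS data).
masses = [98.2, 114.9, 115.0, 125.3, 135.0, 135.1, 160.7]
n_sideband = count_sideband(masses)
print(n_sideband)
```

Events that fail `in_sideband` fall in the 115 to 135 GeV window around the Higgs boson mass.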
A search for a vector-like quark in the decay T' → tH(γγ) has been performed using 138 fb⁻¹ of proton-proton collision data at √s = 13 TeV, recorded with the CMS detector during LHC Run 2 (2016-2018). The search considers both the hadronic and leptonic decay modes of the top quark and uses boosted decision trees to separate likely signal events from background processes, including standard model Higgs boson production. No statistically significant excess over the expected background is observed; accordingly, T' masses up to 730 GeV are excluded at 95% CL.
Additional Figures

Additional Figure 1:
Expected and observed upper limits at 95% CL on the signal strength parameter (μ) for singlet T production after combining the leptonic and the hadronic channels, for MT ∈ [600, 1200] GeV. The sensitivities of other CMS [18,49] and ATLAS [19] searches are also displayed for comparison.

Additional Figure 2:
Reconstructed signal T mass in the hadronic channel with Γ/MT = 1%. The number of events reflects the expected signal cross sections as a function of the T mass. The resolutions of the reconstructed signal T masses are in the 5-7% range.

Additional Figure 3:
Signal efficiency for the leptonic channel in the signal regions as optimized for each of the three T mass ranges: [600, 700], [700, 1000], and [1000, 1200] GeV, shown as red, green, and blue curves, respectively. The efficiency is defined as the ratio of the number of events passing the final selection to the total number of expected events.

Additional Figure 4:
Signal efficiency for the hadronic channel in the signal regions as optimized for each of the three T mass ranges: [600, 700], [700, 1000], and [1000, 1200] GeV, shown as red, green, and blue curves, respectively. The efficiency is defined as the ratio of the number of events passing the final selection to the total number of expected events.
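The efficiency definition used in Additional Figures 3 and 4 amounts to a simple ratio. The sketch below is a minimal illustration under assumed inputs: the function name and the event yields are hypothetical and do not come from the analysis.

```python
def signal_efficiency(n_selected, n_expected):
    """Ratio of events passing the final selection to the total expected events."""
    if n_expected <= 0:
        raise ValueError("total expected events must be positive")
    return n_selected / n_expected


# Made-up yields for illustration only (not CMS results).
eff = signal_efficiency(n_selected=12.0, n_expected=400.0)
print(f"signal efficiency = {eff:.3f}")
```

Both yields may be weighted (non-integer) event counts, as is typical for MC-based estimates.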

Additional Figure 5:
The combined expected (dotted black) and observed (solid black) upper limits at 95% CL on σ(Tbq) × B(T → tH), displayed as a function of MT. The red dashed lines show the theoretical cross sections for singlet T production with Γ/MT = 1 and 5%. The theoretical cross sections for singlet T production with representative κT values fixed at 0.1, 0.15, 0.2, and 0.25 (for Γ/MT < 5%) are shown as red solid lines.

Additional Figure 6:
The expected (dotted black) and observed (solid black) upper limits at 95% CL on σ(Tbq) × B(T → tH) in the leptonic channel, displayed as a function of MT. The theoretical cross sections for singlet T production with representative κT values fixed at 0.1, 0.15, 0.2, and 0.25 (for Γ/MT < 5%) are shown as solid red lines.

Additional Figure 7:
The expected (dotted black) and observed (solid black) upper limits at 95% CL on σ(Tbq) × B(T → tH) in the hadronic channel, displayed as a function of MT. The theoretical cross sections for singlet T production with representative κT values fixed at 0.1, 0.15, 0.2, and 0.25 (for Γ/MT < 5%) are shown as solid red lines.

Additional Figure 8:
The combined expected (dotted black) and observed (solid black) upper limits at 95% CL on the T coupling to the SM particles, κT, under the narrow width approximation (NWA), displayed as a function of MT. The theoretical κT values corresponding to Γ/MT fixed at 1, 2, 3, 4, and 5% are shown as red dashed lines.
Additional Tables

Additional Table 1:
The expected yields of different processes in each signal window for events with MT ∈ [600, 1200] GeV. Yields are shown for events with 115 < mγγ < 135 GeV and are calculated from MC samples only.
