Semi-Merged Diphoton Systems in Higgs Decays
- Semi-merged diphoton systems are defined by an intermediate photon separation, combining a resolved and a merged photon cluster in exotic decay chains.
- Advanced ECAL clustering and shower-shape analyses, complemented by machine learning mass regression, enable accurate reconstruction of these complex photon signatures.
- These systems provide new search avenues at the LHC for light boson decays and extended resonance models while offering precise efficiency and background modeling insights.
A semi-merged diphoton system is a composite photon-like object arising from the decay of a light boson—commonly in exotic decay chains such as —where one decay produces two resolved photon candidates, while the other yields a highly collimated photon pair reconstructed as a single merged photon cluster in the electromagnetic calorimeter (ECAL). This regime is distinguished by an opening angle, , intermediate between the limits set by ECAL granularity and cluster-separation criteria, specifically defined for GeV in the context of LHC searches for new light resonances (Collaboration, 1 Jan 2026, Collaboration, 2024, Bi et al., 2015).
1. Kinematic Regimes and Definition
For a scalar produced in Higgs decays, the photon pair from acquires an opening angle in the lab frame. Detailed studies by CMS show:
- GeV: –$0.035$ (typical ECAL Molière radius); photons form a nearly indistinguishable shower overlap ("fully merged").
- GeV: –$0.09$; partially resolved with strong overlap.
- GeV: –$0.3$; photons are fully resolved as separate clusters.
The “semi-merged” diphoton regime is operationally defined for , corresponding to intermediate photon separations. In this interval, one decay is reconstructed as two resolved photons ("resolved leg") and the other as a merged photon-like object ("merged leg"). For high-mass parents ( in ), semi-merged diphotons occur for – (Collaboration, 2024, Bi et al., 2015).
2. ECAL Clustering and Photon Identification
Photon clustering in CMS ECAL follows a seed-based algorithm:
- Seed crystals are required to have MeV.
- Basic clusters aggregate energy from adjacent crystals within local for superclusters (for GeV).
- For merged configurations, two collimated photon showers coalesce into an extended energy distribution, which standard PF (particle-flow) algorithms may not split.
Discrimination between single photons, merged diphotons, and hadronic backgrounds employs several shower-shape variables:
- (ratio of hadronic to electromagnetic energy)
- Charged-hadron isolation, , and electron veto
Merged photon-like PF candidates that satisfy loose photon ID (tight , , ) but are not split into two PF photons are classified as the merged leg in semi-merged event selections (Collaboration, 1 Jan 2026).
3. Event Selection and Categorization
Selection of semi-merged diphoton topologies in dedicated searches (e.g., ) is performed as follows:
- Trigger requires diphoton events with GeV, leading GeV, subleading GeV; all photons must satisfy (restricted to ECAL barrel).
- Offline, exactly three photon-like PF candidates must pass preselection (, , , GeV, no pixel seed).
- Triphoton invariant mass GeV.
- Resolved candidate comprises the closest photon pair; merged leg's mass is reconstructed via machine learning regression.
In extended resonance searches (), semi-merged objects are classified by CNNs as either diphoton, single photon, or hadron, using a normalized ECAL energy image. Mass regression CNNs, taking crystal energy images and candidate , predict to determine the cluster mass (Collaboration, 2024).
4. Mass Reconstruction and Machine Learning Techniques
Mass reconstruction for merged diphoton objects employs dedicated machine learning models:
- In searches, a graph neural network (GNN) regresses of the merged leg, producing linear response () across $0$–$18$ GeV with typical resolution –$1.5$ GeV and scale uncertainty $1.6$–.
- For , the mass regression CNN outputs ; cluster mass is . Resolution matches simulation within ; energy-scale uncertainty per cluster is .
Selection efficiency for semi-merged topologies rises with —– for GeV—and the CNN classifier achieves 55% efficiency for true merged (Collaboration, 1 Jan 2026, Collaboration, 2024).
5. Backgrounds and Statistical Modeling
Dominant backgrounds include:
- QCD multijet events with jets faking photons (“+jets”, “jet+jet”)
- Prompt +jet production
Background shapes are extracted from multiple sideband regions in the two-dimensional plane of merged vs. resolved masses, . Empirical functions (dijet-like, modified dijet, diphoton, power×exp, four-parameter) are fit to the invariant mass spectra in binned categories, with discrete profiling and floating nuisance parameters (Collaboration, 2024).
Validation compares predicted 2D backgrounds to data in sideband regions, with residuals fit by Chebyshev polynomials to assign shape uncertainties. All systematic sources (luminosity, trigger, ID efficiency, energy scale/resolution, ML calibration, background function choice) are treated as nuisance parameters in final profile-likelihood fits (Collaboration, 1 Jan 2026, Collaboration, 2024).
6. Theoretical Interpretations and Model Significance
The semi-merged diphoton signature naturally arises in models with new light scalars coupling to the Higgs or heavy sector. One illustrative example is the extension of the Standard Model (Bi et al., 2015), with and GeV. Highly boosted yields photon pairs with ; for GeV, the pair is fully merged in the ECAL, satisfying .
Experimental constraints—dijet cross section bounds, photon-jet searches, ECAL granularity—are respected by restricting the relevant parameter space; prediction efficiency factors –$0.9$ and production rates –$13$ fb are demonstrated to explain observed anomalies without contradicting negative searches in the broader parameter space.
7. Experimental Limits and Outlook
Dedicated analyses at CMS have set stringent limits on the cross section times branching ratio: is constrained to $0.264$–$0.005$ pb at CL for –$15$ GeV—the strongest bounds to date in the $1$–$5$ GeV regime (Collaboration, 1 Jan 2026). Analogous searches for set 95% CL bounds from $0.03$ to $1.06$ fb for –$3000$ GeV and – (Collaboration, 2024). No significant excess has been observed; future searches will benefit from enhanced ECAL granularity, improved machine learning mass regression, and further event topology exploitation, which will extend sensitivity into new regimes of collimated photon emission.