Lorentz-Boosted Hadronic Top Quarks
- The paper demonstrates that Lorentz-boosted top quarks, with highly collimated decay products, can be robustly identified using fat-jet algorithms and tailored grooming techniques to preserve jet mass accuracy.
- Key methods include the use of substructure observables such as N-subjettiness, energy correlation functions, and advanced machine learning taggers to efficiently distinguish signal from background.
- Implications of these techniques extend to precision top quark mass measurements, differential cross-section studies, and searches for new physics in high-energy collider experiments.
A Lorentz-boosted hadronically decaying top quark is a top quark produced with transverse momentum substantially larger than its rest mass (173 GeV), such that all three of its hadronic decay products () are collimated into a single high- jet. Identification and measurement of such top quarks at hadron colliders are central for precision Standard Model studies and new-physics searches at high energy scales.
1. Kinematics and Topology of Lorentz-Boosted Hadronic Top Decays
The critical kinematic parameter for boosted tops is the Lorentz boost factor,
For TeV, . The decay products are confined to a cone of
which results in typical angular separations of for TeV (Auerbach et al., 2013). This forces all three decay prongs (, , ) into a region much smaller than the usual jet radii (), necessitating specialized "fat-jet" (–$1.5$) reconstruction to capture the complete top decay.
The resulting jet’s invariant mass,
peaks near GeV (including detector resolution, missing energy from neutrinos or semileptonic decays, and associated radiation).
2. Experimental Identification and Tagging Strategies
The identification of boosted hadronic tops leverages:
- Jet reconstruction: Anti- or Cambridge–Aachen algorithms with large radius parameters (typically –$1.5$) (Schätzel, 2014, Collaboration, 2015, Collaboration, 2022). Variable- schemes, where , are used in certain algorithms (e.g., HOTVR) to optimize capture of the decay at all (Lapsien et al., 2016).
- Jet grooming: Pruning, trimming, and soft-drop/mMDT removal of soft, wide-angle radiation to stabilize jet mass and mitigate pileup/underlying event effects (Schätzel, 2014, Collaboration, 2022, Lapsien et al., 2016). Parameters (e.g. soft-drop , ) and grooming radius are tuned to maintain signal efficiency and mitigate mass sculpting.
- Substructure discrimination:
- N-subjettiness (): Measures degree to which a jet possesses subjets. The ratio is minimized for true three-prong top decays; typical selection is (Kuutmann, 2014, Conway et al., 2016, Collaboration, 2016).
- Energy correlation functions (ECFs): Ratios such as probe three-prong substructure and are highly effective in backgrounds with different radiation patterns (Collaboration, 2020, Collaboration, 2018).
- HEPTopTagger variables: Jet mass windows, pairwise subjet masses consistent with and top kinematics, subjet pairings. A frequently used mass window is 140–210 GeV for the full jet and 65–95 GeV for the candidate (Schätzel, 2014, Collaboration, 2015).
- Mass-drop and splitting scales: splitting scales () and mass-drop criteria (, ) are used to test compatibility with boosted top topology (Schätzel, 2014).
- -tagging in boosted environments: Performed either on subjets inside the fat-jet (using information such as CSVv2, DeepJet, or impact parameter significances) or via specialized algorithms in the jet center-of-mass frame. Subjet -tagging gives critical background suppression, often by an order of magnitude, while maintaining 40% efficiency (Chen, 2013, Auerbach et al., 2013, Collaboration, 2016).
- Multivariate and Machine Learning (ML) Taggers:
- Boosted-Decision Trees (BDTs): Combine substructure variables and -tag to optimize separation (Collaboration, 2018, Collaboration, 2020).
- DNN/CNN/LoLa architectures: Work on low-level four-vector (PF candidate) inputs or jet images, incorporating Lorentz symmetries (e.g., DeepAK8, LoLa) and achieving up to an order of magnitude background reduction relative to cut-based taggers at fixed efficiency (Butter et al., 2017, Collaboration, 2020).
- Boosted Event Shape Tagger (BEST): Uses event shapes calculated in multiple Lorentz-boosted frames (for , , , ) as neural network input, further enhancing multi-class separation (Conway et al., 2016, Collaboration, 2020).
3. Performance Metrics and Systematics
Performance is quantified by tagging efficiency (signal acceptance) and background rejection (inverse mistag rate), both as functions of and jet characteristics.
| Technique/Tagger | Signal Eff. () | Background Rejection () | Comments |
|---|---|---|---|
| Cut-based (e.g., ) | 30–50% | –$40$ | GeV typical |
| HOTVR | 35–48% | 12–25 | Robust up to several TeV |
| ECF+BDT | 30–50% | 30–50 | , -tag, BDT |
| CNN/DNN (ImageTop, DeepAK8) | up to 70% | higher than legacy | Multiclass (top, , , , QCD) |
Systematic uncertainties are dominated by jet energy/mass calibration (JES/JMS), subjet -tag scale factors, modeling of QCD radiation (ISR/FSR), pileup mitigation effectiveness, and MC generator parton-shower/tune choices. In data/MC comparisons, scale factors and uncertainties are derived in control regions (e.g., semileptonic , +jets, multijets) for validation (Collaboration, 2020, Collaboration, 2015).
4. Applications: Precision Measurements and New Physics Searches
Lorentz-boosted hadronic top identification is pivotal for:
- Differential cross-section and top-mass measurements: The unfolded and spectra of boosted top candidates are used to measure , , and extract with precision. For example, (Collaboration, 2022) achieved GeV using XCone-reconstructed jets at GeV, exploiting in-situ -mass calibration and substructure to constrain FSR uncertainties.
- BSM searches: High-mass resonances decaying to or monotop signatures necessitate efficient rejection of QCD backgrounds in the high- regime. Improved taggers—cut-based, BDT, or DNN—significantly enhance sensitivity and extend the reach to or in the TeV scale (Collaboration, 2018, Collaboration, 2015, Collaboration, 2016).
- SUSY and Exotics: In SUSY cascades and vector-like quark searches, Lorentz-boosted tops arise from heavy state decays with –1000 GeV, making boosted top tagging essential for both background suppression and signal significance (Bandyopadhyay et al., 2010, Collaboration, 2016).
- Control of backgrounds and systematic effects: Event selection and tagging are designed to provide data-driven control over dominant backgrounds (jets, lepton+jets, QCD multijets), and pileup effects are mitigated by grooming and PUPPI weighting.
5. Evolution of Tagging Algorithms
Historically, cut-based approaches using jet mass windows, splitting scales, and simple thresholds were standard (Schätzel, 2014, Kuutmann, 2014). Modern analyses rely increasingly on multivariate taggers:
- Energy Correlation Functions (ECF): E.g., provides robust QCD discrimination and, when decorrelated with jet mass, reduces classifier-induced sculpting.
- Variable- clustering and mass-jump veto: HOTVR achieves stable performance over a broad range by dynamically adjusting and freezing out genuine massive subjets (Lapsien et al., 2016).
- Deep Learning: Approaches using constituent four-vectors, convolutional filters (jet images), or Lorentz-invariant layers outperform legacy taggers, especially at the highest boosts. DeepAK8 and similar architectures gain up to a factor of 20 in background suppression at fixed efficiency, crucial for future high-luminosity scenarios (Collaboration, 2020, Butter et al., 2017).
6. Special Techniques: Subjet -Tagging and Rest-Frame Methods
Standard -tagging loses power at high due to collimation; associating tracks to subjets in the jet rest frame and using signed impact parameter significances yields an order of magnitude better background rejection at fixed signal efficiency, stable even in high pileup (Chen, 2013).
Additionally, methods leveraging rest-frame reclustering, energy asymmetry, and opening angle variables, combined in boosted decision trees, optimize discrimination between and QCD (Chen, 2013). These are complementary to substructure-cut and ML-based methods.
7. Outlook and Significance
Lorentz-boosted hadronic top quark reconstruction and tagging are now central in precision Standard Model and BSM analyses at the LHC. Tagging techniques—including advanced grooming, substructure observables, multivariate (BDT, DNN) architectures, and sophisticated -tagging—decisively improve selection purity, signal sensitivity, and systematic control across a wide range. These developments are key enablers for analyses in high-luminosity runs and are broadly adopted or adapted by ATLAS, CMS, and theoretical groups (Auerbach et al., 2013, Kuutmann, 2014, Schätzel, 2014, Butter et al., 2017, Collaboration, 2022, Collaboration, 2020, Lapsien et al., 2016).