Run-2 Proton-Proton Collision Dataset
- Run-2 Proton-Proton Collision Dataset is a comprehensive collection from LHC Run 2 at 13 TeV featuring an integrated luminosity of ~140 fb⁻¹ and rigorous calibration.
- The dataset employs advanced methods like van der Meer scans and cross-detector validations to minimize systematic uncertainties to 1–2%.
- State-of-the-art proton reconstruction techniques, including PPS and multi-RP global fits, offer precise measurements for Standard Model and new physics studies.
The Run-2 Proton-Proton (pp) Collision Dataset denotes the collection of proton-proton collision data acquired during Large Hadron Collider (LHC) Run 2 (2015–2018), predominantly at a center-of-mass energy TeV. Both the CMS and ATLAS experiments, as well as specialized systems such as the Precision Proton Spectrometer (PPS), utilized dedicated data-taking, calibration, and validation workflows to enable high-precision Standard Model measurements and searches for new physics. This dataset is characterized by its unprecedented integrated luminosity, meticulous absolute luminosity calibration, detailed quantification of systematic uncertainties, and rigorous cross-detector consistency verification (Giraldi, 2022, Collaboration, 2021, Ferro, 2021).
1. Integrated Luminosity and Data Collection
During Run 2, the LHC provided proton beams at TeV, resulting in TeV collisions. Data-taking extended from 2015 to 2018, with annual and total integrated luminosities as summarized in the following table, which aggregates values from CMS and ATLAS:
| Year | CMS (fb) | ATLAS (fb) |
|---|---|---|
| 2015 | 2.27 0.04 | — |
| 2016 | 36.3 0.44 | — |
| 2017 | 41.5 0.96 | — |
| 2018 | 59.8 1.50 | — |
| Total | 139.9 1.8 | 139 2.4 |
For ATLAS, the total good-quality integrated luminosity is $139$ fb with a systematic uncertainty of from LUCID-2 calibration. CMS quotes an overall Run 2 precision of , with the total relative uncertainty at approximately (Giraldi, 2022, Collaboration, 2021).
2. Absolute Luminosity Calibration
The absolute luminosity scale for pp collisions in Run 2 was established via van der Meer (VdM) beam-separation scans. During dedicated VdM fills, LHC beams were moved in steps in the transverse (, ) directions. The observed luminometer rates were fitted to double-Gaussian profiles to extract the overlap widths . The visible cross section for each luminometer was determined by
where are the bunch populations (corrected for ghost and satellite charge), Hz is the revolution frequency, and is the head-on rate. Instantaneous luminosity in physics fills was derived as (Giraldi, 2022, Collaboration, 2021).
ATLAS utilized the LUCID-2 Cherenkov detector, also calibrated via VdM scans. The primary methods and performance metrics were consistent across major LHC experiments.
3. Systematic Uncertainty Quantification
Systematic uncertainties in the luminosity measurement are categorized as arising from VdM-scan calibration and physics-fill integration. Quantitative breakdowns for each year (for CMS) include:
- Calibration (VdM scan):
- Ghost & satellite charge:
- Beam-current normalization:
- Orbit drift:
- Residual scan-to-scan differences:
- Beam–beam effects:
- Length scale calibration:
- Transverse non-factorizability:
- Integration (physics fill):
- Out-of-time pileup Type-1 (afterglow):
- Out-of-time pileup Type-2:
- Cross-detector stability:
- Linearity (extrapolation effects):
- CMS DAQ deadtime:
Total per-year uncertainties for CMS are (2015), (2016), (2017), and (2018). The final combined uncertainty across Run 2 is , which surpasses the precision achieved in previous LHC and Tevatron runs (Giraldi, 2022).
ATLAS quotes an integrated luminosity uncertainty of on the full dataset, validated by cross-checks between LUCID-2 and supplementary luminometers (Collaboration, 2021).
4. Data Quality, Triggering, and Pile-Up Mitigation
Run 2 data-taking imposed stringent quality requirements, including stable beams and full operational status of all detector subsystems. Events passing these criteria constitute the good-run lists (GRL). Peak instantaneous luminosities reached in 2018. The mean pile-up was $33.7$, with instantaneous values up to $60$.
ATLAS employed single-lepton triggers with online thresholds of $24-26$ GeV (electrons) and thresholds of $20-26$ GeV (muons). Trigger efficiency turn-on reached plateau by GeV (Collaboration, 2021). Offline, reconstructed objects were matched to trigger objects, with jets and leptons subjected to pile-up mitigation using:
- Jet–Vertex Tagger (JVT) to associate jets with primary vertices,
- Track-based soft terms in using tracks matched to the primary vertex,
- Lepton isolation and dedicated BDTs for suppression of non-prompt leptons.
Pile-up modeling in Monte Carlo overlaid inelastic pp events with minimum-bias events (Pythia 8, A3 tune, NNPDF2.3lo), and pile-up weights corrected to match the observed distribution:
5. Cross-Detector Linearity and Stability Validation
CMS employed a comprehensive suite of stability and linearity checks:
- Emittance scans: Short "mini‐VdM" scans at the start and end of physics fills probed detector response vs. beam overlap and pile-up.
- Afterglow corrections: Correction for out-of-time pileup in luminometers with long response tails, performed by subtracting measured rates in empty bunch crossings.
- Time-dependent corrections: Efficiency drifts in channels (e.g., HFET/HFOC) addressed via time-dependent correction factors from emittance-scan data.
- Cross-detector cross-checks: Comparison and correlation studies across luminometers (PLT, BCM1F, HF, PCC, VTX, DT, RAMSES) to ensure alignment within systematic uncertainties. Any residual non-linearity or response drift is included in the “cross-detector stability” systematic (Giraldi, 2022).
6. The PPS Run 2 Dataset and Proton Reconstruction
The Precision Proton Spectrometer (PPS) collected 110 fb (2016–2018), covering approximately of the CMS pp dataset. Alignment and optics calibration proceeded in three stages:
- Absolute alignment: Roman Pots (RP) inserted down to using collimator-scan techniques, achieving m precision.
- Relative alignment: Internal alignment within each arm attains m precision.
- Transfer to physics fills: Correction of (, ) shifts ensures matching between the alignment fill and physics data, with typical combined position uncertainties of m.
Optics calibration used first-order transport equations, with parameters matched to LHC optics databases and updated using minimum-bias and exclusive dilepton events:
where is the proton fractional momentum loss. Calibration of the horizontal dispersion has reached precisions , with optical-uncertainty contributions to at the level.
Proton reconstruction is performed via:
- Single-RP method: , ,
- Multi-RP global fit: using full transport matrices, achieving and GeV, with systematic uncertainties on at the few level.
Validation utilizes exclusive events, where (proton-determined minus central detector-determined) agrees within between data and simulation.
7. Significance and Impact
The Run-2 proton-proton dataset delivers an integrated luminosity (140 fb) with a relative precision of , representing the most precise luminosity determination to date at bunched-beam hadron colliders (Giraldi, 2022, Collaboration, 2021). The resulting data underpin a broad range of Standard Model and beyond-the-Standard-Model analyses, from R-parity-violating supersymmetry searches to measurements of central exclusive production channels. The infrastructure of beam-based calibration, cross-checks among luminometers, and advanced reconstruction (e.g., PPS optics and tracking) ensures high-fidelity measurements suitable for future precision physics at the highest LHC luminosities (Ferro, 2021).