Papers
Topics
Authors
Recent
Search
2000 character limit reached

HistoAE: Histogram AutoEncoder in Particle Physics

Updated 4 December 2025
  • HistoAE is a deep unsupervised autoencoder that employs a histogram-based loss function to structure its latent space for decoding particle charge and impact position.
  • The architecture uses specialized vecoding preprocessing and fully-connected layers to achieve high measurement precision, with charge resolution <0.3 e and position resolution of ~3 μm.
  • Its generative decoding mode provides fast, accurate simulation of silicon microstrip detector signals, matching the performance of traditional Geant4-based methods.

The Histogram AutoEncoder (HistoAE) is a deep unsupervised representation learning architecture designed for high precision measurement and structured latent interpretability in particle physics applications, specifically silicon microstrip detectors. HistoAE incorporates a tailored histogram-based loss function to enforce physically meaningful organization of the latent space, enabling direct, quantitative decoding of particle charge and inter-strip impact position from raw detector signals. The architecture facilitates both measurement tasks and generative simulation, achieving resolutions on real beam-test datasets that match conventional supervised approaches and providing substantial speed advantages in detector simulation contexts (Lv et al., 27 Nov 2025).

1. Architecture and Input Preprocessing

HistoAE leverages an autoencoder framework that processes silicon-microstrip-detector clusters via a specialized preprocessing stage termed “vecoding” (Editor's term). For each event cluster, the five readout channels with the largest signals are selected. Each channel’s raw ADC value—varying from O(1)\mathcal{O}(1) to O(104)\mathcal{O}(10^4)—is encoded into a fixed-length, digit-wise base-10 vector, including one digit after the decimal point. For example, x=9764.4x=9764.4 is encoded as [0,9,7,6,4,4][0,9,7,6,4,4]. Concatenation across five channels yields a $30$-dimensional input vector.

The encoder consists of fully-connected layers sequentially mapping 30256128643230 \rightarrow 256 \rightarrow 128 \rightarrow 64 \rightarrow 32 units, followed by batch normalization and SiLU activation after each layer, and a final $2$-dimensional linear projection producing the latent code z=(zc,zp)z=(z_c,z_p). The decoder mirrors the encoder: 232641282562 \rightarrow 32 \rightarrow 64 \rightarrow 128 \rightarrow 256 units, with similar normalization and activation, projecting back to a $30$-dimensional output vector reshaped as five decoded channel vecodings.

2. Histogram-Based Latent-Space Loss (HistoLoss)

Central to HistoAE is the custom histogram-based loss function ("HistoLoss") that enforces structure in the latent space. This loss guides the autoencoder to represent charge and position in the latent codes (zc,zp)(z_c,z_p):

  • Target Density Htarget(x,y)H_{\mathrm{target}}(x,y): Encodes physical priors—a Gaussian mixture for charge (integer nuclear ZZ species in the observed range) combined with a uniform distribution for normalized impact positions:

Htarget(x,y)=[k=ZminZmaxwkN(xμk,σk2)]Uniform(y[0,1])H_{\mathrm{target}}(x, y) = \left[\, \sum_{k=Z_{\min}}^{Z_{\max}} w_k\, N(x\,|\,\mu_k,\sigma_k^2)\, \right] \cdot \mathrm{Uniform}(y \in [0,1])

  • Empirical Density Hemp(x,y)H_{\mathrm{emp}}(x, y): Computed by linearly rescaling zcz_c and zpz_p to [0,1][0,1] and constructing mini-batch histograms (binned or kernel-smoothed with Gaussian kernels):

Hempj=1Ni=1N1{(zc,i,zp,i)binj}H_{\mathrm{emp}}^j = \frac{1}{N} \sum_{i=1}^N \mathbb{1}\left\{\, (z_{c,i},z_{p,i}) \in \text{bin}\, j \,\right\}

or

Hemp(x,y)=1Ni=1NKσ(xzc,i)Kσ(yzp,i)H_{\mathrm{emp}}(x,y) = \frac{1}{N} \sum_{i=1}^N K_\sigma(x-z_{c,i}) K_\sigma(y-z_{p,i})

  • Loss: The HistoLoss is the L1L_1 distance between empirical and target histograms:

Lhisto=HempHtarget1=jHempjHtargetjL_{\mathrm{histo}} = \| H_{\mathrm{emp}} - H_{\mathrm{target}} \|_1 = \sum_j | H_{\mathrm{emp}}^j - H_{\mathrm{target}}^j |

  • Full Objective:

Ltotal=Lrecon+λhistoLhistoL_{\mathrm{total}} = L_{\mathrm{recon}} + \lambda_{\mathrm{histo}} L_{\mathrm{histo}}

where LreconL_{\mathrm{recon}} is the mean-squared error between input and output vecodings, and λhisto\lambda_{\mathrm{histo}} regulates latent-space regularization.

This approach directs zcz_c to cluster at integer charge values (via Gaussian peaks) and zpz_p to span the full inter-strip region uniformly, recovering latent coordinates directly aligned with physical observables, without additional supervision.

3. Latent-Space Interpretability and Physical Mapping

The HistoLoss shapes the distribution of the latent codes such that:

  • zcz_c forms KK equidistant peaks, each corresponding to a distinct nuclear charge Z{3,,22}Z\in\{3,\ldots,22\} (the experimental range), matching the Gaussian mixture specified in HtargetH_{\mathrm{target}}.
  • zpz_p is distributed smoothly and uniformly in [0,1][0,1], encoding the normalized sub-strip impact position between adjacent detector strips.

This mapping emerges solely from unsupervised histogram matching, requiring no explicit physical labels during training. Physical interpretability is validated post hoc by comparing the latent codes against beam-test ground truth, confirming precise alignment with charge and position.

4. Training Protocol and Dataset

HistoAE training utilizes approximately 5M5\,\mathrm{M} real beam-test clusters acquired at the CERN SPS, probing mixed nuclei (A/Z=2A/Z=2, 150GeV/n150\,\mathrm{GeV}/n). Data is partitioned 80%80\% for training, 20%20\% for testing. Training employs a two-stage strategy:

  1. Pre-training: Restricted to Z[3,13]Z\in[3,13], batch size 10,00010{,}000, run for 10,00010{,}000 epochs.
  2. Post-training: Extended to Z[3,22]Z\in[3,22], batch size 40,00040{,}000, additional 10,00010{,}000 epochs.

Optimisation uses AdamW with weight decay 10410^{-4} and initial learning rate 10310^{-3}, following a schedule of linear warm-up for $10$ epochs and cosine annealing to 10510^{-5}. Mixed-precision training via torch.amp enhances speed and numerical stability.

5. Measurement Extraction and Performance

After training, physical observables are extracted from the latent space via streamlined post-processing routines:

  • Charge Extraction:
    • Cluster zc,iz_{c,i} from the test set, identify local maxima in the zcz_c distribution, corresponding to integer ZZ via kZ=k+Zmin1k\mapsto Z=k+Z_{\min}-1.
    • Define charge estimator y^charge=azc+b\hat{y}_{\mathrm{charge}} = a\,z_c + b by linear interpolation between the first and last charge peaks.
    • The achieved charge resolution is σcharge(Z)=E[(y^chargeZtrue)2]\sigma_{\mathrm{charge}}(Z) = \sqrt{E[(\hat{y}_{\mathrm{charge}}-Z_{\mathrm{true}})^2]}, yielding σcharge0.3e\sigma_{\mathrm{charge}}\lesssim 0.3\,e across Z=322Z=3\ldots22, mean 0.25e0.25\,e.
  • Position Extraction:
    • Raw zp[0,1]z_p\in[0,1] is ambiguous under left-right symmetry; resolved using the two largest channels to compute η=SR/(SL+SR)\eta=S_R/(S_L+S_R).
    • If η<0.5\eta<0.5, retain zpz_p; else replace zp1zpz_p \to 1-z_p.
    • Rescale to physical sub-strip coordinate x[0,pitch]x\in[0,\mathrm{pitch}].
    • Position resolution: σpos=E[(xAExtrue)2]\sigma_{\mathrm{pos}} = \sqrt{E[(x_{\mathrm{AE}} - x_{\mathrm{true}})^2]}, achieving σpos3μm\sigma_{\mathrm{pos}}\simeq 3\,\mu\mathrm{m}, equivalent to conventional interpolation.

These results demonstrate competitive measurement precision in fully unsupervised mode.

6. Fast Detector Simulation via Generative Decoding

HistoAE’s decoder models p(clusterzc,zp)p(\mathrm{cluster}|z_c,z_p), facilitating efficient cluster simulation:

  1. Select target (Z,x)(Z,x), map to latent mean (μZ,xnorm)(\mu_Z,x_{\mathrm{norm}}).
  2. Sample perturbed latents zcN(μZ,σcharge2)z'_c\sim N(\mu_Z,\sigma^2_{\mathrm{charge}}), zpN(xnorm,σpos2)z'_p\sim N(x_{\mathrm{norm}},\sigma^2_{\mathrm{pos}}).
  3. Decode to obtain five synthetic channel signals.

This generative mode produces strip-cluster patterns orders of magnitude faster than traditional Geant4-based simulation, reproducing charge and position distributions consistent with the measurement branch.

7. Significance and Outlook

HistoAE constitutes the first framework for unsupervised, interpretable, and quantitatively precise measurement of both charge and impact position in silicon microstrip detectors. Its architecture—combining deep autoencoders, histogram-based latent regularization, and minimal post-processing—enables not only direct measurement but also rapid simulation, with demonstrable parity to established supervised approaches (Lv et al., 27 Nov 2025). This suggests broader applicability to other measurement-driven unsupervised learning tasks in high energy physics and beyond, provided suitable alignment of latent priors and physical distributions.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Histogram AutoEncoder (HistoAE).