Statistical Jet Bundle: Information Geometry
- The statistical jet bundle is a geometric structure that organizes higher-order differential information and hierarchies of variance bounds in statistical models.
- It employs jet bundles, contact geometry, and Cartan distributions to derive curvature corrections and integrability conditions for estimator efficiency.
- The framework finds applications in both information geometry and collider phenomenology, enabling improved background rejection and measurement precision via ensemble jet analysis.
A statistical jet bundle is a geometric structure that systematically organizes higher-order differential information about statistical models, particularly encoding the hierarchy of variance bounds—including the Cramér–Rao and Bhattacharyya-type inequalities—within the framework of jet bundles, contact geometry, and Cartan distributions. Introduced and developed in the context of information geometry, the statistical jet bundle formalism provides a unified, intrinsic, and coordinate-free foundation for analyzing estimator efficiency, curvature corrections to variance lower bounds, and the geometric and differential-algebraic criteria for optimality of statistical estimators (Krishnan, 19 Nov 2025).
1. The Statistical Bundle and Jet Bundles
The foundational object is the statistical bundle , where is the parameter space with coordinate , and the fibre over each is the Hilbert space of square-integrable functions on the sample space. A section of is a mapping , such as the square-root embedding associated with a parametric family of densities.
The -th order statistical jet bundle over consists, at each , of all -jets (equivalence classes of local sections matching up to derivatives at ). In coordinates, an element of is represented as , where each denotes the -th derivative . Natural projections forget higher derivatives, forming a tower of bundles.
2. Canonical Contact Forms and Total Derivatives
On the trivial finite-dimensional jet bundle , the standard contact 1-forms are
These forms vanish along holonomic prolongations, i.e., lifts of genuine sections and their derivatives.
On the infinite-dimensional statistical jet bundle , retains its form, with interpreted as -valued and as -valued 1-forms. For each in the sample space, the scalar evaluation matches the classical contact form at .
The total derivative, or Cartan vector field, on is given by
This operator generates the rank-1 Cartan distribution by annihilation of the contact forms.
3. Cartan Distribution, Ehresmann Connection, Torsion, and Curvature
The Cartan distribution is a rank-1 distribution on , spanned by . For the bundle projection , the vertical bundle . The connection is encoded by requiring the contact 1-form to vanish on horizontal vectors, yielding a decomposition , with the horizontal component the kernel of .
The torsion 1-form is calculated as
while the curvature 2-form is . Non-zero curvature measures the non-integrability of , representing the geometric source of curvature corrections in estimator variance bounds.
4. Efficient Models and ODE Submanifolds
A statistical model is termed “efficient up to order ” if the estimator residual function lies in the span of the first derivatives , . Equivalently, there exist coefficients , not all zero, such that for all sample space,
The submanifold , defined by the linear constraint , specifies the locus of -th order efficient models. The image of the -jet prolongation must lie entirely in for all .
5. Integrability, Variance Bounds, and Curvature Corrections
Classical information inequalities, such as the Cramér–Rao bound (CRB) and Bhattacharyya-type bounds, are re-expressed geometrically in the jet bundle formalism. An unbiased estimator achieves -th order efficiency if and only if the residual lies in the span of the first derivatives of . This is algebraically equivalent to the assertion that satisfies a homogeneous linear ODE of order .
Geometrically, must be both contained in and tangent to the Cartan distribution, i.e., , implying that is an integral submanifold for . Non-integrability, measured by torsion or curvature, precisely quantifies the amount by which an estimator fails to achieve the refined bound, and the geometric structure supplies the necessary extrinsic corrections to the variance.
For , the second fundamental form of the embedding is
and the variance of an unbiased estimator satisfies
where the first term is the inverse Fisher information and the second term arises from curvature. For , higher-order “fundamental forms” yield Bhattacharyya-type corrections, and integrability of multiple intersecting efficiency ODEs signals the vanishing of all residual torsion/curvature, corresponding to full -th order efficiency.
6. Jet Bundle Formalism in Collider Phenomenology
In collider phenomenology, a “statistical jet bundle” denotes the construction in which each collision event’s jet is represented by an ensemble of clustering trees, obtained by randomizing the clustering sequence according to probabilistic weights (as in the Q-jets formalism) (Ellis et al., 2012). Each jet thus gives rise to a bundle of trees over the same event, enabling one to empirically paper the distribution of any observable across the ensemble.
For a given event, the Q-jets procedure randomizes the recombination of constituent four-vectors according to a parameter (“rigidity”), constructing multiple trees per jet. Observables are collected over this ensemble, defining the empirical distribution and associated summary statistics—mean, variance, and higher moments.
The width (variance) of these distributions serves as a powerful new discriminant: signal jets (e.g., boosted ) typically exhibit narrow mass distributions, while QCD jets display broad volatility. Application of volatility cuts yields significant improvements to signal significance and statistical efficiency, reducing required integrated luminosity by up to a factor of two for boosted-object searches, as detailed in (Ellis et al., 2012).
7. Unifying Principles and Significance
The statistical jet bundle provides a rigorous language for encoding estimator efficiency, variance bounds, and curvature corrections in a single geometric hierarchy. In the information geometry context, the data required for the statement “variance 1/Fisher + curvature correction” is summarized as “the -th prolonged section is an integral curve of the Cartan distribution restricted to the ODE submanifold .” The jet bundle formalism links algebraic projection conditions and geometric integrability in a unified framework, offering new conceptual and technical insights into the geometry of statistical estimation (Krishnan, 19 Nov 2025).
In collider analysis, the statistical jet bundle (in the Q-jets sense) enables the exploration of the statistical properties of jet observables at the ensemble level, fostering enhanced stability of background rejection and improved measurement precision (Ellis et al., 2012). This dual usage underscores the unifying power of jet bundle formalism in both information geometry and physical data analysis contexts.
Sponsored by Paperpile, the PDF & BibTeX manager trusted by top AI labs.
Get 30 days free