Papers
Topics
Authors
Recent
2000 character limit reached

Statistical Jet Bundle: Information Geometry

Updated 21 November 2025
  • The statistical jet bundle is a geometric structure that organizes higher-order differential information and hierarchies of variance bounds in statistical models.
  • It employs jet bundles, contact geometry, and Cartan distributions to derive curvature corrections and integrability conditions for estimator efficiency.
  • The framework finds applications in both information geometry and collider phenomenology, enabling improved background rejection and measurement precision via ensemble jet analysis.

A statistical jet bundle is a geometric structure that systematically organizes higher-order differential information about statistical models, particularly encoding the hierarchy of variance bounds—including the Cramér–Rao and Bhattacharyya-type inequalities—within the framework of jet bundles, contact geometry, and Cartan distributions. Introduced and developed in the context of information geometry, the statistical jet bundle formalism provides a unified, intrinsic, and coordinate-free foundation for analyzing estimator efficiency, curvature corrections to variance lower bounds, and the geometric and differential-algebraic criteria for optimality of statistical estimators (Krishnan, 19 Nov 2025).

1. The Statistical Bundle and Jet Bundles

The foundational object is the statistical bundle E=Θ×H=R×L2(μ)E = \Theta \times H = \mathbb{R} \times L^2(\mu), where ΘR\Theta \cong \mathbb{R} is the parameter space with coordinate θ\theta, and the fibre over each θ\theta is the Hilbert space H=L2(μ)H = L^2(\mu) of square-integrable functions on the sample space. A section of EE is a mapping s:ΘHs: \Theta \to H, such as the square-root embedding sθ(x)=f(x;θ)s_\theta(x) = \sqrt{f(x;\theta)} associated with a parametric family of densities.

The mm-th order statistical jet bundle Jm(E)J^m(E) over Θ\Theta consists, at each θ\theta, of all mm-jets (equivalence classes of local sections matching up to mm derivatives at θ\theta). In coordinates, an element of Jm(E)J^m(E) is represented as (θ;s0,s1,...,sm)(\theta; s_0, s_1, ..., s_m), where each skHs_k \in H denotes the kk-th derivative s(k)(θ)s^{(k)}(\theta). Natural projections πm,k:Jm(E)Jk(E)\pi_{m,k}: J^m(E) \rightarrow J^k(E) forget higher derivatives, forming a tower of bundles.

2. Canonical Contact Forms and Total Derivatives

On the trivial finite-dimensional jet bundle Jm(R×R)J^m(\mathbb{R} \times \mathbb{R}), the standard contact 1-forms are

ωk=dsksk+1dθ,0km1.\omega_k = ds_k - s_{k+1} d\theta, \quad 0 \leq k \leq m-1.

These forms vanish along holonomic prolongations, i.e., lifts of genuine sections and their derivatives.

On the infinite-dimensional statistical jet bundle Jm(E)J^m(E), ωk\omega_k retains its form, with sks_k interpreted as HH-valued and dskds_k as HH-valued 1-forms. For each xx in the sample space, the scalar evaluation ωk,δx\langle \omega_k, \delta_x \rangle matches the classical contact form at xx.

The total derivative, or Cartan vector field, on Jm(E)J^m(E) is given by

D=θ+s1s0+s2s1++smsm1.D = \partial_\theta + s_1 \partial_{s_0} + s_2 \partial_{s_1} + \cdots + s_m \partial_{s_{m-1}}.

This operator generates the rank-1 Cartan distribution by annihilation of the contact forms.

3. Cartan Distribution, Ehresmann Connection, Torsion, and Curvature

The Cartan distribution Cm=k=0m1kerωk\mathcal{C}^m = \bigcap_{k=0}^{m-1} \ker \omega_k is a rank-1 distribution on Jm(E)J^m(E), spanned by DD. For the bundle projection πm,m1:Jm(E)Jm1(E)\pi_{m,m-1}: J^m(E) \rightarrow J^{m-1}(E), the vertical bundle Vm=ker(πm,m1)=span{sm}V^m = \ker (\pi_{m,m-1})_* = \operatorname{span}\{ \partial_{s_m} \}. The connection is encoded by requiring the contact 1-form ωm1\omega_{m-1} to vanish on horizontal vectors, yielding a decomposition TJm(E)=HmVmT J^m(E) = H^m \oplus V^m, with the horizontal component HmH^m the kernel of ωm1\omega_{m-1}.

The torsion 1-form is calculated as

T=iDdωm1=ωm,T = i_D d\omega_{m-1} = -\omega_m,

while the curvature 2-form is R=dωm1R = d\omega_{m-1}. Non-zero curvature measures the non-integrability of HmH^m, representing the geometric source of curvature corrections in estimator variance bounds.

4. Efficient Models and ODE Submanifolds

A statistical model is termed “efficient up to order mm” if the estimator residual function lies in the span of the first mm derivatives ηk(θ)=θksθ\eta_k(\theta) = \partial_\theta^k s_\theta, 1km1 \leq k \leq m. Equivalently, there exist coefficients a0(θ),...,am(θ)a_0(\theta), ..., a_m(\theta), not all zero, such that for all xx \in sample space,

k=0mak(θ)θksθ(x)=0.\sum_{k=0}^m a_k(\theta) \, \partial_\theta^k s_\theta(x) = 0.

The submanifold ΣJm(E)\Sigma \subset J^m(E), defined by the linear constraint F(θ,s0,...,sm)=k=0mak(θ)sk=0F(\theta, s_0, ..., s_m) = \sum_{k=0}^m a_k(\theta) s_k = 0, specifies the locus of mm-th order efficient models. The image of the mm-jet prolongation jmsj^m s must lie entirely in Σ\Sigma for all θ\theta.

5. Integrability, Variance Bounds, and Curvature Corrections

Classical information inequalities, such as the Cramér–Rao bound (CRB) and Bhattacharyya-type bounds, are re-expressed geometrically in the jet bundle formalism. An unbiased estimator TT achieves mm-th order efficiency if and only if the residual e(x;θ)sθ(x)=(T(x)θ)sθ(x)e(x;\theta) \, s_\theta(x) = (T(x) - \theta) s_\theta(x) lies in the span of the first mm derivatives of sθs_\theta. This is algebraically equivalent to the assertion that sθs_\theta satisfies a homogeneous linear ODE of order mm.

Geometrically, jmsj^m s must be both contained in Σ\Sigma and tangent to the Cartan distribution, i.e., DFΣ=0D F|_{\Sigma} = 0, implying that Σ\Sigma is an integral submanifold for DD. Non-integrability, measured by torsion or curvature, precisely quantifies the amount by which an estimator fails to achieve the refined bound, and the geometric structure supplies the necessary extrinsic corrections to the variance.

For m=1m=1, the second fundamental form of the embedding s:ΘHs: \Theta \to H is

II(θ,θ)=P(s(θ))=s(θ)s(θ),s(θ)s(θ)2s(θ),II(\partial_\theta, \partial_\theta) = P_\perp(s''(\theta)) = s''(\theta) - \frac{\langle s''(\theta), s'(\theta) \rangle}{\|s'(\theta)\|^2} s'(\theta),

and the variance of an unbiased estimator TT satisfies

Varθ(T)s(θ)2+II(θ,θ)2s(θ)4,\operatorname{Var}_\theta(T) \geq \|s'(\theta)\|^{-2} + \frac{\|II(\partial_\theta, \partial_\theta)\|^2}{\|s'(\theta)\|^4},

where the first term is the inverse Fisher information and the second term arises from curvature. For m>1m > 1, higher-order “fundamental forms” yield Bhattacharyya-type corrections, and integrability of multiple intersecting efficiency ODEs signals the vanishing of all residual torsion/curvature, corresponding to full mm-th order efficiency.

6. Jet Bundle Formalism in Collider Phenomenology

In collider phenomenology, a “statistical jet bundle” denotes the construction in which each collision event’s jet is represented by an ensemble of clustering trees, obtained by randomizing the clustering sequence according to probabilistic weights (as in the Q-jets formalism) (Ellis et al., 2012). Each jet thus gives rise to a bundle of trees over the same event, enabling one to empirically paper the distribution of any observable OO across the ensemble.

For a given event, the Q-jets procedure randomizes the recombination of constituent four-vectors according to a parameter α\alpha (“rigidity”), constructing multiple trees per jet. Observables are collected over this ensemble, defining the empirical distribution fN(O)=1Nk=1Nδ(OOk)f_N(O) = \frac{1}{N} \sum_{k=1}^N \delta(O - O_k) and associated summary statistics—mean, variance, and higher moments.

The width (variance) of these distributions serves as a powerful new discriminant: signal jets (e.g., boosted WqqˉW \to q\bar{q}) typically exhibit narrow mass distributions, while QCD jets display broad volatility. Application of volatility cuts yields significant improvements to signal significance and statistical efficiency, reducing required integrated luminosity by up to a factor of two for boosted-object searches, as detailed in (Ellis et al., 2012).

7. Unifying Principles and Significance

The statistical jet bundle provides a rigorous language for encoding estimator efficiency, variance bounds, and curvature corrections in a single geometric hierarchy. In the information geometry context, the data required for the statement “variance \geq 1/Fisher + curvature correction” is summarized as “the mm-th prolonged section jmsj^m s is an integral curve of the Cartan distribution restricted to the ODE submanifold Σm\Sigma_m.” The jet bundle formalism links algebraic projection conditions and geometric integrability in a unified framework, offering new conceptual and technical insights into the geometry of statistical estimation (Krishnan, 19 Nov 2025).

In collider analysis, the statistical jet bundle (in the Q-jets sense) enables the exploration of the statistical properties of jet observables at the ensemble level, fostering enhanced stability of background rejection and improved measurement precision (Ellis et al., 2012). This dual usage underscores the unifying power of jet bundle formalism in both information geometry and physical data analysis contexts.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (2)
Forward Email Streamline Icon: https://streamlinehq.com

Follow Topic

Get notified by email when new papers are published related to Statistical Jet Bundle.