- The paper proves that positive semiparametric Fisher information is equivalent to the gradient being in the adjoint score operator's range.
- It employs a quotient space construction and functional analysis to address challenges beyond standard Hilbert space geometry.
- Applications include heavy-tailed mean estimation and pointwise density estimation, clarifying conditions for root-n efficiency.
Introduction and Motivation
This paper addresses a fundamental question in semiparametric inference: when is semiparametric Fisher information positive, and how does this relate to the differentiability of functionals of interest, especially in models parametrized by general normed spaces instead of Hilbert spaces? The work generalizes the canonical result of van der Vaart (1991) to normed models, formalizing the precise condition for positive Fisher information—namely, that the gradient of the parameter lies in the range of the adjoint score operator. This equivalence yields an abstract yet powerful analytic tool for problems where standard Hilbert-space geometry fails, such as estimation under heavy tails or pointwise density estimation.
Consider models parametrized by elements λ in a normed space V, leading to dominated families P={Pλ:λ∈Λ} with associated tangent space T⊂V. Statistical functionals ψ:Λ→R are assumed pathwise differentiable with gradient ψ˙∈T∗. Mean-square differentiability of the model is captured by a continuous linear score operator A:T→L2(P0). The parameter estimation target is β0=ψ(λ0). The central object is the semiparametric Fisher information:
I≡α∈Tinf∣ψ˙α∣2∥Aα∥22.
The pivotal condition, generalizing Hilbert space intuition, is that positive semiparametric Fisher information (I>0) holds if and only if V0, where V1 is the adjoint score operator.
Main Results
The paper’s main theorem formalizes the above equivalence: positive semiparametric Fisher information is necessary and sufficient for the gradient of the target functional to be in the range of the adjoint of the score operator, irrespective of whether V2 is a Hilbert or general normed space. This unifies the analytic structure of semiparametric efficiency bounds:
- Main Theorem: V3 if and only if V4.
Unlike the Hilbert case, the proof does not rely on orthogonal projection or Parseval’s identity, but instead uses the structure of the tangent space quotient V5, and arguments from functional analysis about the dual and adjoint operators. Positive information is tightly characterized via the geometry of the range and kernel of the score operator, extended to the dual spaces.
Local identifiability, i.e., V6, is shown to be both necessary and sufficient (under closedness of the score range) for positive information, and the machinery extends through functional analytic duality relations and quotient space construction.
Applications to Canonical Statistical Problems
Two distinct problems exemplify the analytic reach of the general theorem:
Let V7 for V8, and consider V9. The model is parametrized by deviations P={Pλ:λ∈Λ}0 in P={Pλ:λ∈Λ}1 (the dual space to P={Pλ:λ∈Λ}2), with P={Pλ:λ∈Λ}3 and P={Pλ:λ∈Λ}4 the inclusion into P={Pλ:λ∈Λ}5. The gradient is given by P={Pλ:λ∈Λ}6.
- Criterion: Information is positive if and only if P={Pλ:λ∈Λ}7.
- When P={Pλ:λ∈Λ}8 falls outside P={Pλ:λ∈Λ}9 (the heavy-tailed regime), T⊂V0 and root-T⊂V1 estimation is not possible.
- This corresponds to minimax rates slower than T⊂V2, perfectly aligning semiparametric information geometry with minimax lower bounds for heavy-tailed mean estimation.
Estimating Density at a Point
For T⊂V3 (the space of continuous functions), suppose the parameter of interest is T⊂V4 for some T⊂V5. The local parametrization uses deviations T⊂V6 with T⊂V7 for T⊂V8 supported locally at T⊂V9. The gradient is ψ:Λ→R0, with the adjoint score operator mapping to regular Borel measures.
- Criterion: Information is positive if and only if the underlying reference measure ψ:Λ→R1 has positive mass at ψ:Λ→R2, i.e., ψ:Λ→R3 (as in the discrete case).
- For ψ:Λ→R4 the Lebesgue measure, information vanishes, reflecting the impossibility of root-ψ:Λ→R5 estimation for density at a fixed point under standard smoothness.
Both results reframe classical minimax theory in terms of the geometry of score and adjoint operators, providing not only necessary but also sufficient analytic conditions for regular estimation.
Theoretical Implications
This work achieves a rigorous unification between differentiability, the analytic geometry of adjoint operators, and lower bounds for estimation in broad normed settings. It establishes the general functional-analytic condition under which root-ψ:Λ→R6 estimation is possible in semiparametric or nonparametric models, subsuming and clarifying the scope of prior results for Hilbert space models. The duality between tangent space geometry and the nature of the statistical functional connects efficiency theory, minimax theory, and pathwise differentiability in a single analytic framework.
Moreover, the quotient space construction and conditions involving closedness of ψ:Λ→R7 provide a template for handling cases where direct Hilbert methods break down (non-square-integrable settings, point evaluation functionals, etc.).
Practical Consequences and Future Directions
The criterion exposed has direct implications for the design of estimators and the choice of modeling restrictions in high-dimensional, heavy-tailed, and nonparametric problems. In complex models, it indicates:
- When information is strictly positive, semiparametric efficiency bounds and root-ψ:Λ→R8 regular estimators exist.
- When the condition fails, minimax rates must necessarily be slower, and alternative strategies (e.g., trimming, aggregation, robustification) may be required.
In practice, the criterion can be used diagnostically: functional gradients, spaces, and score operators can be directly checked for the range condition, clarifying in advance when efficiency theory applies or standard estimators will be rate-optimal.
A natural avenue of future exploration is to relax the domination, mean-square differentiability, and global pathwise differentiability assumptions, potentially treating infinite-dimensional or more intricate statistical manifolds.
Conclusion
By generalizing the core equivalence between differentiability and positive information to general normed space models, this paper establishes a precise algebraic-analytic condition for semiparametric efficiency. The structural characterization of positive Fisher information through the range of the adjoint score operator clarifies the limitations and possibilities for regular inference beyond classical Hilbert-space settings, operationalizing efficiency theory for a broad class of statistical models.