Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
118 tokens/sec
GPT-4o
12 tokens/sec
Gemini 2.5 Pro Pro
34 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
3 tokens/sec
DeepSeek R1 via Azure Pro
33 tokens/sec
2000 character limit reached

A note on the Artstein-Avidan-Milman's generalized Legendre transforms (2507.20577v1)

Published 28 Jul 2025 in cs.IT, cs.LG, and math.IT

Abstract: Artstein-Avidan and Milman [Annals of mathematics (2009), (169):661-674] characterized invertible reverse-ordering transforms on the space of lower-semi-continuous extended real-valued convex functions as affine deformations of the ordinary Legendre transform. In this note, we prove that all those generalized Legendre transforms on functions correspond to the ordinary Legendre transform on dually corresponding affine-deformed functions. That is, generalized convex conjugates are convex conjugates of affine-deformed functions. We conclude this note by sketching how this result can be interpreted from the lens of information geometry.

Summary

  • The paper demonstrates that any generalized Legendre-Fenchel transform (GLFT) can be recast as an ordinary Legendre-Fenchel transform applied to an affine-deformed function.
  • It provides an explicit, involutive mapping between affine deformation parameters, simplifying the structure and analysis of convex conjugates.
  • The work links these findings to information geometry, showing that affine invariance underpins the duality and divergence measures in dually flat spaces.

Generalized Legendre Transforms as Affine-Deformed Convex Conjugates

Introduction

This note addresses the structure of generalized Legendre-Fenchel transforms (GLFTs) as introduced by Artstein-Avidan and Milman, which characterize all invertible, order-reversing transforms on the space of proper lower semi-continuous convex functions as affine deformations of the classical Legendre-Fenchel transform (LFT). The main result demonstrates that any such GLFT can be realized as an ordinary LFT applied to an affine-deformed function, thus reducing the apparent generality of GLFTs to a canonical form. The implications of this result are further interpreted through the lens of information geometry, connecting the algebraic structure of GLFTs to the geometry of dually flat spaces.

Theoretical Framework

Let Γ0\Gamma_0 denote the space of proper, lower semi-continuous, extended real-valued convex functions on Rm\mathbb{R}^m. The classical Legendre-Fenchel transform LL maps FΓ0F \in \Gamma_0 to its convex conjugate FF^*:

(LF)(η)=supθRm{θ,ηF(θ)}.(LF)(\eta) = \sup_{\theta \in \mathbb{R}^m} \left\{ \langle \theta, \eta \rangle - F(\theta) \right\}.

The Moreau-Fenchel-Rockafellar theorem ensures that for FΓ0F \in \Gamma_0, the biconjugate FF^{**} recovers FF.

Artstein-Avidan and Milman established that any invertible, order-reversing transform TT on Γ0\Gamma_0 must be of the form:

(TF)(η)=λ(LF)(Eη+f)+η,g+h,(TF)(\eta) = \lambda (LF)(E\eta + f) + \langle \eta, g \rangle + h,

where λ>0\lambda > 0, EGL(Rm)E \in GL(\mathbb{R}^m), f,gRmf, g \in \mathbb{R}^m, and hRh \in \mathbb{R}. This result axiomatizes the class of GLFTs.

Main Result: GLFTs as LFTs of Affine-Deformed Functions

The central contribution of the note is the explicit construction showing that any GLFT can be written as the LFT of an affine-deformed function. Specifically, for parameters P=(λ,A,b,c,d)P = (\lambda, A, b, c, d), define the affine deformation:

FP(θ)=λF(Aθ+b)+θ,c+d.F_P(\theta) = \lambda F(A\theta + b) + \langle \theta, c \rangle + d.

It is shown that FPΓ0F_P \in \Gamma_0 whenever FΓ0F \in \Gamma_0 and AGL(Rm)A \in GL(\mathbb{R}^m), and that the LFT of FPF_P is another affine deformation of FF^*, with parameters given by an explicit involutive mapping PPP \mapsto P^\diamond:

P=(λ,1λA1,1λA1c,A1b,b,A1cd).P^\diamond = \left( \lambda, \frac{1}{\lambda}A^{-1}, -\frac{1}{\lambda}A^{-1}c, -A^{-1}b, \langle b, A^{-1}c \rangle - d \right).

Thus,

L(FP)=(LF)P.L(F_P) = (LF)_{P^\diamond}.

The involutive property (P)=P({P^\diamond})^\diamond = P ensures that the LFT remains an involution under affine deformation.

Consequently, any GLFT TT can be realized as:

(TF)(η)=L(FP)(η),(TF)(\eta) = L(F_{P^\diamond})(\eta),

where the parameters of the affine deformation are determined by the GLFT parameters. Figure 1

Figure 1: The ordinary Legendre transform on classes of functions: Relationships with representational Fenchel-Young and Bregman divergences, flat Hessian divergence, and α\alpha-geometry in information geometry.

Examples and Structural Properties

The note provides several canonical examples of convex conjugate pairs, including affine functions, exponential functions (yielding the Shannon entropy as conjugate), and LpL_p norms (with conjugate exponents). The structure of Legendre-type functions is discussed, emphasizing the reciprocal nature of gradients for such pairs and the importance of strict convexity and differentiability on open domains.

The reverse-ordering property of the LFT is highlighted: if F2F1F_2 \leq F_1, then LF2LF1LF_2 \geq LF_1, and vice versa. This property underpins the order-reversing nature of GLFTs. Figure 2

Figure 2

Figure 2: A pair (F(θ),F(η))(F(\theta),F^*(\eta)) of conjugate functions (top) with their subgradients plotted (bottom). F(θ)F(\theta) is not differentiable at θ=0\theta=0 and thus admits a subgradient F(0)\partial F(0) at θ=0\theta=0. F(η)F^*(\eta) is everywhere differentiable, and when θ0\theta \neq 0, F=(F)1\nabla F^* = (\nabla F)^{-1}.

Information-Geometric Interpretation

The result is interpreted within the framework of information geometry, where a strictly convex function FF induces a dually flat manifold (M,g,,)(M, g, \nabla, \nabla^*). The affine freedom in the choice of coordinate systems and potential functions corresponds precisely to the affine deformations in the GLFT characterization. The Fenchel-Young inequality and the associated divergences (Fenchel-Young, Bregman, and dually flat divergences) are invariant under these affine transformations, up to scaling.

The equivalence relation FF~F \sim \tilde{F} if F~=FP\tilde{F} = F_P for some PP partitions Γ0\Gamma_0 into moduli spaces of dually flat structures, and the GLFTs correspond to isometries on these moduli spaces. The scaling parameter λ\lambda reflects the freedom to rescale the metric and connections in the information-geometric structure.

Implications and Future Directions

The reduction of GLFTs to ordinary LFTs on affine-deformed functions clarifies the algebraic and geometric structure underlying duality in convex analysis. This result has implications for the paper of divergences, optimization, and statistical models where duality and convex conjugacy play a central role. In information geometry, it provides a rigorous foundation for the invariance of divergences and geometric structures under affine transformations.

Potential future directions include the exploration of these structures in infinite-dimensional settings, applications to statistical exponential families, and the development of algorithms that exploit the affine invariance of divergences and duality mappings.

Conclusion

This note establishes that all generalized Legendre-Fenchel transforms, as characterized by Artstein-Avidan and Milman, are equivalent to ordinary Legendre-Fenchel transforms applied to affine-deformed functions. The explicit involutive correspondence between the parameters of the affine deformation and the GLFT parameters provides a complete structural understanding of these transforms. The information-geometric interpretation further situates this result within the broader context of dually flat spaces and invariant divergences, offering a unified perspective on duality in convex analysis and geometry.

Authors (1)