Papers
Topics
Authors
Recent
2000 character limit reached

Shifted-Truncated Gamma (G-STG) Prior

Updated 16 November 2025
  • The Shifted-Truncated-Gamma (G-STG) prior is a finite-interval modification of the gamma distribution that enables effective shrinkage and variable selection in high-dimensional GLMs.
  • It applies truncation and shifting to achieve analytic tractability, closed-form normalization, and robust moment computations essential for precise Bayesian inference.
  • The prior supports efficient sampling and Laplace approximations, enhancing model selection consistency and predictive performance across various scientific applications.

The Shifted-Truncated-Gamma (G-STG) prior is a finite-interval modification of the classical gamma distribution, intensively utilized in Bayesian model selection for Generalized Linear Models (GLMs) to parameterize shrinkage or regularization factors. This prior is defined by truncating the support of the gamma distribution to a bounded domain and optionally shifting its lower bound. The G-STG prior is notable for its analytic tractability, local geometric adaptation to model curvature, and satisfaction of essential Bayesian consistency desiderata, rendering it particularly effective for variable selection and model averaging tasks in high-dimensional inference settings.

1. Definition and Mathematical Formulation

The untruncated gamma distribution with shape parameter α>0\alpha > 0 and rate parameter β>0\beta > 0 is given by

p0(x;α,β)=βαxα1eβxΓ(α),x>0.p_0(x; \alpha, \beta) = \frac{\beta^\alpha x^{\alpha-1} e^{-\beta x}}{\Gamma(\alpha)}, \qquad x > 0.

The G-STG prior introduces a lower truncation L0L \geq 0 and upper truncation U<U < \infty, yielding the density

f(θ;α,β,L,U)=βαθα1eβθZ(α,β;L,U),LθU,f(\theta; \alpha, \beta, L, U) = \frac{\beta^\alpha \theta^{\alpha-1} e^{-\beta\theta}}{Z(\alpha, \beta; L, U)}, \qquad L \leq \theta \leq U,

where Z(α,β;L,U)=Γ(α,βL)Γ(α,βU)Z(\alpha, \beta; L, U) = \Gamma(\alpha, \beta L) - \Gamma(\alpha, \beta U) is the normalization constant and Γ(α,z)\Gamma(\alpha, z) is the upper incomplete gamma function. The G-STG prior is a member of the truncated Compound Confluent Hypergeometric (tCCH) family, a general class encompassing standard hyper-g, Beta-prime, and various robust priors (Li et al., 2015).

When used to regularize GLM coefficient shrinkage, the typical G-STG parameterization is over u=1/(1+g)u = 1/(1+g), where gg controls the scaling of the Zellner-style prior covariance. In canonical form for u(0,1)u \in (0,1),

p(u)=statγ(at,st)uat1estu,0<u<1,p(u) = \frac{s_t^{a_t}}{\gamma(a_t, s_t)} u^{a_t-1} e^{-s_t u}, \qquad 0 < u < 1,

with ata_t governing the tail behavior and sts_t the overall shrinkage (“unit information” scaling with sample size nn is common).

2. Induced Prior on Regularization Parameter and Change of Variables

Considering u=1/(1+g)u = 1/(1+g) with g>0g > 0, the transformation yields g=(1u)/ug = (1-u)/u and du/dg=1/(1+g)2=u2du/dg = -1/(1+g)^2 = -u^2. The induced prior density on gg is

p(g)=[stat/γ(at,st)](1+g)(at+1)est/(1+g).p(g) = [s_t^{a_t}/\gamma(a_t,s_t)] \cdot (1+g)^{-(a_t+1)} e^{-s_t/(1+g)}.

A customary “Gamma-mixing-in-g” representation follows from this derivation, but parameterization in terms of uu optimizes analytic marginal likelihood construction (Li et al., 2015).

3. Properties and Statistical Implications

Raw Moments

For θ[L,U]\theta \in [L,U],

E[θk]=Γ(α+k,βL)Γ(α+k,βU)βk[Γ(α,βL)Γ(α,βU)].\mathrm{E}[\theta^k] = \frac{\Gamma(\alpha + k, \beta L) - \Gamma(\alpha + k, \beta U)} {\beta^k \left[\Gamma(\alpha, \beta L) - \Gamma(\alpha, \beta U)\right]}.

Specifically,

E[θ]=Γ(α+1,βL)Γ(α+1,βU)β(Γ(α,βL)Γ(α,βU))\mathrm{E}[\theta] = \frac{\Gamma(\alpha+1, \beta L) - \Gamma(\alpha+1, \beta U)} {\beta (\Gamma(\alpha, \beta L) - \Gamma(\alpha, \beta U))}

with variance given by the usual M2(M1)2M_2 - (M_1)^2 formula (Zaninetti, 2014).

Normalization

The normalizing constant Z(α,β;L,U)Z(\alpha, \beta; L, U) can be equivalently written using lower incomplete gamma functions: Z(α,β;L,U)=γ(α,βU)γ(α,βL).Z(\alpha, \beta; L, U) = \gamma(\alpha, \beta U) - \gamma(\alpha, \beta L). This closed-form normalization admits efficient, numerically stable evaluation.

4. Bayesian Embedding and Computational Strategies

Prior Construction

When embedding as a Bayesian prior, L,UL, U reflect known bounds, while α,β\alpha, \beta are selected to match prior means and variances via the truncated gamma moment equations. Embedding as the prior for a GLM “g-prior” regularization factor, sts_t is scaled to ensure model selection and intrinsic consistency. Default choices at{1/2,1}a_t \in \{1/2, 1\}, st=ns_t = n (sample size) are recommended for their robust tail behavior and analytic tractability (Li et al., 2015).

Posterior Update

With likelihoods possessing a gamma kernel (e.g., Poisson, exponential models), the resulting posterior for θ\theta maintains the truncated gamma form, with updated parameters α=α+k\alpha' = \alpha + k and β=β+S\beta' = \beta + S. The only non-conjugate aspect is the new normalization.

Monte Carlo Sampling

Samples may be generated by rejection sampling from the untruncated gamma, accepting only those within [L,U][L,U]. Inverse-CDF sampling is more efficient: draw uUniform(0,1)u \sim \mathrm{Uniform}(0,1) and solve F(θ;α,β,L,U)u=0F(\theta; \alpha, \beta, L, U) - u = 0 using root-finding methods. These steps generalize naturally to Gibbs or Metropolis algorithms for hierarchical models (Zaninetti, 2014).

5. Application in Generalized Linear Models and Marginal Likelihoods

Within the GLM context, the G-STG prior on u=1/(1+g)u = 1/(1+g) enables analytic marginal likelihoods via Laplace approximation. If uTG(at,st)u \sim \mathrm{TG}(a_t, s_t),

p(yM)p(yα^,β^)J(α^)1/2statγ(at,st)γ(at+p/2,st+Q/2)(st+Q/2)at+p/2,p(y|M) \propto p(y|\hat\alpha, \hat\beta)|J(\hat\alpha)|^{-1/2} \frac{s_t^{a_t}}{\gamma(a_t,s_t)} \frac{\gamma(a_t+p/2,s_t+Q/2)}{(s_t + Q/2)^{a_t+p/2}},

where Q=β^Jβ^Q = \hat\beta^\top J \hat\beta is the observed Wald statistic. The Bayes factor comparing models follows in closed form, enabling Compound Hypergeometric Information Criteria (CHIC) as a straightforward generalization of well-known Bayesian criteria. CHIC is expressed as

2logp(yM)deviance(M)+logJ(α^)+plog(st+Q/2)2[]+const.-2\,\log\,p(y|M) \simeq \text{deviance}(M) + \log|J(\hat\alpha)| + p\log(s_t + Q/2) - 2[\ldots] + \text{const}.

6. Local Geometric Adaptation and Theoretical Justification

The G-STG prior inherits local geometric properties from the information metric of the GLM:

  • The prior covariance Cov(β)J1\mathrm{Cov}(\beta) \propto J^{-1} adapts to the curvature of the log-likelihood at the MLE, de-emphasizing directions with high information.
  • Measurement invariance is maintained, as transforms of XX scale the covariance equivariantly.
  • The prior volume element dVol=gp/2J1/2dβd\text{Vol} = g^{p/2} |J|^{-1/2} d\beta, integrated over gg, is finite for at>0a_t > 0, guaranteeing propriety.

Selection consistency is guaranteed by scaling st=O(n)s_t = \mathcal{O}(n), placing sufficient prior mass on large gg and ensuring Bayes factors behave correctly under any fixed alternative or under the null. Intrinsic consistency follows as p(u)p(u) remains diffuse with increasing nn. For estimation, posterior shrinkage converges so that posterior means approach the MLE, preserving unbiasedness in large samples.

Defaults are:

  • at=1/2a_t = 1/2 or $1$ for robust heavy-tailed behavior.
  • st=ns_t = n for just-identifiable shrinkage and model selection consistency.
  • uTG(at,st), at{1/2,1}, st=nu \sim \mathrm{TG}(a_t, s_t),\ a_t \in \{1/2,1\},\ s_t = n yields closed-form γ\gamma-function Bayes factors, robust inference, low computational overhead, and satisfaction of all Bayarri et al. desiderata (Li et al., 2015).

In high-dimensional applications, at=1a_t = 1 is slightly more stable for sparse signals; at=1/2a_t = 1/2 improves prediction with moderate signals. The analyst can tune ata_t and sts_t for application-specific tail and shrinkage regimes.

8. Practical Examples and Performance

In astronomical data modeling (e.g., stellar mass functions), the right–left truncated gamma (G-STG) dramatically outperforms lognormal and four-power–law models in χ2\chi^2, AIC, and Kolmogorov–Smirnov criteria. For example, in NGC 6611 with L=0.019ML = 0.019\,M_\odot, U=1.36MU = 1.36\,M_\odot, fit yields α1.287\alpha \approx 1.287, β2.69\beta \approx 2.69; for NGC 2362, α3.933\alpha \approx 3.933, β6.21\beta \approx 6.21 (Zaninetti, 2014).

The G-STG prior thus offers closed-form recipes (PDF, CDF, moments, normalization), efficient parameter estimation, and Bayesian conjugate embedding, facilitating robust model selection and prediction in a variety of scientific inference applications.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (2)
Forward Email Streamline Icon: https://streamlinehq.com

Follow Topic

Get notified by email when new papers are published related to Shifted-Truncated-Gamma (G-STG) Prior.