Shifted-Truncated Gamma (G-STG) Prior

Updated 16 November 2025

The Shifted-Truncated-Gamma (G-STG) prior is a finite-interval modification of the gamma distribution that enables effective shrinkage and variable selection in high-dimensional GLMs.
It applies truncation and shifting to achieve analytic tractability, closed-form normalization, and robust moment computations essential for precise Bayesian inference.
The prior supports efficient sampling and Laplace approximations, enhancing model selection consistency and predictive performance across various scientific applications.

The Shifted-Truncated-Gamma (G-STG) prior is a finite-interval modification of the classical gamma distribution, intensively utilized in Bayesian model selection for Generalized Linear Models (GLMs) to parameterize shrinkage or regularization factors. This prior is defined by truncating the support of the gamma distribution to a bounded domain and optionally shifting its lower bound. The G-STG prior is notable for its analytic tractability, local geometric adaptation to model curvature, and satisfaction of essential Bayesian consistency desiderata, rendering it particularly effective for variable selection and model averaging tasks in high-dimensional inference settings.

1. Definition and Mathematical Formulation

The untruncated gamma distribution with shape parameter $\alpha > 0$ and rate parameter $\beta > 0$ is given by

$p_0(x; \alpha, \beta) = \frac{\beta^\alpha x^{\alpha-1} e^{-\beta x}}{\Gamma(\alpha)}, \qquad x > 0.$

The G-STG prior introduces a lower truncation $L \geq 0$ and upper truncation $U < \infty$ , yielding the density

$f(\theta; \alpha, \beta, L, U) = \frac{\beta^\alpha \theta^{\alpha-1} e^{-\beta\theta}}{Z(\alpha, \beta; L, U)}, \qquad L \leq \theta \leq U,$

where $Z(\alpha, \beta; L, U) = \Gamma(\alpha, \beta L) - \Gamma(\alpha, \beta U)$ is the normalization constant and $\Gamma(\alpha, z)$ is the upper incomplete gamma function. The G-STG prior is a member of the truncated Compound Confluent Hypergeometric (tCCH) family, a general class encompassing standard hyper-g, Beta-prime, and various robust priors (Li et al., 2015).

When used to regularize GLM coefficient shrinkage, the typical G-STG parameterization is over $u = 1/(1+g)$ , where $g$ controls the scaling of the Zellner-style prior covariance. In canonical form for $u \in (0,1)$ ,

$p(u) = \frac{s_t^{a_t}}{\gamma(a_t, s_t)} u^{a_t-1} e^{-s_t u}, \qquad 0 < u < 1,$

with $a_t$ governing the tail behavior and $s_t$ the overall shrinkage (“unit information” scaling with sample size $n$ is common).

2. Induced Prior on Regularization Parameter and Change of Variables

Considering $u = 1/(1+g)$ with $g > 0$ , the transformation yields $g = (1-u)/u$ and $du/dg = -1/(1+g)^2 = -u^2$ . The induced prior density on $g$ is

$p(g) = [s_t^{a_t}/\gamma(a_t,s_t)] \cdot (1+g)^{-(a_t+1)} e^{-s_t/(1+g)}.$

A customary “Gamma-mixing-in-g” representation follows from this derivation, but parameterization in terms of $u$ optimizes analytic marginal likelihood construction (Li et al., 2015).

3. Properties and Statistical Implications

Raw Moments

For $\theta \in [L,U]$ ,

$\mathrm{E}[\theta^k] = \frac{\Gamma(\alpha + k, \beta L) - \Gamma(\alpha + k, \beta U)} {\beta^k \left[\Gamma(\alpha, \beta L) - \Gamma(\alpha, \beta U)\right]}.$

Specifically,

$\mathrm{E}[\theta] = \frac{\Gamma(\alpha+1, \beta L) - \Gamma(\alpha+1, \beta U)} {\beta (\Gamma(\alpha, \beta L) - \Gamma(\alpha, \beta U))}$

with variance given by the usual $M_2 - (M_1)^2$ formula (Zaninetti, 2014).

Normalization

The normalizing constant $Z(\alpha, \beta; L, U)$ can be equivalently written using lower incomplete gamma functions: $Z(\alpha, \beta; L, U) = \gamma(\alpha, \beta U) - \gamma(\alpha, \beta L).$ This closed-form normalization admits efficient, numerically stable evaluation.

4. Bayesian Embedding and Computational Strategies

Prior Construction

When embedding as a Bayesian prior, $L, U$ reflect known bounds, while $\alpha, \beta$ are selected to match prior means and variances via the truncated gamma moment equations. Embedding as the prior for a GLM “g-prior” regularization factor, $s_t$ is scaled to ensure model selection and intrinsic consistency. Default choices $a_t \in \{1/2, 1\}$ , $s_t = n$ (sample size) are recommended for their robust tail behavior and analytic tractability (Li et al., 2015).

Posterior Update

With likelihoods possessing a gamma kernel (e.g., Poisson, exponential models), the resulting posterior for $\theta$ maintains the truncated gamma form, with updated parameters $\alpha' = \alpha + k$ and $\beta' = \beta + S$ . The only non-conjugate aspect is the new normalization.

Monte Carlo Sampling

Samples may be generated by rejection sampling from the untruncated gamma, accepting only those within $[L,U]$ . Inverse-CDF sampling is more efficient: draw $u \sim \mathrm{Uniform}(0,1)$ and solve $F(\theta; \alpha, \beta, L, U) - u = 0$ using root-finding methods. These steps generalize naturally to Gibbs or Metropolis algorithms for hierarchical models (Zaninetti, 2014).

5. Application in Generalized Linear Models and Marginal Likelihoods

Within the GLM context, the G-STG prior on $u = 1/(1+g)$ enables analytic marginal likelihoods via Laplace approximation. If $u \sim \mathrm{TG}(a_t, s_t)$ ,

$p(y|M) \propto p(y|\hat\alpha, \hat\beta)|J(\hat\alpha)|^{-1/2} \frac{s_t^{a_t}}{\gamma(a_t,s_t)} \frac{\gamma(a_t+p/2,s_t+Q/2)}{(s_t + Q/2)^{a_t+p/2}},$

where $Q = \hat\beta^\top J \hat\beta$ is the observed Wald statistic. The Bayes factor comparing models follows in closed form, enabling Compound Hypergeometric Information Criteria (CHIC) as a straightforward generalization of well-known Bayesian criteria. CHIC is expressed as

$-2\,\log\,p(y|M) \simeq \text{deviance}(M) + \log|J(\hat\alpha)| + p\log(s_t + Q/2) - 2[\ldots] + \text{const}.$

6. Local Geometric Adaptation and Theoretical Justification

The G-STG prior inherits local geometric properties from the information metric of the GLM:

The prior covariance $\mathrm{Cov}(\beta) \propto J^{-1}$ adapts to the curvature of the log-likelihood at the MLE, de-emphasizing directions with high information.
Measurement invariance is maintained, as transforms of $X$ scale the covariance equivariantly.
The prior volume element $d\text{Vol} = g^{p/2} |J|^{-1/2} d\beta$ , integrated over $g$ , is finite for $a_t > 0$ , guaranteeing propriety.

7. Model Selection Consistency, Recommended Defaults, and Empirical Behavior

Selection consistency is guaranteed by scaling $s_t = \mathcal{O}(n)$ , placing sufficient prior mass on large $g$ and ensuring Bayes factors behave correctly under any fixed alternative or under the null. Intrinsic consistency follows as $p(u)$ remains diffuse with increasing $n$ . For estimation, posterior shrinkage converges so that posterior means approach the MLE, preserving unbiasedness in large samples.

Defaults are:

$a_t = 1/2$ or $1$ for robust heavy-tailed behavior.
$s_t = n$ for just-identifiable shrinkage and model selection consistency.
$u \sim \mathrm{TG}(a_t, s_t),\ a_t \in \{1/2,1\},\ s_t = n$ yields closed-form $\gamma$ -function Bayes factors, robust inference, low computational overhead, and satisfaction of all Bayarri et al. desiderata (Li et al., 2015).

In high-dimensional applications, $a_t = 1$ is slightly more stable for sparse signals; $a_t = 1/2$ improves prediction with moderate signals. The analyst can tune $a_t$ and $s_t$ for application-specific tail and shrinkage regimes.

8. Practical Examples and Performance

In astronomical data modeling (e.g., stellar mass functions), the right–left truncated gamma (G-STG) dramatically outperforms lognormal and four-power–law models in $\chi^2$ , AIC, and Kolmogorov–Smirnov criteria. For example, in NGC 6611 with $L = 0.019\,M_\odot$ , $U = 1.36\,M_\odot$ , fit yields $\alpha \approx 1.287$ , $\beta \approx 2.69$ ; for NGC 2362, $\alpha \approx 3.933$ , $\beta \approx 6.21$ (Zaninetti, 2014).

The G-STG prior thus offers closed-form recipes (PDF, CDF, moments, normalization), efficient parameter estimation, and Bayesian conjugate embedding, facilitating robust model selection and prediction in a variety of scientific inference applications.

PDF Markdown Chat (Pro)

References (2)

Mixtures of g-priors in Generalized Linear Models (2015)

A right and left truncated gamma distribution with application to the stars (2014)

Follow Topic

Get notified by email when new papers are published related to Shifted-Truncated-Gamma (G-STG) Prior.