Papers
Topics
Authors
Recent
Search
2000 character limit reached

Warped Linear Prediction for Count Time Series

Updated 18 February 2026
  • Warped Linear Prediction is a semiparametric approach that extends Gaussian DLMs by applying a transformation and rounding mechanism to generate count-valued observations.
  • It achieves analytic conjugate inference through a selection-normal framework, supporting both batch and online estimation with efficient recursive updates.
  • The method rigorously handles exact count support, including zero-inflation and multivariate counts, while offering flexibility via nonparametric transformation options.

Warped Linear Prediction (WLP) is a semiparametric methodology that extends Gaussian Dynamic Linear Models (DLMs) to time series of counts by introducing a two-step warping mechanism—a flexible transformation followed by a rounding operator. This approach enables analytic conjugate inference for count-valued time series, supporting both batch and online algorithms, and yields a framework that unifies and extends a variety of discrete time series models while retaining probabilistic and recursive updating properties (King et al., 2021).

1. Gaussian Dynamic Linear Model Foundation

The backbone of WLP is the Gaussian DLM, specified in state-space form for a latent continuous process {zt}\{z_t\}:

  • State Evolution: xt=Gtxt1+ωtx_t = G_t\, x_{t-1} + \omega_t, with ωtNp(0,Wt)\omega_t \sim N_p(0, W_t)
  • Observation Model: zt=Ftxt+νtz_t = F_t^\top x_t + \nu_t, with νtNn(0,Vt)\nu_t \sim N_n(0, V_t)
  • Prior Distribution: x0Np(a0,R0)x_0 \sim N_p(a_0, R_0)

Here, xtRpx_t \in \mathbb{R}^p is the latent state, ztRnz_t \in \mathbb{R}^n is the latent "observation," FtF_t and GtG_t are known design matrices, and VtV_t, WtW_t are known or estimated covariance matrices. In standard form, the Kalman filter and smoother yield closed-form recursive updates for the filtered and smoothed posterior moments of xtx_t.

2. Warping Procedure: Transformation and Rounding

WLP introduces a two-step warping of the latent Gaussian sequence {zt}\{z_t\} to generate count-valued observations ytNny_t \in \mathbb{N}^n:

  1. Transformation TT: A continuous, strictly monotonic operator T:RnRnT:\mathbb{R}^n \to \mathbb{R}^n. Options include parametric forms (e.g., log, sqrt, identity) or a nonparametric construction via the empirical CDF of yy:

T^(z)=yˉ+s^yΦ1(F~y(j))for z[j,j+1),\widehat T(z) = \bar y + \hat s_y\, \Phi^{-1}(\tilde F_y(j)) \quad \text{for } z \in [j, j+1),

with smooth interpolation to ensure monotonicity.

  1. Rounding RR: A discrete operator R:RnNnR:\mathbb{R}^n \to \mathbb{N}^n, typically the component-wise floor with a zero-modification:
    • R(w)=0R(w) = 0 if w<0w < 0,
    • R(w)=wR(w) = \lfloor w \rfloor otherwise.

The observed count is then:

yt=R(T(Ftxt+νt)).y_t = R\left(T\left(F_t^\top x_t + \nu_t\right)\right).

The design admits exact count support, easily accommodating zero-inflation, upper bounds, and various marginal distributions.

3. Conjugate Analytic Inference via Selection-Normal Family

Conditioning on a count yty_t is equivalent to conditioning on a truncation event in latent space: ztCt=T1(R1(yt))z_t \in C_t = T^{-1}(R^{-1}(y_t)). Under this construction, all marginal and joint posteriors over latent states, such as p(xty1:t)p(x_t \mid y_{1:t}) and p(x1:Ty1:T)p(x_{1:T} \mid y_{1:T}), fall within the Selection-Normal (SLCT-N) family:

[xzC]SLCT-Nn,p(μz,μx,Σz,Σx,Σzx,C).[x \mid z \in C] \sim \mathrm{SLCT\text{-}N}_{n,p}(\mu_z, \mu_x, \Sigma_z, \Sigma_x, \Sigma_{z\,x}, C).

The SLCT-N density is proportional to

ϕp(x;μx,Σx)Φn(C;μz+ΣzxΣx1(xμx),ΣzΣzxΣx1Σzx),\phi_p(x; \mu_x, \Sigma_x)\, \overline\Phi_n(C; \mu_z + \Sigma_{z\,x} \Sigma_x^{-1}(x - \mu_x), \Sigma_z - \Sigma_{z\,x} \Sigma_x^{-1} \Sigma_{z\,x}^\top),

where ϕp\phi_p is the pp-dimensional normal density and Φn\overline\Phi_n is the truncated cumulative normal. The filtering and smoothing recursions for state prediction, filtering update, and joint smoothing maintain SLCT-N conjugacy with closed-form parameter updates.

4. Batch and Online Computational Algorithms

Algorithmic inference in WLP leverages the SLCT-N structure for both batch (offline) and online (streaming) settings:

  • Direct Monte Carlo (MC) Sampling: For batch analysis, draws are generated from SLCT-N by:

    1. Sampling V0Nd1(0,Σz)V_0 \sim N_{d_1}(0, \Sigma_z), truncated to CμzC - \mu_z,
    2. Sampling V1Nd2(0,ΣxΣzxΣz1Σzx)V_1 \sim N_{d_2}(0, \Sigma_x - \Sigma_{z\,x}^\top \Sigma_z^{-1} \Sigma_{z\,x}),
    3. Setting x=μx+V1+ΣzxΣz1V0x = \mu_x + V_1 + \Sigma_{z\,x}^\top \Sigma_z^{-1} V_0, which yields exact draws from the desired selection-normal distribution.
  • Optimal Particle Filter: For online inference, at each time tt and for each particle xt1(s)x_{t-1}^{(s)}, a new particle is drawn from

q(xtxt1,yt)=p(xtxt1,yt)=SLCT-Nn,p(,Ct),q(x_t \mid x_{t-1}, y_t) = p(x_t \mid x_{t-1}, y_t) = \mathrm{SLCT\text{-}N}_{n,p}(\dots, C_t),

with weights proportional to Φn(Ct;FtGtxt1(s),Vt+FtWtFt)\overline\Phi_n(C_t; F_t G_t x_{t-1}^{(s)}, V_t + F_t W_t F_t^\top). This choice of proposal minimizes weight variance, enhancing online filter efficiency.

5. Forecasting and Predictive Distribution

WLP supports both analytic and simulation-based forecasting for count time series:

  • Analytic One-Step Forecast: The one-step predictive likelihood is given by the ratio of model marginal probabilities:

p(yt+1y1:t)=p(y1:t+1)p(y1:t)=Φn(t+1)(C1:t×Ct+1;μz,Σz)Φnt(C1:t;μz,Σz),p(y_{t+1} \mid y_{1:t}) = \frac{p(y_{1:t+1})}{p(y_{1:t})} = \frac{\overline\Phi_{n(t+1)}(C_{1:t} \times C_{t+1}; \mu_z, \Sigma_z)}{\overline\Phi_{n t}(C_{1:t}; \mu'_z, \Sigma'_z)},

where parameters come from joint smoothing up to t+1t+1. This is evaluated over a local grid of yy values.

  • Simulation-Based Forecast: Posterior samples xt(s)x_t^{(s)} are propagated forward:
    1. xt+1N(Gt+1xt(s),Wt+1)x_{t+1} \sim N(G_{t+1}x_t^{(s)}, W_{t+1}),
    2. zt+1N(Ft+1xt+1,Wt+1)z_{t+1} \sim N(F_{t+1}x_{t+1}, W_{t+1}),
    3. yt+1(s)=R(T(zt+1(s)))y_{t+1}^{(s)} = R(T(z_{t+1}^{(s)})), constructing an empirical forecast distribution.

6. Strengths and Practical Limitations

WLP offers several key advantages:

  • Data Coherency: yty_t has exact count support, with direct handling of zero-inflation, discrete bounds, and heaping.
  • Flexible Marginals: The transformation T()T(\cdot) can be nonparametric, supporting semiparametric modeling.
  • Analytic Recursions: Selection-Normal conjugacy allows closed-form updating for filtering, smoothing, and likelihood.
  • Multivariate and Missing Data Handling: Direct support for multivariate counts (Nn\mathbb{N}^n), missingness, and mixed-frequency data.
  • Computation: Algorithms are suitable for both offline (MC, batch Gibbs) and online (optimal particle filter) settings with minimal reliance on traditional MCMC.

Limitations include:

  • High-Dimensionality: Sampling from large-dimensional truncated normals may become a computational bottleneck.
  • Growing Filter Dimension: Naive filtering leads to latent block dimension growth as tt increases, requiring attention to algorithm scalability (e.g., marginal or particle implementations).
  • Transformation Estimation: Nonparametric TT requires (possibly repeated) estimation—especially challenging in streaming contexts.
  • Variance Estimation: Covariances WtW_t, VtV_t are typically estimated via marginal likelihood maximization or modeled as hyperpriors, adding a layer of inference.

In sum, Warped Linear Prediction applies a semiparametric warping to the Gaussian DLM, yielding a coherent, conjugate, and flexible framework for count time series modeling that preserves the structural and computational benefits of Kalman-type models while ensuring respect for the discreteness of observed data (King et al., 2021).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Warped Linear Prediction (WLP).