Warped Linear Prediction for Count Time Series
- Warped Linear Prediction is a semiparametric approach that extends Gaussian DLMs by applying a transformation and rounding mechanism to generate count-valued observations.
- It achieves analytic conjugate inference through a selection-normal framework, supporting both batch and online estimation with efficient recursive updates.
- The method rigorously handles exact count support, including zero-inflation and multivariate counts, while offering flexibility via nonparametric transformation options.
Warped Linear Prediction (WLP) is a semiparametric methodology that extends Gaussian Dynamic Linear Models (DLMs) to time series of counts by introducing a two-step warping mechanism—a flexible transformation followed by a rounding operator. This approach enables analytic conjugate inference for count-valued time series, supporting both batch and online algorithms, and yields a framework that unifies and extends a variety of discrete time series models while retaining probabilistic and recursive updating properties (King et al., 2021).
1. Gaussian Dynamic Linear Model Foundation
The backbone of WLP is the Gaussian DLM, specified in state-space form for a latent continuous process :
- State Evolution: , with
- Observation Model: , with
- Prior Distribution:
Here, is the latent state, is the latent "observation," and are known design matrices, and , are known or estimated covariance matrices. In standard form, the Kalman filter and smoother yield closed-form recursive updates for the filtered and smoothed posterior moments of .
2. Warping Procedure: Transformation and Rounding
WLP introduces a two-step warping of the latent Gaussian sequence to generate count-valued observations :
- Transformation : A continuous, strictly monotonic operator . Options include parametric forms (e.g., log, sqrt, identity) or a nonparametric construction via the empirical CDF of :
with smooth interpolation to ensure monotonicity.
- Rounding : A discrete operator , typically the component-wise floor with a zero-modification:
- if ,
- otherwise.
The observed count is then:
The design admits exact count support, easily accommodating zero-inflation, upper bounds, and various marginal distributions.
3. Conjugate Analytic Inference via Selection-Normal Family
Conditioning on a count is equivalent to conditioning on a truncation event in latent space: . Under this construction, all marginal and joint posteriors over latent states, such as and , fall within the Selection-Normal (SLCT-N) family:
The SLCT-N density is proportional to
where is the -dimensional normal density and is the truncated cumulative normal. The filtering and smoothing recursions for state prediction, filtering update, and joint smoothing maintain SLCT-N conjugacy with closed-form parameter updates.
4. Batch and Online Computational Algorithms
Algorithmic inference in WLP leverages the SLCT-N structure for both batch (offline) and online (streaming) settings:
- Direct Monte Carlo (MC) Sampling: For batch analysis, draws are generated from SLCT-N by:
- Sampling , truncated to ,
- Sampling ,
- Setting , which yields exact draws from the desired selection-normal distribution.
Optimal Particle Filter: For online inference, at each time and for each particle , a new particle is drawn from
with weights proportional to . This choice of proposal minimizes weight variance, enhancing online filter efficiency.
5. Forecasting and Predictive Distribution
WLP supports both analytic and simulation-based forecasting for count time series:
- Analytic One-Step Forecast: The one-step predictive likelihood is given by the ratio of model marginal probabilities:
where parameters come from joint smoothing up to . This is evaluated over a local grid of values.
- Simulation-Based Forecast: Posterior samples are propagated forward:
- ,
- ,
- , constructing an empirical forecast distribution.
6. Strengths and Practical Limitations
WLP offers several key advantages:
- Data Coherency: has exact count support, with direct handling of zero-inflation, discrete bounds, and heaping.
- Flexible Marginals: The transformation can be nonparametric, supporting semiparametric modeling.
- Analytic Recursions: Selection-Normal conjugacy allows closed-form updating for filtering, smoothing, and likelihood.
- Multivariate and Missing Data Handling: Direct support for multivariate counts (), missingness, and mixed-frequency data.
- Computation: Algorithms are suitable for both offline (MC, batch Gibbs) and online (optimal particle filter) settings with minimal reliance on traditional MCMC.
Limitations include:
- High-Dimensionality: Sampling from large-dimensional truncated normals may become a computational bottleneck.
- Growing Filter Dimension: Naive filtering leads to latent block dimension growth as increases, requiring attention to algorithm scalability (e.g., marginal or particle implementations).
- Transformation Estimation: Nonparametric requires (possibly repeated) estimation—especially challenging in streaming contexts.
- Variance Estimation: Covariances , are typically estimated via marginal likelihood maximization or modeled as hyperpriors, adding a layer of inference.
In sum, Warped Linear Prediction applies a semiparametric warping to the Gaussian DLM, yielding a coherent, conjugate, and flexible framework for count time series modeling that preserves the structural and computational benefits of Kalman-type models while ensuring respect for the discreteness of observed data (King et al., 2021).