Projected Normal Distribution
- Projected Normal Distribution is a model for circular data that projects a multivariate normal onto a unit circle or sphere, capturing both symmetric and asymmetric features.
- It employs nonlinear change-of-variable techniques and latent variable methods to estimate parameters, accommodating dependencies in circular-linear datasets.
- Its applications span meteorology, animal movement, and geology, providing interpretable insights into mean direction and data spread while addressing non-identifiability issues.
The projected normal distribution is a probabilistic model for circular (directional or angular) data, constructed by projecting a multivariate normal distribution onto the unit sphere or circle. It is widely employed in directional statistics, including modeling of wind directions, biological orientation data, geological features, and, more generally, for poly-cylindrical (mixed circular-linear) data where standard linear models fail to capture the geometry of the sample space.
1. Mathematical Formulation and Construction
The projected normal distribution arises by projecting a multivariate normal random vector onto the unit sphere or unit circle. Consider the 2-dimensional case, which is prototypical for angular variables in or for modeling a direction on the circle:
Let , where . Define the projected angle as: Here, denotes the quadrant-corrected arctangent. The distribution of is termed the two-dimensional projected normal distribution, often denoted , where and
The density of is induced from the joint density via nonlinear change of variables from to , . The marginal density of is then obtained by integrating over the hidden : where .
For the multivariate case, the projected normal for angles is defined analogously by projecting a $2p$-variate normal vector onto circles:
This framework generalizes naturally to the -sphere for data in , where the projection to the sphere is .
2. Distributional Properties and Statistical Features
The projected normal distribution possesses several desirable properties for modeling circular data:
- Flexibility: Capable of representing symmetric, asymmetric, unimodal, or bimodal circular densities.
- Parameter Interpretability: For the 2D case, the parameters and relate to the mean direction (modulo scale), while and control spread and shape.
- Dependence structure: In the multivariate setting, the covariance matrix fully captures the dependencies among angular variables. Arbitrary dependence structures can be accommodated.
- Rotation/reflection invariance: The marginal distribution on the angle is invariant to rotations or reflections of the coordinate axes.
- Closure under marginalization: Any subset of the angles retains the projected normal form.
Non-identifiability arises because scaling a normal random vector does not change the projected angle. Therefore, constraints (e.g., unit variance for a specific component) are imposed to ensure identifiability in estimation procedures.
3. Projected Normal Distribution for Poly-Cylindrical and Mixed Data
The projected normal serves as the foundational building block for models involving circular-linear and poly-cylindrical data. Within this context:
- Circular-linear General Projected Normal (CL-GPN): Augments the projected normal with conditional (often Gaussian) models for associated linear variables, permitting regimes with jointly distributed circular and linear measurements. For the -th regime in a Hidden Markov Model (HMM), the joint emission distribution is
where is circular, linear, is the latent radius, and .
- Joint Projected Normal and Skew-Normal (JPSN) (1711.10463): Extends the projected normal for use in higher-dimensional mixed circular-linear-vector settings by coupling with a multivariate skew-normal for the linear part. The full model integrates latent variables for efficient posterior sampling and supports arbitrary dependence among all variable types.
4. Estimation and Computational Considerations
Inference for the projected normal and its joint extensions typically relies on Bayesian methods using data augmentation with latent radii and (for skew-normal extensions) skewness variables:
- Gibbs Sampling: Latent variables (radii and skewness) are introduced to retain conditionally normal likelihoods, facilitating joint updates of mean vectors and covariance matrices via conjugate priors.
- Non-identifiability resolution: Sampling is performed in an unconstrained space, with post-processing of posterior draws via linear transformations to enforce constraints (e.g., variance normalization).
- Numerical Integration: For likelihoods involving marginalized latent radii, either analytical approximations or numerical integration techniques are used when closed-form expressions are unavailable (as is generally the case for ).
- Computational Effort: For instance, fitting HMMs with circular-linear GPN emissions in long time series (e.g., ) may require several hours but yields stable convergence when implemented with efficient MCMC (occasionally using adaptive Metropolis steps for select parameters).
5. Parameter Estimation from Projected Data
For the case of isotropic normal distributions projected onto spheres (2410.22384), intrinsic statistics can be exploited:
- Mean estimation: The projected intrinsic mean (Fréchet mean) on the sphere commutes with the projection for isotropic covariance, i.e., the mean direction estimated on the sphere coincides with the projection of the ambient-space mean.
- Variance estimation: There exists a one-to-one correspondence between the normalized variance in the ambient normal and the intrinsic covariance of the projected data; estimation is effected by inverting an explicit bijection linking the two. Only the mean direction and the variance-to-mean ratio are identifiable from sphere-projected observations; separate recovery of and is not possible.
Estimation proceeds via computation of the empirical Fréchet mean and empirical intrinsic covariance, followed by numerical inversion to solve for the normalized variance. Consistency of estimators and convergence rates are established under regularity conditions.
6. Applications and Extensions
The projected normal distribution is extensively employed in:
- Hidden Markov Models with circular-linear observations: As demonstrated in wind regime classification, where wind direction (circular) and speed (linear) are modeled jointly, allowing dependence between variables.
- Animal movement analysis: As in joint modeling of zebras’ step-lengths (linear) and turning angles (circular) via the JPSN distribution, with improved recovery of joint dependence structure over cylindrical (independent) models.
- Directional statistics: Applications span meteorology (wind and wave modeling), geology, structural biology, robotics, and wireless communication (e.g., spherical/antenna orientation statistics).
- Spherical data parameter recovery: The projected normal framework allows consistent recovery of underlying normal distribution parameters when only directions are observed, a scenario common in various applied domains.
7. Key Formulas and Table
Aspect | Formula / Description |
---|---|
Projected Normal (2D) | ; |
Marginal Joint Density | |
Intrinsic Covariance Link | , bijective; estimate from projected covariance |
JPSN Joint Density (aug) |
These methodological developments enable flexible, robust, and interpretable modeling and inference for a wide range of problems in which standard Euclidean or cylindrical distributions are inadequate due to the rich geometry of the sample spaces involved.