Recurrence Period Density Entropy (RPDE)
- RPDE is a nonparametric measure that quantifies dynamic complexity by embedding time series and analyzing recurrence time distributions.
- It computes the normalized Shannon entropy of the empirical recurrence period density, distinguishing periodic, chaotic, and noise-like behaviors.
- RPDE is applied in biomedical contexts, aiding in the detection of anomalies in voice signals and cardiac dynamics by contrasting healthy and pathological states.
Recurrence Period Density Entropy (RPDE) is a nonparametric, time series–based quantifier of dynamic complexity, originally introduced to objectively characterize aperiodicity and irregularity in signals exhibiting nonstationary or nonlinear dynamics. It is constructed by embedding the scalar time series in a reconstructed phase space, extracting all first recurrence times to a local neighborhood (the “recurrence period”), constructing their empirical probability density function, and then measuring the normalized Shannon entropy of this distribution. RPDE has been demonstrated to have diagnostic utility in biomedical applications, notably for distinguishing pathological from normal voice signals and differentiating healthy from pathological cardiac dynamics by quantifying the degree of deterministic versus stochastic structure in the phase-space trajectories (0707.0086, Mukherjee et al., 2015).
1. Formal Definition and Computational Workflow
Given a scalar time series , , RPDE is computed via the following steps:
- Phase-Space Embedding:
Each point is embedded as a delay vector in :
where is the embedding dimension and the embedding delay.
- Recurrence Times Extraction:
For each , identify the smallest such that
where is the recurrence threshold and denotes Euclidean norm. The resulting is the recurrence time .
- Recurrence-Period Density:
The histogram of observed recurrence times is normalized:
with a finite cutoff, typically chosen to encompass all relevant recurrence intervals.
- Entropy Calculation:
The Shannon entropy of is computed:
Normalization yields
with by construction.
Low () is indicative of nearly periodic or regular dynamics, while high () signals highly aperiodic, noise-like, or stochastic behavior (0707.0086, Mukherjee et al., 2015).
2. Parameter Selection and Practical Considerations
The robustness of RPDE depends critically on selecting appropriate embedding parameters . Recommended procedures include:
- Time Delay ():
Select via the first local minimum of the Average Mutual Information (AMI):
- Embedding Dimension ():
Use False Nearest Neighbors (FNN), increasing until the fraction of false neighbors drops to near zero.
- Recurrence Threshold ():
Typically set as a small fraction (e.g., $0.1$) of the time series standard deviation; values in have been used upon normalization to .
- Binning ():
Chosen to capture all physiologically or physically relevant recurrence times, e.g., .
Values found effective include , , for voice signals (0707.0086); for ECG, –$9$ and –$30$ depending on health status (Mukherjee et al., 2015).
3. Interpretative Framework
RPDE provides a scalar representation of time series complexity and regularity. Two reference cases are:
- Perfectly periodic signals:
is a Kronecker delta, .
- Purely stochastic/i.i.d. noise:
is nearly uniform, .
Intermediate values reflect varying degrees of chaoticity. Empirical studies demonstrate that regular/periodic (e.g., healthy heart or normal voice) signals yield significantly lower than pathological or noise-corrupted signals (0707.0086, Mukherjee et al., 2015).
The following table illustrates typical interpretative ranges:
| Range | Signal Type | Example Condition |
|---|---|---|
| Periodic / Regular | Type I voice, healthy ECG | |
| Chaotic / Complex | Type II voice, chaotic heart | |
| Noise-Like / Irregular | Disordered voice, pathological ECG |
In practical application, thresholds can be defined by empirical distributions, such as for healthy heart vs. for congestive heart failure (Mukherjee et al., 2015).
4. Windowed and Temporal Analysis
To accommodate nonstationary or evolving systems, RPDE can be adapted to a windowed approach:
- The time series is divided into contiguous, nonoverlapping windows of length .
- RPDE is computed independently within each window, yielding for window .
- The mean or trajectory of characterizes temporal variation in complexity.
For biomedical signals such as ECG, window length samples is employed, and windows encompass the record. This averaging procedure preserves discrimination between classes (e.g., normal vs. pathological) while enabling time-resolved assessment (Mukherjee et al., 2015).
5. Biomedical and Applied Usage
RPDE has been validated in several biomedical contexts:
- Voice Pathology Detection:
On a dataset of sustained vowels, RPDE (paired with detrended fluctuation analysis in a quadratic discriminant) achieved overall classification accuracy, with true positive and true negative rates, surpassing traditional metrics (jitter, shimmer, HNR) (0707.0086). RPDE values are robust to non-Gaussian noise and highly nonlinear oscillatory phenomena characteristic of disordered vocal fold vibration.
- Cardiac Complexity Analysis:
In ECG applications, windowed RPDE reveals lower complexity (lower ) in congestive heart failure compared to healthy heart dynamics. Empirically, in healthy subjects and in heart failure, with a discriminative gap of (Mukherjee et al., 2015).
A plausible implication is that RPDE, by measuring average entropic complexity over attractor reconstructions, is sensitive to the shift from deterministic to stochastic dynamical regimes in physiological systems.
6. Limitations and Theoretical Distinction
RPDE is to be distinguished from universal typical-signal entropy estimators based on recurrence/waiting time statistics in symbolic dynamics, as in the context of nonequilibrium statistical mechanics and information theory (Cristadoro et al., 2022). In those works, estimators rely on asymptotic scaling of integer-valued recurrence times and their logarithmic rates, not on constructing return-time density functions or their associated Shannon entropies. Thus RPDE is not directly related to the Shannon entropy rate via such return-time scaling limits; it instead operates as an instantaneous, empirical entropy of distributional recurrence statistics. No directly comparable bounds or inequalities relating RPDE to are established.
RPDE’s effective domain remains continuous-valued or real-world signals requiring nonlinear dynamical analysis, rather than discrete-alphabet processes or symbolic shifts (0707.0086, Mukherjee et al., 2015, Cristadoro et al., 2022).
7. Summary Table: Core Steps and Distinctions
| Step | Continuous-State RPDE (0707.0086, Mukherjee et al., 2015) | Symbolic Recurrence Time Estimators (Cristadoro et al., 2022) |
|---|---|---|
| Embedding | Required (phase-space reconstruction) | Not used |
| Recurrence Stat | Empirical recurrence time distribution / | First return time (integer sequence) |
| Entropic Measure | Shannon entropy of ("RPDE") | Asymptotic logarithmic scaling () |
| Domain | Real-valued signals, biomedical/chaotic time series | Symbolic sequences, ergodic processes |
The combination of phase-space topology and statistical entropy distinguishes RPDE as a diagnostic and analytic tool in nonlinear signal analysis—particularly in biomedical applications where quantifying the complexity of oscillatory and irregular physiological signals is essential.