Papers
Topics
Authors
Recent
Search
2000 character limit reached

Bandt–Pompe Symbolization Methodology

Updated 28 June 2026
  • Bandt–Pompe symbolization is a rank-based method that converts time series data into discrete ordinal patterns, facilitating robust complexity quantification.
  • It constructs overlapping delay vectors and maps them to permutations, making the approach resilient to measurement noise and monotonic transformations.
  • The method is widely applied in fields like biomedical signal processing and nonlinear dynamics, supporting advanced analyses of complex systems.

The Bandt–Pompe symbolization methodology is a rank-based approach for transforming real-valued time series into symbolic sequences of ordinal patterns, facilitating the quantitative characterization of complex dynamics using information-theoretic measures such as permutation entropy and statistical complexity. Ordinal analysis, as introduced by Bandt and Pompe in 2002, has become a foundational tool in the study of time series from diverse domains, leveraging the temporal ordering relations within embedded vectors to yield robust, easily interpretable statistics that are insensitive to monotonic transformations and measurement noise.

1. Formal Definition and Symbolization Procedure

The methodology begins with the construction of overlapping delay vectors from a given real-valued time series X={xt}t=1NX = \{x_t\}_{t=1}^N, using two key parameters: embedding dimension D2D \geq 2 and temporal delay τ1\tau \geq 1 (Aquino et al., 2015, Rosso et al., 2016, Leyva et al., 2022, Gonzalez et al., 2024, Avhale et al., 27 Aug 2025). For tt ranging from $1$ to N(D1)τN-(D-1)\tau,

Xt=[xt,xt+τ,,xt+(D1)τ]\mathbf{X}_t = [x_t,\, x_{t+\tau},\, \ldots,\, x_{t+(D-1)\tau}]

Each vector is mapped to an ordinal pattern (permutation) π=(r0,r1,...,rD1)\pi = (r_0, r_1, ..., r_{D-1}) such that

xt+r0τxt+r1τxt+rD1τx_{t+r_0\tau} \leq x_{t+r_1\tau} \leq \cdots \leq x_{t+r_{D-1}\tau}

Ties are resolved by prescribing that earlier time indices are ranked lower (i.e., ri<ri+1r_i < r_{i+1} if D2D \geq 20) (Rosso et al., 2016, Leyva et al., 2022, Gonzalez et al., 2024).

The mapping of all such vectors yields a sequence of symbols (ordinal patterns) taking as values the D2D \geq 21 possible permutations of D2D \geq 22 indices.

2. Permutation Probability Distribution and Information-Theoretic Quantifiers

The empirical distribution of observed patterns D2D \geq 23 is defined as

D2D \geq 24

Permutation entropy D2D \geq 25 quantifies the diversity of observed patterns via the Shannon entropy (Aquino et al., 2015, Rosso et al., 2016, Gonzalez et al., 2024, Avhale et al., 27 Aug 2025):

D2D \geq 26

D2D \geq 27

To capture both randomness and structure, statistical complexity D2D \geq 28 is constructed by combining normalized entropy with a disequilibrium measure such as the normalized Jensen–Shannon divergence D2D \geq 29 relative to the uniform distribution τ1\tau \geq 10:

τ1\tau \geq 11

τ1\tau \geq 12

τ1\tau \geq 13

This construction ensures τ1\tau \geq 14 and is maximized for distributions that are neither completely random nor perfectly ordered (Aquino et al., 2015, Rosso et al., 2016, Avhale et al., 27 Aug 2025). Alternative complexity quantifiers, such as LMC complexity and, for discrete distributions, Fisher information, are also widely employed (Rosso et al., 2016, Montani et al., 2013).

3. Practical Considerations: Parameter Selection and Tie Handling

Selection of the embedding dimension τ1\tau \geq 15 and delay τ1\tau \geq 16 is informed by trade-offs between pattern diversity and statistical reliability. Bandt and Pompe recommend τ1\tau \geq 17, with τ1\tau \geq 18 (e.g., τ1\tau \geq 19) to ensure all patterns are well-sampled (Aquino et al., 2015, Rosso et al., 2016, Leyva et al., 2022, Gonzalez et al., 2024, Avhale et al., 27 Aug 2025). The time delay tt0 is often set to tt1 for consecutive samples but can be increased to probe slower dynamics or specific time scales indicated by autocorrelation or mutual information analyses (Aquino et al., 2015, Gonzalez et al., 2024).

Ties in floating-point data are rare, but real-world measurement constraints (e.g., quantization or plateaus) can produce equal values. The canonical approach resolves ties by temporal order—earlier indices are ranked lower (Leyva et al., 2022, Gonzalez et al., 2024, Aquino et al., 2015). In high-frequency tied data such as HRV time series, alternative tie-breaking or imputation strategies are required. Bayesian missing data imputation, which infers a tie-case's permutation based on observed pattern frequencies from complete cases, yields minimally biased entropy estimates (Traversaro et al., 2017).

4. The Entropy–Complexity Plane and Dynamical Inference

Plotting normalized permutation entropy tt2 versus statistical complexity tt3 situates time series within a causality plane whose boundaries are determined by theoretical minima and maxima of tt4 at each entropy level (Aquino et al., 2015, Leyva et al., 2022, Gonzalez et al., 2024, Lucas et al., 12 Nov 2025). Different regimes cluster in distinct regions:

  • Purely stochastic (white noise): tt5, tt6
  • Deterministic chaos: intermediate tt7, maximal tt8
  • Regular deterministic: tt9, $1$0

This framework robustly distinguishes stochastic, chaotic, and regular dynamics, and is resilient to noise contamination. Analyses of forbidden (never occurring in deterministic systems) and missing (unobserved due to finite sampling) patterns further decode the presence of underlying deterministic structure, even under strong noise (Rosso et al., 2011, Aronis et al., 2018).

5. Extensions: Weighted, Global, and Multivariate Variants

Weighted permutation entropy (WPE) enhances classical permutation entropy by incorporating amplitude information, such as local variance within each embedded vector, resulting in a weighted distribution over ordinal patterns (Lucas et al., 12 Nov 2025). WPE improves sensitivity to changes in both ordering and amplitude dynamics.

Global Permutation Entropy (GPE) generalizes the standard consecutive-pattern PE by considering all possible order-$1$1 patterns, including non-consecutive point subsets, leveraging efficient algorithms to scale the enumeration (Avhale et al., 27 Aug 2025). GPE reveals structural information otherwise inaccessible through the classical consecutive embedding approach.

Multivariate generalizations, bivariate and network extensions facilitate ordinal pattern analysis of multidimensional or coupled time series. Permutation mutual information and transfer entropy derived from joint or conditional pattern distributions are increasingly used to probe coupling and causality in networked systems (Lehnertz, 2023).

6. Representative Applications and Impact

Bandt–Pompe analysis has demonstrated value across physics, physiology, neuroscience, geosciences, and engineering. Applications include:

A condensed table of core metric definitions is shown below (notation as in text):

Quantifier Formula Range
Permutation entropy $1$2 $1$3
Jensen–Shannon divergence $1$4 $1$5
Statistical complexity $1$6 $1$7

7. Open Problems and Limitations

Current limitations include the statistical characterization of ordinal measures under non-stationarity and in finite samples, principled feature reduction for ordinal-pattern-based machine learning, reliable handling of missing or irregularly sampled data, and extensions to higher-dimensional or network time series (Leyva et al., 2022, Lehnertz, 2023). While the methodology is robust to monotonic transformations and moderate noise, it discards amplitude magnitudes, which may limit its discriminative power in settings where both order and value are informative (Flores et al., 12 Jul 2025, Leyva et al., 2022).

Combining Bandt–Pompe features with kernel or deep-learning models is an active area of research, promising further improvements in time series classification and forecasting (Leyva et al., 2022, Rosso et al., 2016). Ongoing studies explore optimal parameterization, efficient computation for high $1$8, and integration with non-parametric or compression-based complexity measures (Zozor et al., 2013, Mateos et al., 2017).


References:

(Aquino et al., 2015, Rosso et al., 2016, Leyva et al., 2022, Gonzalez et al., 2024, Avhale et al., 27 Aug 2025, Rosso et al., 2011, Flores et al., 12 Jul 2025, Aronis et al., 2018, Montani et al., 2013, Mateos et al., 2017, Carlos et al., 22 Jul 2025, Zozor et al., 2013, Lehnertz, 2023, Keller et al., 2015, Lucas et al., 2021, Traversaro et al., 2017, Lucas et al., 12 Nov 2025).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (17)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Bandt–Pompe Symbolization Methodology.