Papers
Topics
Authors
Recent
Search
2000 character limit reached

SIGL Method: Unified Time Series Feature Extraction

Updated 23 February 2026
  • SIGL method is a unifying framework for feature extraction from multivariate time series using signature transforms and offers order-aware representations.
  • It modularizes steps like augmentation, windowing, transformation, and rescaling to create a customizable feature pipeline validated by empirical studies.
  • The approach improves classification tasks by subsuming prior signature-based techniques and delivering practical performance gains in time series analysis.

The SIGL method refers to a unifying, generalised framework for extracting features from multivariate time series based on the signature transform from rough path theory. It is designed to systematize and extend the diverse set of signature-based techniques prevalent in both mathematical analysis and machine learning. The SIGL construction modularizes every step—augmentation, windowing, transformation depth, and rescaling—enabling systematic adaptation and providing a canonical pipeline validated by large-scale empirical studies (Morrill et al., 2020). The method is applicable across a wide range of domains, especially in time series classification and analysis.

1. Mathematical Foundation: Signature Transform and Its Properties

Let X:[0,T]RdX:[0,T]\to\mathbb{R}^d be a continuous path, typically realized by piecewise-linear interpolation of a discrete time series x\mathbf{x}. The truncated signature of depth NN, denoted SigN(X)\mathrm{Sig}^N(X), is the collection of all iterated integrals up to order NN: SigN(X)={SI(X)I=(i1,,ik),  1kN},SI(X)=0<t1<<tk<TdXt1i1dXtkik.\mathrm{Sig}^N(X) = \left\{ S^I(X) \mid I = (i_1,\ldots,i_k),\; 1 \leq k \leq N \right\}, \quad S^I(X) = \int_{0 < t_1 < \ldots < t_k < T} dX_{t_1}^{i_1} \cdots dX_{t_k}^{i_k}. Key theoretical guarantees:

  • Universality: Any continuous functional f(X)f(X) can be arbitrarily well-approximated linearly on Sig(X)\mathrm{Sig}(X).
  • Uniqueness: Under mild hypotheses, the (full) signature determines the path up to translation and time-reparameterization.
  • Log-signature: The Lie-series logarithm of SigN(X)\mathrm{Sig}^N(X) removes algebraic redundancies, yielding a lower-dimensional, non-linear representation.

These properties furnish strong theoretical justification for deploying signatures as universal, order-aware features in sequential modeling.

2. Modular Generalised Signature Framework

Traditionally, signature feature extraction is viewed monolithically as Sigidentity\mathrm{Sig} \circ \mathrm{identity}. The SIGL method generalizes this via the following composable modules:

(A) Augmentation (ϕ\phi):

  • Time augmentation ϕtime\phi_{\mathrm{time}}: Incorporates time as an extra dimension, breaking parameterization invariance.
  • Basepoint augmentation ϕbp\phi_{\mathrm{bp}}: Restores translation visibility; can be implemented as invisibility-reset or coordinate drop to zero.
  • Lead–lag transformation ϕll\phi_{\mathrm{ll}}: Duplicates channels to capture quadratic variation.
  • Other: Coordinate projections (random, learned), and trainable stream-preserving functions.

(B) Windowing (WW):

  • Global: Full path.
  • Sliding: Local, fixed-length subpaths (start, step, length).
  • Expanding: Increasing-size prefixes.
  • Hierarchical dyadic: All sub-paths at dyadic scales, yielding 2q12^q-1 windows for dyadic depth qq.

(C) Transform (SNS^N or logSN\log S^N):

  • Iterated integrals computed up to order NN.
  • Optionally replaced by log-signature for redundancy reduction.

(D) Rescaling (ρ\rho):

  • Post-scaling: Each kk-order term multiplied by k!k!.
  • Pre-scaling: Path rescaled so αN/N!1\alpha^N/N!\approx 1.

A generic feature vector for augmentation ii and window jj is: zi,j=ρpostSNρpreWjϕi(x)z_{i, j} = \rho_{\mathrm{post}} \circ S^N \circ \rho_{\mathrm{pre}} \circ W^j \circ \phi^i (\mathbf{x}) The final representation concatenates all {zi,j}\{z_{i,j}\} vectors.

Based on an empirical study across 26 datasets and 4 classifiers, the SIGL framework identifies a robust, domain-agnostic canonical pipeline: zj=SNWdyadic,qjϕbpϕtime(x),j=1,,2q1z_j = S^N \circ W^j_{\mathrm{dyadic},q} \circ \phi_{\mathrm{bp}} \circ \phi_{\mathrm{time}} (\mathbf{x}),\quad j=1,\ldots,2^q-1 With the following recommended defaults:

  • Signature depth N=3N=3.
  • Hierarchical dyadic windowing, dyadic depth q=3q=3 (i.e., 7 windows).
  • Time and basepoint augmentations only.
  • No pre-scaling; use simple post-scaling (factorial scaling). Hyperparameters (N,q)(N, q) are tuned in {1,,6}×{2,3,4}\{1,\ldots,6\}\times\{2,3,4\}.

This canonical pipeline provides strong baseline performance, and is extensible by modifying any of the four principal modules.

4. Connections to Prior Methods and Special Cases

The SIGL framework subsumes earlier signature-based techniques as special cases:

  • Levin & Lyons: ϕtime\phi_{\mathrm{time}}, global window, pure SNS^N.
  • Yang et al. (“invisibility-reset”): ϕtimeϕinv\phi_{\mathrm{time}} \circ \phi_{\mathrm{inv}}.
  • Lead–lag signature: ϕll\phi_{\mathrm{ll}}, global window.
  • Sliding-window signatures: appropriate ϕ\phi, sliding window WslidingW_{\mathrm{sliding}}.

Each is expressible as zj=(SNWjϕ)(x)z_j = (S^N \circ W^j \circ \phi)(\mathbf{x}) for appropriate choices of component functions.

5. Practical Implementation Considerations

SIGL offers concrete guidance for practitioners:

  • Computational complexity: Calculation of SigN\mathrm{Sig}^N on length-nn, dd-channel paths is O(ndN)\mathcal{O}(n d^N), producing (dN+11)/(d1)(d^{N+1}-1)/(d-1) features.
  • Software: Open-source iisignature (C/CUDA) and PyTorch–based Signatory libraries are available.
  • Model selection: Depth NN and window depth qq are cross-validated (typical N4N \leq 4, q4q \leq 4). Downstream predictors include 2\ell^2-regularized linear models, random forests, or small RNN/CNNs.
  • Preprocessing: Channel-wise standardization; irregularly sampled time series handled via piecewise-linear interpolation, with no explicit resampling required.

6. Empirical Observations and Performance

Findings from broad evaluation include:

  • Time + basepoint augmentations yield marked improvement over unaugmented signals.
  • Lead–lag augmentation offers further gains, while coordinate projections and learned heads are particularly helpful in high-dimensional, e.g., EEG data.
  • Hierarchical dyadic windowing (q2q \geq 2) outperforms global and sliding windows.
  • Conventional signature generally outperforms log-signature (statistically significant, p0.01p \approx 0.01).
  • Optimal extraction depth is typically N=3N = 3 or 4; deeper extraction tends to overfit.
  • With the canonical SIGL pipeline and a random forest classifier, average method ranking was 4.3 out of 9 strong baselines (MUSE, HIVE-COTE, TapNet, MLCN, etc.) without ever exceeding 32 GB RAM or 30 minutes runtime per dataset on a single GPU.

7. Significance and Flexibility of the SIGL Framework

SIGL is fundamentally a modular, unifying abstraction rather than a fixed method. All prior signature-based feature extraction variants are expressible within its architecture. The framework supports out-of-the-box adoption via its canonical pipeline but permits seamless adaptation to diverse applications simply by exchanging or composing its four principal modules (ϕ,W,SN,ρ\phi, W, S^N, \rho) (Morrill et al., 2020). This positions SIGL as both a practical standard and an extensible, deeply justified feature generator for multivariate time series in contemporary machine learning settings.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to SIGL Method.