SIGL Method: Unified Time Series Feature Extraction
- SIGL method is a unifying framework for feature extraction from multivariate time series using signature transforms and offers order-aware representations.
- It modularizes steps like augmentation, windowing, transformation, and rescaling to create a customizable feature pipeline validated by empirical studies.
- The approach improves classification tasks by subsuming prior signature-based techniques and delivering practical performance gains in time series analysis.
The SIGL method refers to a unifying, generalised framework for extracting features from multivariate time series based on the signature transform from rough path theory. It is designed to systematize and extend the diverse set of signature-based techniques prevalent in both mathematical analysis and machine learning. The SIGL construction modularizes every step—augmentation, windowing, transformation depth, and rescaling—enabling systematic adaptation and providing a canonical pipeline validated by large-scale empirical studies (Morrill et al., 2020). The method is applicable across a wide range of domains, especially in time series classification and analysis.
1. Mathematical Foundation: Signature Transform and Its Properties
Let be a continuous path, typically realized by piecewise-linear interpolation of a discrete time series . The truncated signature of depth , denoted , is the collection of all iterated integrals up to order : Key theoretical guarantees:
- Universality: Any continuous functional can be arbitrarily well-approximated linearly on .
- Uniqueness: Under mild hypotheses, the (full) signature determines the path up to translation and time-reparameterization.
- Log-signature: The Lie-series logarithm of removes algebraic redundancies, yielding a lower-dimensional, non-linear representation.
These properties furnish strong theoretical justification for deploying signatures as universal, order-aware features in sequential modeling.
2. Modular Generalised Signature Framework
Traditionally, signature feature extraction is viewed monolithically as . The SIGL method generalizes this via the following composable modules:
(A) Augmentation ():
- Time augmentation : Incorporates time as an extra dimension, breaking parameterization invariance.
- Basepoint augmentation : Restores translation visibility; can be implemented as invisibility-reset or coordinate drop to zero.
- Lead–lag transformation : Duplicates channels to capture quadratic variation.
- Other: Coordinate projections (random, learned), and trainable stream-preserving functions.
(B) Windowing ():
- Global: Full path.
- Sliding: Local, fixed-length subpaths (start, step, length).
- Expanding: Increasing-size prefixes.
- Hierarchical dyadic: All sub-paths at dyadic scales, yielding windows for dyadic depth .
(C) Transform ( or ):
- Iterated integrals computed up to order .
- Optionally replaced by log-signature for redundancy reduction.
(D) Rescaling ():
- Post-scaling: Each -order term multiplied by .
- Pre-scaling: Path rescaled so .
A generic feature vector for augmentation and window is: The final representation concatenates all vectors.
3. Canonical Pipeline and Recommended Defaults
Based on an empirical study across 26 datasets and 4 classifiers, the SIGL framework identifies a robust, domain-agnostic canonical pipeline: With the following recommended defaults:
- Signature depth .
- Hierarchical dyadic windowing, dyadic depth (i.e., 7 windows).
- Time and basepoint augmentations only.
- No pre-scaling; use simple post-scaling (factorial scaling). Hyperparameters are tuned in .
This canonical pipeline provides strong baseline performance, and is extensible by modifying any of the four principal modules.
4. Connections to Prior Methods and Special Cases
The SIGL framework subsumes earlier signature-based techniques as special cases:
- Levin & Lyons: , global window, pure .
- Yang et al. (“invisibility-reset”): .
- Lead–lag signature: , global window.
- Sliding-window signatures: appropriate , sliding window .
Each is expressible as for appropriate choices of component functions.
5. Practical Implementation Considerations
SIGL offers concrete guidance for practitioners:
- Computational complexity: Calculation of on length-, -channel paths is , producing features.
- Software: Open-source iisignature (C/CUDA) and PyTorch–based Signatory libraries are available.
- Model selection: Depth and window depth are cross-validated (typical , ). Downstream predictors include -regularized linear models, random forests, or small RNN/CNNs.
- Preprocessing: Channel-wise standardization; irregularly sampled time series handled via piecewise-linear interpolation, with no explicit resampling required.
6. Empirical Observations and Performance
Findings from broad evaluation include:
- Time + basepoint augmentations yield marked improvement over unaugmented signals.
- Lead–lag augmentation offers further gains, while coordinate projections and learned heads are particularly helpful in high-dimensional, e.g., EEG data.
- Hierarchical dyadic windowing () outperforms global and sliding windows.
- Conventional signature generally outperforms log-signature (statistically significant, ).
- Optimal extraction depth is typically or 4; deeper extraction tends to overfit.
- With the canonical SIGL pipeline and a random forest classifier, average method ranking was 4.3 out of 9 strong baselines (MUSE, HIVE-COTE, TapNet, MLCN, etc.) without ever exceeding 32 GB RAM or 30 minutes runtime per dataset on a single GPU.
7. Significance and Flexibility of the SIGL Framework
SIGL is fundamentally a modular, unifying abstraction rather than a fixed method. All prior signature-based feature extraction variants are expressible within its architecture. The framework supports out-of-the-box adoption via its canonical pipeline but permits seamless adaptation to diverse applications simply by exchanging or composing its four principal modules () (Morrill et al., 2020). This positions SIGL as both a practical standard and an extensible, deeply justified feature generator for multivariate time series in contemporary machine learning settings.