Papers
Topics
Authors
Recent
Search
2000 character limit reached

Log-Signature: Theory, Methods & Applications

Updated 1 March 2026
  • Log-signature is a minimal Lie series embedding that encodes the geometric and analytic features of paths within the free Lie algebra framework.
  • Efficient algorithms, like segment-wise BCH and projection methods, enable robust computation and dimensionality reduction for high-dimensional sequential data.
  • Applications span deep learning, statistical modeling, and signal processing, providing interpretable, lower-dimensional features for time series and stochastic processes.

The log-signature of a path encodes its geometric and analytic features as a Lie series in the tensor algebra, providing a minimal and highly structured embedding for streams, stochastic processes, and time series. It enables fundamental reductions in dimension and redundancy versus the raw signature, and underpins both theoretical developments in rough path theory and a broad range of data-driven applications, notably in deep learning and statistical modeling of sequential data. This article presents a complete account of log-signatures, including their algebraic construction, analytic characterization, efficient computation, statistical and geometric properties, classification results, and principal uses in modern learning systems and time series models.

1. Algebraic Definition and Structure

Given a continuous or piecewise-linear path X:[0,T]RdX:[0,T]\to \mathbb{R}^d, its signature S(X)S(X) is the formal series in the tensor algebra T(Rd)=k=0(Rd)kT(\mathbb{R}^d) = \bigoplus_{k=0}^\infty (\mathbb{R}^d)^{\otimes k}:

S(X)=1+k=10<t1<<tk<TdXt1dXtk.S(X) = 1 + \sum_{k=1}^\infty \int_{0 < t_1 < \cdots < t_k < T} dX_{t_1} \otimes \cdots \otimes dX_{t_k}.

This object is group-like, i.e., it encodes concatenation as the shuffle product, and is a complete invariant for tree-reduced rough paths up to reparametrization.

The log-signature is the tensor algebra logarithm,

logS(X)=n=1(1)n1n(S(X)1)n,\log S(X) = \sum_{n=1}^\infty \frac{(-1)^{n-1}}{n} (S(X)-1)^{n},

which lies, by Chen's theorem, in the free Lie algebra generated by Rd\mathbb{R}^d. Each homogeneous component corresponds to iterated commutators (brackets), and every log-signature admits an expansion:

logS(X)=1(X)+2(X)+,k(X)Lk(Rd).\log S(X) = \ell_1(X) + \ell_2(X) + \cdots, \quad \ell_k(X) \in L_k(\mathbb{R}^d).

The truncation at depth mm yields coordinates indexed by Lie words of length m\leq m.

The log-signature serves as the generator (in the BCH sense) of the signature, so that concatenation of paths is mapped by the Baker–Campbell–Hausdorff formula:

log(S(X)S(Y))=logS(X)+logS(Y)+12[logS(X),logS(Y)]+\log(S(X)\cdot S(Y)) = \log S(X) + \log S(Y) + \frac{1}{2}[\log S(X),\log S(Y)] + \cdots

Thus, the log-signature endows the path space with a Lie group structure (Reizenstein, 2017).

2. Analytic and Geometric Properties

For paths of bounded variation or finite pp-variation (p<3p<3), the signature characterizes the path up to “tree-like” equivalence, and the log-signature provides a minimal sufficient statistic:

  • Minimality: The log-signature eliminates algebraic redundancy present in the full tensor signature; within degree mm, the number of log-signature features is dimnd,m=k=1m1kikμ(k/i)di\dim \mathfrak{n}_{d,m} = \sum_{k=1}^m \frac{1}{k} \sum_{i|k} \mu(k/i)d^i (Möbius function μ\mu), asymptotically O(dm/m)O(d^m/m) (Friz et al., 2024).
  • Geometric interpretability: The first term (level 1) encodes the total increment; the second term (level 2) records signed “areas,” and higher terms reflect increasingly complex noncommutative commutators, capturing the order of oscillation and interaction among path coordinates (Boedihardjo et al., 22 Jun 2025).
  • Decay: Signature coefficients decay factorially, whereas log-signature coefficients, generically, exhibit only geometric decay. For rectifiable paths, if the log-signature has infinite radius of convergence, the path is a straight line (Boedihardjo et al., 22 Jun 2025).
  • Path classification: For rectifiable paths, the log-signature is a finite-degree Lie polynomial if and only if the path is a straight line up to reparametrization; otherwise, its Lie series has infinite support (Friz et al., 2023).

3. Efficient Algorithms and Software

Efficient computation of log-signatures is essential for large-scale applications. The predominant approaches are:

  • Segment-wise BCH update: Each path segment is interpreted as a Lie algebra element; the log-signature of the whole path is recursively updated using truncated BCH formulas. Complexity is O(Ndimnd,m)O(N \cdot \dim \mathfrak{n}_{d,m}) for NN segments and depth mm (Reizenstein, 2017, Reizenstein et al., 2018).
  • Signature + projection: Compute the truncated signature, expand its tensor logarithm, and project onto a chosen free Lie algebra basis (commonly Lyndon for the lower-triangular property). This method dominates for high d,md,m (Reizenstein et al., 2018).
  • Practical packages: “iisignature” (C++/Python, Reizenstein & Graham) and “Signatory” (C++/CUDA, Kidger & Lyons) implement these methods, using precompiled or JIT-compiled code for the BCH step and optimized basis projections. Computation for thousands of paths with d=64,m=3d=64, m=3 is achievable in seconds on modern CPUs (Reizenstein et al., 2018, Curtò et al., 2022).

4. Statistical, Learning, and Signal Processing Applications

Log-signatures are effective feature embeddings in machine learning, particularly for time series and sequential data. Key use cases include:

  • Similarity metrics for empirical distributions: In generative modeling (e.g., GAN evaluation), the RMSE and MAE between mean log-signatures of real and generated samples provide highly efficient and interpretable alternatives to classical metrics (FID, KID). Log-signature-based metrics can detect convergence and overfitting in GANs with substantially lower computation (Curtò et al., 2022).
  • Neural sequence models: Hybrid models (e.g., Logsig-RNN, LogSig-LSTM) feed log-signature features into RNNs, dramatically reducing the input sequence length (by orders of magnitude), improving robustness to irregular sampling, and achieving state-of-the-art accuracy on synthetic SDE regression, action recognition, and gesture classification tasks (Liao et al., 2019, Feng et al., 2021).
  • Score-based generative models: Score-based diffusion models for time series operate directly on log-signature embeddings, exploiting their linear structure for both forward and reverse SDEs in Lie algebra space. Explicit inversion formulas recover the original path from the log-signature in Fourier or orthogonal polynomial bases (Barancikova et al., 2024).
  • Expected signature and cumulant analysis: In stochastic modeling, the log of the expected signature (“signature cumulant”) organizes higher-order moments and reduces complexity by an order of nn, yielding recursive Magnus-type expansions and diamond product recursions (Friz et al., 2024).
  • Benchmarking:
    • In GAN evaluation, log-signature RMSE and MAE capture convergence ordering in seconds on CPUs, compared to minutes for FID on GPUs (Curtò et al., 2022).
    • In SDE learning, Logsig-RNN achieves sub-2×10⁻⁶ MSE with feature dimension 4×84\times8, much more efficiently than vanilla RNNs (Liao et al., 2019).
    • For high-frequency BSDEs, LogSig-LSTM enables accurate pricing at long time horizons and in high dimension with coarse segmentation (Feng et al., 2021).

5. Theoretical Classification and Uniqueness

Research establishes stringent constraints on the algebraic and analytic structure of log-signatures:

  • Lyons–Sidorova conjecture: Only straight lines (up to tree reduction) yield infinite-radius log-signatures. If the log-signature is entire on all subintervals, the path must be globally linear (Boedihardjo et al., 22 Jun 2025).
  • Algebraic identities: Infinite radius of convergence for the log-signature enforces vanishing of certain complex-weighted iterated integrals, providing a system of necessary and generically sufficient conditions for “straightness” in the path class (Boedihardjo et al., 22 Jun 2025).
  • Tree-like equivalence: The signature—and log-signature—captures the essential information of a path modulo negligible “tree-like” pieces, and for monotone or generic piecewise-linear paths, injectivity is characterized explicitly (Friz et al., 2023).

6. Computational and Practical Considerations

Practical deployment of log-signature methods requires careful trade-offs and implementation choices:

  • Basis selection: Hall and Lyndon bases are standard; the Lyndon basis accelerates projection (triangularization) and is used in high-performance code (Reizenstein et al., 2018).
  • Truncation depth: Selection of mm balances expressivity (capturing higher-order geometry) against cost; m=3m=3–$5$ suffices for handwriting/skeleton data, higher mm for richer dynamics.
  • Numerical stability: Higher-degree log-signature terms can decay rapidly and be sensitive to noise; normalization and segment scaling are often required.
  • Backpropagation: The log-signature map is differentiable, permitting efficient end-to-end learning in neural architectures (Reizenstein et al., 2018, Liao et al., 2019).
  • Scalability: Modern libraries achieve computation time linear in path length and moderate in dd, mm. For very long streams or high-dimensional data, each channel can be processed separately (Barancikova et al., 2024).

7. Summary Table: Central Properties of the Log-Signature

Aspect Log-signature Signature
Algebraic structure Free Lie algebra (minimal, non-redundant) Full tensor algebra (with shuffle relations)
Size for level mm in dd dimensions O(dm/m)O(d^m / m) O(dm)O(d^m)
Universality (approximation) With nonlinear read-out (RNN, etc.) Linear functionals are universal
Time reparametrization invariance Yes Yes
Robust to missing/irregular data More robust (lower redundancy) Sensitive to redundancies
Efficient computation Yes; via segment-wise BCH or projection Yes; via Chen's relations
Unique path recovery Up to tree-like equivalence (full infinite log-sig) Up to tree-like equivalence

The log-signature serves as a mathematically principled, dimension-reducing, algebraically interpretable, and computationally tractable representation of paths, with rigorous foundations and extensive empirical validation across deep learning, stochastic analysis, and statistical signal processing (Reizenstein, 2017, Reizenstein et al., 2018, Liao et al., 2019, Curtò et al., 2022, Friz et al., 2023, Barancikova et al., 2024, Friz et al., 2024, Boedihardjo et al., 22 Jun 2025).

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Log-signature.