
SOM-VAE: Interpretable Discrete Representation Learning on Time Series (1806.02199v7)

Published 6 Jun 2018 in cs.LG and stat.ML

Abstract: High-dimensional time series are common in many domains. Since human cognition is not optimized to work well in high-dimensional spaces, these areas could benefit from interpretable low-dimensional representations. However, most representation learning algorithms for time series data are difficult to interpret. This is due to non-intuitive mappings from data features to salient properties of the representation and non-smoothness over time. To address this problem, we propose a new representation learning framework building on ideas from interpretable discrete dimensionality reduction and deep generative modeling. This framework allows us to learn discrete representations of time series, which give rise to smooth and interpretable embeddings with superior clustering performance. We introduce a new way to overcome the non-differentiability in discrete representation learning and present a gradient-based version of the traditional self-organizing map algorithm that is more performant than the original. Furthermore, to allow for a probabilistic interpretation of our method, we integrate a Markov model in the representation space. This model uncovers the temporal transition structure, improves clustering performance even further and provides additional explanatory insights as well as a natural representation of uncertainty. We evaluate our model in terms of clustering performance and interpretability on static (Fashion-)MNIST data, a time series of linearly interpolated (Fashion-)MNIST images, a chaotic Lorenz attractor system with two macro states, as well as on a challenging real world medical time series application on the eICU data set. Our learned representations compare favorably with competitor methods and facilitate downstream tasks on the real world data.

Authors (5)
  1. Vincent Fortuin (52 papers)
  2. Matthias Hüser (8 papers)
  3. Francesco Locatello (92 papers)
  4. Heiko Strathmann (19 papers)
  5. Gunnar Rätsch (59 papers)
Citations (130)

Summary

Overview of SOM-VAE: Interpretable Discrete Representation Learning on Time Series

The paper presents SOM-VAE, a deep learning framework for interpretable discrete representation learning on time series data. It addresses the lack of interpretable low-dimensional representations for high-dimensional time series, a challenge prevalent across domains such as finance, healthcare, and dynamical systems. The authors combine ideas from self-organizing maps (SOMs), variational autoencoders (VAEs), and probabilistic modeling into a single architecture that yields smooth, interpretable embeddings with improved clustering performance.

SOM-VAE integrates a gradient-based version of the classical SOM, enabling the automatic learning of topologically interpretable discrete representations. The model overcomes the non-differentiability of the discrete assignment step with a tailored loss function and a gradient estimator that passes decoder gradients straight through to the encoder, avoiding the optimization difficulties of earlier discrete representation learners. Furthermore, a Markov transition model fitted in the representation space gives transitions over time a probabilistic interpretation, which further improves clustering performance and exposes the temporal structure of the data.
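
To make this concrete, below is a minimal PyTorch-style sketch of the core loss terms as described in the paper; the module names, the grid-neighborhood handling, and the hyperparameters are illustrative assumptions rather than the authors' reference implementation.

```python
import torch
import torch.nn.functional as F

# Illustrative sketch of the SOM-VAE loss terms (not the authors' code).
# z_e:      encoder output, shape (batch, d)
# codebook: SOM node embeddings on an m x m grid, shape (m*m, d)

def som_vae_loss(x, z_e, codebook, decoder, m, alpha=1.0, beta=1.0):
    # Hard assignment: nearest codebook vector for each encoding.
    dists = torch.cdist(z_e, codebook)        # (batch, m*m)
    k = dists.argmin(dim=1)                   # discrete cluster indices
    z_q = codebook[k]                         # (batch, d)

    # Straight-through trick: forward pass uses z_q,
    # gradients flow back into z_e.
    z_q_st = z_e + (z_q - z_e).detach()

    # Reconstruct from both the continuous and the quantized encoding.
    recon = F.mse_loss(decoder(z_e), x) + F.mse_loss(decoder(z_q_st), x)

    # Commitment: keep encodings close to their assigned codebook vector.
    commit = F.mse_loss(z_e, z_q.detach())

    # SOM term: also pull the grid neighbors of the winner toward the
    # (detached) encoding, inducing the topological ordering.
    # Boundary handling is simplified here by clamping to the grid edge.
    rows, cols = k // m, k % m
    som = 0.0
    for dr, dc in [(-1, 0), (1, 0), (0, -1), (0, 1)]:
        r = (rows + dr).clamp(0, m - 1)
        c = (cols + dc).clamp(0, m - 1)
        som = som + F.mse_loss(codebook[r * m + c], z_e.detach())

    return recon + alpha * commit + beta * som
```

Because the quantization is bypassed only in the backward pass, the encoder, decoder, and SOM codebook can all be trained jointly with ordinary stochastic gradient descent.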

Core Contributions

The paper's primary contributions include:

  1. Development of a novel framework for interpretable discrete representation learning specifically tailored for time series data.
  2. Effective incorporation of a probabilistic model in the latent space alongside the SOM and VAE components, which is shown to enhance cluster interpretability and performance (a sketch of this component follows the list).
  3. Demonstration of superior clustering results compared to traditional methods on benchmark datasets and practical utility on real-world medical datasets.
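
The probabilistic component in contribution 2 amounts to fitting a Markov transition matrix over consecutive discrete SOM assignments and adding two terms to the training objective: a transition loss that maximizes the likelihood of observed jumps between states, and a smoothness loss that keeps consecutive encodings near probable successor states. The sketch below follows that description; all variable names are illustrative.

```python
import torch

# Sketch of the latent Markov model (illustrative, not the reference code).
# k_prev, k_next: discrete SOM indices at t-1 and t, shape (batch,)
# logits:         learnable (n_nodes, n_nodes) parameters of the
#                 transition matrix P(z_t | z_{t-1})

def transition_losses(logits, k_prev, k_next, z_e_next, codebook, eps=1e-8):
    probs = torch.softmax(logits, dim=1)      # each row is a distribution

    # Transition loss: negative log-likelihood of the observed jump.
    p_obs = probs[k_prev, k_next]
    l_trans = -(p_obs + eps).log().mean()

    # Smoothness loss: encourage the next encoding to stay close to
    # codebook vectors that are probable successors of the current state.
    d = torch.cdist(z_e_next, codebook) ** 2  # (batch, n_nodes)
    l_smooth = (probs[k_prev] * d).sum(dim=1).mean()

    return l_trans, l_smooth
```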

Evaluations and Implications

The paper's empirical evaluations encompass several datasets: MNIST and Fashion-MNIST for static data, linearly interpolated time series built from those images, and a simulated chaotic system, the Lorenz attractor, with two macro states. The model exhibits strong clustering performance on the benchmark datasets, achieving higher purity and normalized mutual information (NMI) scores than existing approaches such as k-means and VQ-VAE.
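
For reference, both metrics are standard and easy to compute from cluster assignments and ground-truth labels; the snippet below is a common scikit-learn recipe, not code from the paper.

```python
import numpy as np
from sklearn.metrics import normalized_mutual_info_score
from sklearn.metrics.cluster import contingency_matrix

def purity_score(labels_true, labels_pred):
    # Each cluster is credited with its most common ground-truth label.
    cm = contingency_matrix(labels_true, labels_pred)
    return cm.max(axis=0).sum() / cm.sum()

labels_true = np.array([0, 0, 1, 1, 2, 2])
labels_pred = np.array([0, 0, 1, 1, 1, 2])   # toy cluster assignments

print("purity:", purity_score(labels_true, labels_pred))
print("NMI:   ", normalized_mutual_info_score(labels_true, labels_pred))
```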

One of the practical applications explored is medical time series, specifically the eICU dataset, where SOM-VAE facilitates downstream tasks such as predicting future physiology states. The potential for meaningful predictions in high-stakes environments like intensive care units underlines the framework's utility beyond benchmark settings.

Future Directions

The paper suggests several avenues for future research, including extending the probabilistic model to predictive tasks by incorporating more sophisticated models like higher-order Markov models or Gaussian processes. Addressing non-differentiability with more theoretically grounded approaches and exploring dynamically learned neighborhood structures in SOMs are also posited as promising directions.

In conclusion, SOM-VAE represents a significant advance in interpretable deep learning for time series data. Its combination of SOM, VAE, and probabilistic modeling offers a template for building comprehensible and practically effective representations, particularly for applications that demand clear insight into model decisions and transitions over time.
