Distributional Reduction: Unifying Dimensionality Reduction and Clustering with Gromov-Wasserstein (2402.02239v2)

Published 3 Feb 2024 in cs.LG and stat.ML

Abstract: Unsupervised learning aims to capture the underlying structure of potentially large and high-dimensional datasets. Traditionally, this involves using dimensionality reduction (DR) methods to project data onto lower-dimensional spaces or organizing points into meaningful clusters (clustering). In this work, we revisit these approaches under the lens of optimal transport and exhibit relationships with the Gromov-Wasserstein problem. This unveils a new general framework, called distributional reduction, that recovers DR and clustering as special cases and allows addressing them jointly within a single optimization problem. We empirically demonstrate its relevance to the identification of low-dimensional prototypes representing data at different scales, across multiple image and genomic datasets.
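To make the framework concrete, below is a minimal sketch of the distributional-reduction idea, not the authors' implementation: it alternates between (i) a Gromov-Wasserstein coupling between the data's pairwise-cost matrix and that of a few low-dimensional prototypes, computed with the POT library, and (ii) an update of the prototype geometry followed by a classical-MDS re-embedding into the target dimension. The function name `distributional_reduction`, the uniform weights, the alternating scheme, and the MDS step are illustrative choices; the paper optimizes the coupling and the embedding jointly within a single objective.

```python
# Minimal sketch (assumptions noted above), using POT's standard GW solver.
import numpy as np
import ot  # POT: Python Optimal Transport

def distributional_reduction(X, n_prototypes=10, dim=2, n_iter=20, seed=0):
    """Alternating GW heuristic: returns prototype coordinates and the coupling."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    p = np.full(n, 1.0 / n)                        # uniform weights on the data
    q = np.full(n_prototypes, 1.0 / n_prototypes)  # uniform prototype weights
    C1 = ot.dist(X, X)                             # squared Euclidean cost matrix
    Z = rng.standard_normal((n_prototypes, dim))   # random prototype initialisation
    for _ in range(n_iter):
        C2 = ot.dist(Z, Z)
        # (i) Gromov-Wasserstein coupling between the two metric-measure spaces
        T = ot.gromov.gromov_wasserstein(C1, C2, p, q, loss_fun='square_loss')
        # (ii) closed-form square-loss update of the prototype cost matrix
        # (the GW barycenter update of Peyré et al., 2016), then classical MDS
        # to pull prototype coordinates back into the target dimension
        C2_new = (T.T @ C1 @ T) / np.outer(q, q)
        J = np.eye(n_prototypes) - 1.0 / n_prototypes
        B = -0.5 * J @ C2_new @ J                  # double-centred Gram matrix
        w, V = np.linalg.eigh(B)
        idx = np.argsort(w)[::-1][:dim]
        Z = V[:, idx] * np.sqrt(np.maximum(w[idx], 0.0))
    return Z, T
```

In this sketch the two classical tasks fall out of one object: `Z` plays the role of a low-dimensional embedding of the prototypes (dimensionality reduction), while a hard cluster assignment for data point `i` can be read off the coupling as `T[i].argmax()`.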
