Papers
Topics
Authors
Recent
Search
2000 character limit reached

Deep Scalable Subspace Clustering (SDSNet)

Updated 2 April 2026
  • Deep Scalable Subspace Clustering (SDSNet) is a deep learning-based method that improves subspace clustering by using landmark-based factorization and k-subspace assignments for scalability.
  • It overcomes traditional methods by approximating large affinity matrices, reducing complexity from quadratic/cubic to linear or near-linear performance.
  • SDSNet facilitates efficient clustering of high-dimensional data in large-scale applications, delivering competitive accuracy with significantly lower resource usage.

Deep Scalable Subspace Clustering (SDSNet) refers to a family of deep learning-based methods for subspace clustering that achieve computational scalability while leveraging deep representations. SDSNet frameworks address the prohibitive cost of the classical self-expressiveness approach and its deep counterparts, which require constructing and manipulating n×nn \times n affinity matrices, resulting in quadratic or cubic complexity with respect to the number of data points nn. Recent advances, especially those categorized under the SDSNet moniker, achieve linear or near-linear computational complexity by factorizing affinity computations, approximating self-expression via landmarks, or employing direct subspace assignments in the latent space. These models are designed to cluster high-dimensional data that are assumed to lie near a union of low-dimensional subspaces, and they enable the efficient application of deep subspace clustering to large-scale datasets.

1. Subspace Clustering: Self-Expressiveness and Scalability Limitations

Subspace clustering seeks to partition a dataset X=[x1,...,xn]∈RD×nX = [x_1, ..., x_n] \in \mathbb{R}^{D \times n}, whose columns are assumed to lie near a union of kk subspaces {Si}i=1k\{S_i\}_{i=1}^k, into groups corresponding to their generating subspaces. The self-expressiveness property, which underpins classical spectral subspace clustering methods such as Sparse Subspace Clustering (SSC) and Low-Rank Representation (LRR), posits that each data point can be linearly reconstructed from other points in its subspace. This is modeled as: X=XC+E,X = X C + E, where C∈Rn×nC \in \mathbb{R}^{n \times n} is the coefficient (affinity) matrix and EE accounts for reconstruction errors. Minimizing the self-expressiveness loss, either with sparsity (ℓ1\ell_1) or Frobenius norm regularization (ℓ2\ell_2), yields the affinity nn0 used in spectral clustering. However, for large nn1, the memory and computational cost of forming and eigendecomposing nn2 becomes prohibitive (nn3 storage, nn4 SVD), severely limiting the scalability of these approaches (Ji et al., 2017, Mrabah et al., 24 Dec 2025).

2. Deep Subspace Clustering Networks and Bottleneck Analysis

Deep subspace clustering networks (e.g., DSC-Net) (Ji et al., 2017) augment the self-expressiveness principle with deep encoding:

  • An encoder nn5 maps input data to latent codes nn6.
  • A self-expressive layer parameterized by nn7 reconstructs the latent codes, nn8.
  • A decoder reconstructs the input from self-expressive codes.

The joint objective optimizes for code reconstruction, input reconstruction, and regularization of nn9. In practice, these architectures require X=[x1,...,xn]∈RD×nX = [x_1, ..., x_n] \in \mathbb{R}^{D \times n}0 parameters and memory for X=[x1,...,xn]∈RD×nX = [x_1, ..., x_n] \in \mathbb{R}^{D \times n}1 and demand full-batch training. This bottleneck restricts deep subspace clustering to small datasets and precludes mini-batch optimization, hindering architectural depth and scalability. The necessity of loading the full dataset for C updates and spectral clustering further limits applicability.

3. SDSNet: Scalable Deep Subspace Clustering Methodologies

SDSNet approaches (Zhang et al., 2018, Mrabah et al., 24 Dec 2025) overcome the limitations of classical and early deep subspace clustering. Two major algorithmic families have been developed:

3.1 Low-Rank Landmark-Based Factorization

SDSNet (Mrabah et al., 24 Dec 2025) introduces scalable self-expression via a landmark-based factorization. Given encoder output X=[x1,...,xn]∈RD×nX = [x_1, ..., x_n] \in \mathbb{R}^{D \times n}2, select X=[x1,...,xn]∈RD×nX = [x_1, ..., x_n] \in \mathbb{R}^{D \times n}3 landmark codes X=[x1,...,xn]∈RD×nX = [x_1, ..., x_n] \in \mathbb{R}^{D \times n}4 and factor the affinity as X=[x1,...,xn]∈RD×nX = [x_1, ..., x_n] \in \mathbb{R}^{D \times n}5, with X=[x1,...,xn]∈RD×nX = [x_1, ..., x_n] \in \mathbb{R}^{D \times n}6. The self-expression loss is formulated as: X=[x1,...,xn]∈RD×nX = [x_1, ..., x_n] \in \mathbb{R}^{D \times n}7 where X=[x1,...,xn]∈RD×nX = [x_1, ..., x_n] \in \mathbb{R}^{D \times n}8 is the decoder reconstruction. This yields X=[x1,...,xn]∈RD×nX = [x_1, ..., x_n] \in \mathbb{R}^{D \times n}9 storage and kk0-per-epoch scaling for learning kk1, kk2, encoder, and decoder. Spectral clustering is performed in the kk3-dimensional factor space, circumventing the kk4 affinity bottleneck.

3.2 Explicit k-Subspace Assignment in Latent Space

SDSNet (Zhang et al., 2018) jointly learns a non-linear embedding and kk5 subspace bases kk6 in latent space, enforcing that each embedded code kk7 is well represented by its assigned subspace: kk8 Assignments kk9 and subspaces {Si}i=1k\{S_i\}_{i=1}^k0 are updated alternately with the encoder/decoder, using either closed-form SVD per cluster or Grassmannian manifold gradient updates. The memory cost is {Si}i=1k\{S_i\}_{i=1}^k1 and computation is amenable to mini-batch optimization.

4. Architecture, Training Algorithms, and Losses

SDSNet variants inherit the encoder-decoder paradigm of prior deep clustering approaches but replace dense self-expressive layers with scalable modules.

  • Encoder: Typical SDSNet encoders stack convolutional layers, e.g., for {Si}i=1k\{S_i\}_{i=1}^k2 images, layers such as 20 filters of {Si}i=1k\{S_i\}_{i=1}^k3 (stride 2), followed by 10 and 5 filters of {Si}i=1k\{S_i\}_{i=1}^k4 (stride 2), outputting {Si}i=1k\{S_i\}_{i=1}^k5-dimensional latent codes.
  • Self-Expression/Assignment Module: Either (1) a factorization {Si}i=1k\{S_i\}_{i=1}^k6 with alternating SVD updates for {Si}i=1k\{S_i\}_{i=1}^k7 and {Si}i=1k\{S_i\}_{i=1}^k8 (Mrabah et al., 24 Dec 2025); or (2) explicit nearest-subspace assignments with optimization over {Si}i=1k\{S_i\}_{i=1}^k9 and X=XC+E,X = X C + E,0 (Zhang et al., 2018).
  • Decoder: Symmetric to the encoder, reconstructing X=XC+E,X = X C + E,1 from latent codes.
  • Overall Loss: Weighted sum of reconstruction loss and subspace/self-expression loss; example:

X=XC+E,X = X C + E,2

Training Algorithms include pre-training the auto-encoder, mini-batch alternating updates for embedding/SVDs, and periodic subspace or factor updates. Outlier rejection (dropping the 10% of points with highest projection residuals) is employed before basis updates. Standard optimizers, e.g., Adam with X=XC+E,X = X C + E,3 learning rate, are used.

5. Complexity, Scalability, and Empirical Evaluation

SDSNet architectures dramatically reduce both memory and computational demands compared to non-scalable DSC methods.

  • Complexity:
    • Landmark-based SDSNet: X=XC+E,X = X C + E,4 memory, X=XC+E,X = X C + E,5 computation per epoch when X=XC+E,X = X C + E,6 fixed (Mrabah et al., 24 Dec 2025). Spectral clustering in X=XC+E,X = X C + E,7 rather than X=XC+E,X = X C + E,8.
    • k-Subspace SDSNet: X=XC+E,X = X C + E,9 memory for codes, C∈Rn×nC \in \mathbb{R}^{n \times n}0 for subspaces (Zhang et al., 2018).
  • Empirical Benchmarks:
    • On MNIST (C∈Rn×nC \in \mathbb{R}^{n \times n}1), SDSNet achieves ACC ≈ 87.1%, NMI ≈ 78.1%, ARI ≈ 75.8% in <8 minutes on a single GPU, with C∈Rn×nC \in \mathbb{R}^{n \times n}2GB memory. In comparison, classic SSC/LRR require C∈Rn×nC \in \mathbb{R}^{n \times n}3GB memory for affinity matrices even before clustering (Zhang et al., 2018).
    • On Fashion-MNIST, ACC ≈ 63.8%, NMI ≈ 62.0%, training in under 10 minutes.
    • Landmark-based SDSNet matches or outperforms scalable baselines (e.g., SSC-OMP, A-DSSC) by 5–35% in accuracy and NMI; versus non-scalable DSC methods, it is within 1–4% in accuracy despite linear-time complexity (Mrabah et al., 24 Dec 2025).
  • Practical Notes: Pre-training convergence in ≈200 epochs; Grassmann manifold gradient updates can stabilize subspace learning; cluster initialization with k-means over latent codes accelerates convergence.

6. Spectral Clustering via Factorized Affinities

Rather than building the full C∈Rn×nC \in \mathbb{R}^{n \times n}4 affinity, SDSNet leverages the factor C∈Rn×nC \in \mathbb{R}^{n \times n}5.

  • The graph Laplacian C∈Rn×nC \in \mathbb{R}^{n \times n}6 (with C∈Rn×nC \in \mathbb{R}^{n \times n}7 diagonal).
  • The eigenproblem is reduced: if C∈Rn×nC \in \mathbb{R}^{n \times n}8, leading eigenvectors of C∈Rn×nC \in \mathbb{R}^{n \times n}9 lie in the range of EE0. Solving the EE1 eigenproblem EE2 produces the embeddings EE3 for k-means clustering (Mrabah et al., 24 Dec 2025).
  • This eigen-decomposition costs EE4, compared to EE5 for dense methods.

7. Theoretical and Practical Implications

SDSNet has advanced subspace clustering by:

  • Eliminating both affinity matrix bottlenecks and full-batch constraints, enabling deep architectures (e.g., deeper CNNs, large-scale self-supervised learning) for unsupervised clustering (Zhang et al., 2018, Mrabah et al., 24 Dec 2025).
  • Showing convergence of learned affinities to block-diagonal structure after a few hundred epochs, matching clustering quality of traditional spectral methods with much reduced computational demands.
  • Maintaining theoretical properties of self-expressive clustering (block-structured affinity, spectral guarantees) while being practical for tens or hundreds of thousands of samples.

A plausible implication is that SDSNet has paved the way for extensions with richer neural architectures, hybrid self-supervised objectives, and further innovations in scalable clustering. Careful initialization, mini-batch learning, and principled outlier rejection are critical for convergence and accuracy. The landmark-based and k-subspace assignment paradigms remain the foundation for future research in scalable deep subspace clustering.

Key References:

Definition Search Book Streamline Icon: https://streamlinehq.com
References (3)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Deep Scalable Subspace Clustering (SDSNet).