Papers
Topics
Authors
Recent
Search
2000 character limit reached

Product of Projections in Multi-View Subspace Clustering

Updated 2 April 2026
  • Product of projections in multi-view subspace clustering is a technique that replaces full affinity matrices with low-rank, landmark-based approximations to capture intrinsic subspace structures.
  • SDSNet leverages encoder-decoder architectures and scalable self-expressiveness modules to efficiently process high-dimensional data while preserving subspace integrity.
  • Empirical results show that these projection methods significantly reduce computational complexity and memory requirements, enabling rapid clustering of tens of thousands of samples.

Deep Scalable Subspace Clustering (SDSNet) refers to a class of deep learning frameworks that address the scalability bottlenecks inherent in traditional and deep subspace clustering by replacing full affinity or self-expressive matrices with scalable, often low-rank, approximations within end-to-end architectures. Unlike classical approaches that require full-batch optimization of n×nn \times n affinity graphs for nn samples, SDSNet designs leverage mini-batch training, factorization schemes, and landmark-based approximations, enabling efficient clustering of tens of thousands of high-dimensional samples while retaining fidelity to subspace structure (Zhang et al., 2018, Mrabah et al., 24 Dec 2025).

1. Problem Formulation and Motivation

The objective of subspace clustering is to partition a dataset X∈RD×nX \in \mathbb{R}^{D \times n}, whose nn columns are presumed to lie close to the union of kk (possibly non-linear) subspaces {Si}i=1k\{\mathcal{S}_i\}_{i=1}^k, into groups corresponding to those subspaces. Classical self-expressiveness models, notably Sparse Subspace Clustering (SSC) and Low-Rank Representation (LRR), express each data point as a linear combination of the others:

X=XC+E,X = X C + E,

with C∈Rn×nC \in \mathbb{R}^{n \times n} as the coefficient (affinity) matrix and EE as the residual. Deep Subspace Clustering Networks (DSC-Net) (Ji et al., 2017) extend this model into a non-linear latent space via an auto-encoder, then employ a self-expressive layer with CC as the learnable nn0 matrix.

However, these approaches entail nn1 memory (for nn2) and nn3 or higher computational costs (for spectral clustering on nn4), making them impractical for large nn5 (Ji et al., 2017, Zhang et al., 2018, Mrabah et al., 24 Dec 2025). SDSNet was developed to eliminate this scaling bottleneck while preserving or improving clustering accuracy.

2. Architectural Innovations

SDSNet architectures share three critical design choices:

  1. Encoder-decoder backbone: A convolutional auto-encoder parametrizes the mapping nn6 and nn7. The encoder produces a latent embedding nn8, with nn9.
  2. Scalable self-expressiveness module:
    • In (Zhang et al., 2018), each point X∈RD×nX \in \mathbb{R}^{D \times n}0 is assigned to one of X∈RD×nX \in \mathbb{R}^{D \times n}1 learnable subspaces X∈RD×nX \in \mathbb{R}^{D \times n}2 (with X∈RD×nX \in \mathbb{R}^{D \times n}3), avoiding the need for a global X∈RD×nX \in \mathbb{R}^{D \times n}4 affinity.
    • In (Mrabah et al., 24 Dec 2025), the coefficient matrix X∈RD×nX \in \mathbb{R}^{D \times n}5 is replaced by a low-rank factorization X∈RD×nX \in \mathbb{R}^{D \times n}6 with X∈RD×nX \in \mathbb{R}^{D \times n}7 and X∈RD×nX \in \mathbb{R}^{D \times n}8. Instead of X∈RD×nX \in \mathbb{R}^{D \times n}9, SDSNet enforces nn0 for a selected landmark matrix nn1.
  3. Efficient clustering head: Spectral clustering is performed in the low-dimensional space spanned by the factor nn2 or subspace assignment matrix nn3, with all operations costing nn4 when nn5 are fixed.

A comparison of key self-expressive strategies appears below:

Model Self-Expressive Representation Parameter Count
DSC-Net nn6 nn7
SDSNet (2018) nn8, nn9 kk0
SDSNet (2025) kk1, kk2 kk3

3. Optimization Objectives and Training Algorithms

SDSNet jointly optimizes auto-encoder reconstruction and subspace-preserving self-expressiveness losses:

kk4

kk5

kk6

Alternating minimization updates the assignment kk7, subspaces kk8 (via SVD or Grassmannian gradient), and network parameters by minibatch stochastic gradient descent (Adam).

kk9

Optimizing over {Si}i=1k\{\mathcal{S}_i\}_{i=1}^k0 and {Si}i=1k\{\mathcal{S}_i\}_{i=1}^k1 is achieved via Procrustes SVD or least squares, alternating with back-propagation for encoder/decoder parameters. Affinity {Si}i=1k\{\mathcal{S}_i\}_{i=1}^k2 is block-diagonalized by optimizing representations during training.

4. Landmark-Based and Subspace-Assignment Approximations

A defining feature of SDSNet (Mrabah et al., 24 Dec 2025) is its replacement of the full self-expressiveness matrix {Si}i=1k\{\mathcal{S}_i\}_{i=1}^k3 with a landmark-based factorization. {Si}i=1k\{\mathcal{S}_i\}_{i=1}^k4 anchors are selected via {Si}i=1k\{\mathcal{S}_i\}_{i=1}^k5-means++ or random sampling in latent space, leading to {Si}i=1k\{\mathcal{S}_i\}_{i=1}^k6. The resulting {Si}i=1k\{\mathcal{S}_i\}_{i=1}^k7 encodes how each data point relates to the landmarks, with {Si}i=1k\{\mathcal{S}_i\}_{i=1}^k8. All affinity computations and subsequent spectral clustering occur in the {Si}i=1k\{\mathcal{S}_i\}_{i=1}^k9-dimensional anchor space, yielding X=XC+E,X = X C + E,0 computation and X=XC+E,X = X C + E,1 memory.

In (Zhang et al., 2018), explicit assignment variables X=XC+E,X = X C + E,2 simplify clustering to an assignment to one of X=XC+E,X = X C + E,3 subspaces per point; subspaces are updated via SVD of assigned embeddings.

5. Computational Complexity and Scalability

SDSNet substantially alleviates the cubic and quadratic scaling issues of prior methods:

  • Classical affinity-based deep subspace clustering: X=XC+E,X = X C + E,4 memory and X=XC+E,X = X C + E,5 time (building, storing, and decomposing X=XC+E,X = X C + E,6) (Ji et al., 2017).
  • SDSNet (Zhang et al., 2018): Each epoch comprises encoding X=XC+E,X = X C + E,7 samples (X=XC+E,X = X C + E,8), assignments (X=XC+E,X = X C + E,9), and SVD/Grassmann updates (C∈Rn×nC \in \mathbb{R}^{n \times n}0), all linear when C∈Rn×nC \in \mathbb{R}^{n \times n}1 and C∈Rn×nC \in \mathbb{R}^{n \times n}2 are constant.
  • SDSNet (Mrabah et al., 24 Dec 2025): Encoder/decoder updates (C∈Rn×nC \in \mathbb{R}^{n \times n}3), C∈Rn×nC \in \mathbb{R}^{n \times n}4/C∈Rn×nC \in \mathbb{R}^{n \times n}5 updates (C∈Rn×nC \in \mathbb{R}^{n \times n}6), and final spectral clustering (C∈Rn×nC \in \mathbb{R}^{n \times n}7).

Empirically, SDSNet clusters 70,000 MNIST images in under 8 minutes using C∈Rn×nC \in \mathbb{R}^{n \times n}82 GB GPU memory; equivalent affinity-based approaches require C∈Rn×nC \in \mathbb{R}^{n \times n}937 GB just for storing the affinity matrix (Zhang et al., 2018). On synthetic datasets with EE0 up to 10,000, computational time is observed to grow linearly (Mrabah et al., 24 Dec 2025).

6. Empirical Performance and Benchmarking

Extensive experiments demonstrate that SDSNet maintains or surpasses the clustering accuracy of both affinity-based and scalable state-of-the-art methods. On Fashion-MNIST (60,000 samples), SDSNet achieves ACC EE1 63.8% and NMI EE2 62.0% in under 10 minutes (Zhang et al., 2018). On YaleB, ORL, Coil100, UMIST, and Fashion-MNIST, (Mrabah et al., 24 Dec 2025) reports that SDSNet matches or exceeds alternatives (SSC-OMP, EnSC, SGL, SEE3C, LMVSC, SSSC, A-DSSC) by margins of 5–35% in ACC and NMI, with subspace-preserving error (SPE) comparable to dense affinity approaches.

Convergence is achieved rapidly, with the relative change in EE4 or EE5 dropping below EE6 in fewer than 10 outer iterations (Mrabah et al., 24 Dec 2025); affinity matrices display clear block-diagonal structure early in training.

7. Implementation Strategies and Hyperparameters

Robust performance requires careful tuning and implementation choices:

  • Hyperparameters: Number of subspaces EE7 is set to the true number of clusters (e.g., EE8 for MNIST), embedding dimension EE9 typically 20, subspace dimension CC0 in [7, 11], anchor number CC1 (proportional to CC2), loss weightings CC3, batch size CC4, optimizer Adam (CC5 lr), AE pre-training CC6 epochs, joint training CC7–CC8 epochs (Zhang et al., 2018).
  • Subspace updates: Either top-CC9 SVD on assignments (discarding 10% outliers) or Riemannian updates (Grassmann retraction).
  • Landmark selection: nn00-means++ improves anchor robustness over random, especially as nn01 approaches nn02.
  • Initialization and batch scheduling: Pre-training the auto-encoder facilitates stable convergence; initializing subspaces/anchors via nn03-means or shallow DSC improves accuracy and speed.

SDSNet’s design establishes a paradigm shift for deep subspace clustering, yielding highly scalable, accurate solutions across large-scale, high-dimensional datasets by circumventing the full affinity bottleneck and optimizing computational paths through architectural and algorithmic innovations (Zhang et al., 2018, Mrabah et al., 24 Dec 2025).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (3)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Product of Projections in Multi-View Subspace.