
A deep matrix factorization method for learning attribute representations (1509.03248v1)

Published 10 Sep 2015 in cs.CV, cs.LG, and stat.ML

Abstract: Semi-Non-negative Matrix Factorization is a technique that learns a low-dimensional representation of a dataset that lends itself to a clustering interpretation. It is possible that the mapping between this new representation and our original data matrix contains rather complex hierarchical information with implicit lower-level hidden attributes that classical one-level clustering methodologies cannot interpret. In this work we propose a novel model, Deep Semi-NMF, that is able to learn such hidden representations, which lend themselves to a clustering interpretation according to different, unknown attributes of a given dataset. We also present a semi-supervised version of the algorithm, named Deep WSF, that allows the use of (partial) prior information for each of the known attributes of a dataset, enabling the model to be used on datasets with mixed attribute knowledge. Finally, we show that our models are able to learn low-dimensional representations that are better suited not only for clustering but also for classification, outperforming Semi-Non-negative Matrix Factorization as well as other state-of-the-art variants.

Citations (282)

Summary

  • The paper introduces a deep framework for Semi-NMF that builds hierarchical, nonlinear representations to better capture data attributes.
  • It employs a greedy layer-wise training strategy and weakly-supervised graph regularization to refine feature learning in mixed datasets.
  • Experimental results on facial image datasets show superior clustering accuracy compared to traditional NMF and related methods.

An Expert Review of "A Deep Matrix Factorization Method for Learning Attribute Representations"

The paper presents a novel approach to matrix factorization through the introduction of Deep Semi-Non-negative Matrix Factorization (Deep Semi-NMF), which enhances traditional Semi-NMF by incorporating deep-learning architectures for hierarchical data representation. This is particularly relevant for clustering datasets with multi-attribute characteristics, such as facial image collections.

Main Contributions and Methodology

The authors' primary contribution is the design of a deep framework for Semi-NMF that facilitates the extraction of representations suitable for clustering according to the latent attributes of data. Unlike conventional Semi-NMF, which is limited to a single-layer representation, the Deep Semi-NMF method constructs a hierarchy of nonlinear representations across multiple layers. Each layer captures different levels of abstraction in the data, thereby improving clustering efficacy.
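Concretely, the single-layer building block factorizes the data matrix as X ≈ ZH, with Z unconstrained (mixed-sign) and H nonnegative; the deep model stacks such factorizations so that X ≈ Z1 Z2 … Zm Hm. A minimal numpy sketch of the building block, assuming the standard Semi-NMF multiplicative updates of Ding et al. rather than the authors' released code:

```python
import numpy as np

def semi_nmf(X, k, n_iter=200, seed=0):
    """One Semi-NMF factorization X ~= Z @ H, with Z unconstrained and H >= 0.

    A sketch using the multiplicative updates of Ding et al. (2010);
    not the authors' exact implementation.
    """
    rng = np.random.default_rng(seed)
    d, n = X.shape
    H = np.abs(rng.standard_normal((k, n)))       # nonnegative feature matrix
    for _ in range(n_iter):
        # Z has a closed-form least-squares solution given H.
        Z = X @ H.T @ np.linalg.pinv(H @ H.T)
        # Split into positive / negative parts for the H update.
        A = Z.T @ X
        B = Z.T @ Z
        Ap, An = (np.abs(A) + A) / 2, (np.abs(A) - A) / 2
        Bp, Bn = (np.abs(B) + B) / 2, (np.abs(B) - B) / 2
        H *= np.sqrt((Ap + Bn @ H) / (An + Bp @ H + 1e-12))
    return Z, H
```

Stacking m such factorizations, with each intermediate H feeding the next layer, yields the hierarchy of representations described above.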

A notable aspect is the utilization of a greedy layer-wise training algorithm, similar to techniques in deep learning, which pretrains each layer's factors to form a basis for fine-tuning the entire network. This approach is crucial for efficiently learning complex hierarchical latent structures while maintaining computational feasibility.
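The greedy pass can be sketched as follows, where `factorize` stands in for any Semi-NMF routine (the function names and interface are illustrative, and the joint fine-tuning stage that follows pretraining in the paper is omitted):

```python
import numpy as np

def pretrain_deep_semi_nmf(X, layer_sizes, factorize):
    """Greedy layer-wise pretraining: X ~= Z1 H1, then H1 ~= Z2 H2, and so on.

    `factorize(M, k)` is any Semi-NMF routine returning (Z, H); the stacked
    factors initialise the fine-tuning of the whole network (not shown).
    """
    Zs, H = [], X
    for k in layer_sizes:
        Z, H = factorize(H, k)   # each H becomes the input to the next layer
        Zs.append(Z)
    return Zs, H                 # overall: X ~= Z1 @ Z2 @ ... @ Zm @ Hm
```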

Moreover, the inclusion of a Weakly-Supervised Deep Semi-NMF (Deep WSF) variant allows for the incorporation of partial attribute labels, refining the representation of mixed attribute datasets. By employing graph regularization techniques, Deep WSF leverages partial supervision to enhance feature learning, yielding superior outcomes in cases with known attribute information, such as pose and expression in facial images.
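One simplified way to realize this weak supervision is a graph-Laplacian penalty tr(H L Hᵀ) built from whichever attribute labels are known, with unlabeled samples contributing no edges. The following is an illustrative sketch of the idea, not the paper's exact objective:

```python
import numpy as np

def graph_laplacian_penalty(H, labels):
    """Graph-regularisation term tr(H L H^T), in the spirit of Deep WSF.

    Samples sharing a known attribute label (e.g. the same pose) are
    connected in the affinity graph; unlabeled samples (label -1) add no
    edges. The penalty is small when same-label columns of H agree.
    """
    labels = np.asarray(labels)
    n = labels.size
    W = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            if i != j and labels[i] != -1 and labels[i] == labels[j]:
                W[i, j] = 1.0
    L = np.diag(W.sum(axis=1)) - W       # unnormalised graph Laplacian
    # tr(H L H^T) = 0.5 * sum_ij W_ij * ||h_i - h_j||^2
    return np.trace(H @ L @ H.T)
```

Adding this term to the reconstruction loss pulls the representations of samples with a shared known attribute toward each other while leaving unlabeled samples unconstrained.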

Experimental Evaluation

The paper's empirical analysis provides a robust evaluation of Deep Semi-NMF's performance compared to existing NMF variants. Using databases like CMU PIE and XM2VTS, the paper assesses clustering accuracy and reconstruction error, revealing that Deep Semi-NMF consistently delivers high clustering performance across varied numbers of components and layers. Notably, this method outperforms established Semi-NMF, Graph-regularized NMF (GNMF), and multi-layer NMF approaches, emphasizing its effectiveness in handling datasets with complex, multi-modal distributions.
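Clustering accuracy in such comparisons is conventionally computed by running k-means on the learned features and then scoring the best one-to-one mapping between cluster ids and ground-truth labels. A brute-force version of that metric (the helper name is mine; adequate for the small numbers of clusters involved):

```python
import numpy as np
from itertools import permutations

def clustering_accuracy(true_labels, cluster_ids):
    """Best-mapping clustering accuracy, as commonly reported in NMF papers.

    Assumes the number of distinct clusters equals the number of distinct
    labels; searches all label permutations, so use only for small counts
    (large problems would use the Hungarian algorithm instead).
    """
    true_labels = np.asarray(true_labels)
    cluster_ids = np.asarray(cluster_ids)
    clusters = np.unique(cluster_ids)
    best = 0.0
    for perm in permutations(np.unique(true_labels)):
        mapping = dict(zip(clusters, perm))
        acc = np.mean([mapping[c] == t for c, t in zip(cluster_ids, true_labels)])
        best = max(best, acc)
    return best
```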

Additionally, using Image Gradient Orientations (IGO) instead of pixel intensities demonstrates Deep Semi-NMF's applicability to datasets with mixed-sign features, further showcasing the model's flexibility and robustness.
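The IGO transform replaces intensities with the cosine and sine of each pixel's gradient orientation, producing features in [-1, 1]; because these are mixed-sign, standard NMF cannot be applied to them, whereas Semi-NMF can. A minimal sketch, assuming simple finite-difference gradients:

```python
import numpy as np

def igo_features(image):
    """Image Gradient Orientations: replace pixel intensities with
    [cos(phi); sin(phi)] of the gradient angle phi at each pixel.

    The result is mixed-sign, so it suits Semi-NMF rather than NMF.
    A sketch using numpy's central-difference gradients.
    """
    gy, gx = np.gradient(image.astype(float))   # gradients along rows, cols
    phi = np.arctan2(gy, gx)                    # orientation at each pixel
    return np.concatenate([np.cos(phi).ravel(), np.sin(phi).ravel()])
```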

Theoretical and Practical Implications

The theoretical implications of this research are significant as it bridges the gap between traditional matrix factorization and modern deep learning techniques. By introducing hierarchical factorization, the method aligns closely with the cognitive models of hierarchical data processing, akin to the neural processing in the human visual system. Practically, the ability to derive meaningful data representations enhances not only clustering tasks but also potentially benefits classification and feature extraction challenges in varied domains such as biometric identification and document analysis.

Future Directions

Moving forward, potential directions include nonlinear Deep Semi-NMF extensions with more complex activation functions between layers, enabling a broader spectrum of applications. Furthermore, adapting the framework to multi-linear data structures, such as multi-dimensional tensors, could unlock its potential in areas requiring higher-order data representations.

Overall, the Deep Semi-NMF and its supervised variant, Deep WSF, signify a substantial advancement in matrix factorization techniques, providing a compelling approach for discovering hierarchical attribute representations in complex datasets. This work paves the way for more sophisticated applications of semi-supervised learning in machine learning and artificial intelligence.