
An Overview on Data Representation Learning: From Traditional Feature Learning to Recent Deep Learning (1611.08331v1)

Published 25 Nov 2016 in cs.LG and stat.ML

Abstract: Since about 100 years ago, to learn the intrinsic structure of data, many representation learning approaches have been proposed, including both linear ones and nonlinear ones, supervised ones and unsupervised ones. Particularly, deep architectures are widely applied for representation learning in recent years, and have delivered top results in many tasks, such as image classification, object detection and speech recognition. In this paper, we review the development of data representation learning methods. Specifically, we investigate both traditional feature learning algorithms and state-of-the-art deep learning models. The history of data representation learning is introduced, while available resources (e.g. online course, tutorial and book information) and toolboxes are provided. Finally, we conclude this paper with remarks and some interesting research directions on data representation learning.

Citations (166)

Summary

Overview on Data Representation Learning: From Traditional Feature Learning to Recent Deep Learning

The paper "An Overview on Data Representation Learning: From Traditional Feature Learning to Recent Deep Learning" presents a comprehensive examination of the evolution of representation learning methods, with particular emphasis on both traditional feature learning techniques and the advancements in deep learning. Authored by Guoqiang Zhong, Li-Na Wang, and Junyu Dong, the review provides not only a historical perspective but also a resource for exploring the expansive terrain of data representation learning.

Data representation learning plays a pivotal role across multiple domains including artificial intelligence, bioinformatics, and finance, acting as a foundational step for tasks such as classification, retrieval, and recommendation. The paper systematically traces the trajectory of data representation learning from foundational linear methods like Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) to the nonlinear, supervised, and unsupervised methods that characterize the current deep learning landscape.

Traditional Feature Learning

The section dedicated to traditional feature learning explores shallow models aimed primarily at learning transformations of data that enhance subsequent predictive modeling. Emphasis is placed on global and local feature learning techniques: the former preserves the global structure of the data as a whole, while the latter preserves local similarities or manifold structure, as in Locally Linear Embedding (LLE).

Global learning methods include notable techniques such as PCA, kernel PCA, and variants like sparse PCA, each adapting the foundational algorithm to capture specific data characteristics. Supervised linear approaches like LDA and its kernel extension, Generalized Discriminant Analysis (GDA), integrate class information to improve separation in the reduced space. This section also reviews tensor representation learning, which processes multi-dimensional data directly, highlighting algorithms such as 2DPCA and its derivatives.
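To make the global linear approach concrete, PCA projects centered data onto the directions of maximal variance, found as the top eigenvectors of the covariance matrix. A minimal NumPy sketch (the toy data and dimensions are chosen purely for illustration):

```python
import numpy as np

def pca(X, n_components):
    """Project X onto its top principal components.

    X: (n_samples, n_features) data matrix.
    Returns the (n_samples, n_components) reduced representation.
    """
    # Center the data so the covariance is computed about the mean.
    X_centered = X - X.mean(axis=0)
    # Eigendecomposition of the covariance matrix (symmetric, so eigh).
    cov = np.cov(X_centered, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)
    # eigh returns eigenvalues in ascending order; take the largest ones.
    top = eigvecs[:, np.argsort(eigvals)[::-1][:n_components]]
    return X_centered @ top

# Toy example: 3-D points that vary mostly along one direction.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 1)) @ np.array([[3.0, 2.0, 1.0]]) \
    + 0.1 * rng.normal(size=(100, 3))
Z = pca(X, n_components=2)
print(Z.shape)  # (100, 2)
```

By construction, most of the variance lies along a single direction, so the first recovered component carries far more variance than the second.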

Manifold Learning

The manuscript then turns to manifold learning, a subdomain of locality-preserving methods that recover the intrinsic low-dimensional structure of high-dimensional data. Crucial methodologies including Isomap and LLE are explored for their role in nonlinear dimension reduction and manifold discovery. The review also presents tools and algorithms that merge ideas from manifold learning with other representation learning frameworks, showcasing their potential for feature learning and recognition tasks.
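The locality-preserving idea can be sketched with a compact, NumPy-only version of Isomap: build a k-nearest-neighbor graph, approximate geodesic distances with Floyd-Warshall shortest paths, then apply classical MDS. The toy data and neighborhood size below are assumptions for illustration, not part of the paper:

```python
import numpy as np

def isomap(X, n_neighbors, n_components):
    n = X.shape[0]
    # Pairwise Euclidean distances.
    diff = X[:, None, :] - X[None, :, :]
    D = np.sqrt((diff ** 2).sum(-1))
    # Keep only edges to the k nearest neighbors; non-edges start at infinity.
    G = np.full((n, n), np.inf)
    for i in range(n):
        nbrs = np.argsort(D[i])[1:n_neighbors + 1]
        G[i, nbrs] = D[i, nbrs]
        G[nbrs, i] = D[i, nbrs]
    np.fill_diagonal(G, 0.0)
    # Geodesic distances: Floyd-Warshall shortest paths over the graph.
    for k in range(n):
        G = np.minimum(G, G[:, k, None] + G[None, k, :])
    # Classical MDS on the squared geodesic distance matrix.
    J = np.eye(n) - np.ones((n, n)) / n
    B = -0.5 * J @ (G ** 2) @ J
    eigvals, eigvecs = np.linalg.eigh(B)
    idx = np.argsort(eigvals)[::-1][:n_components]
    return eigvecs[:, idx] * np.sqrt(np.maximum(eigvals[idx], 0.0))

# Toy manifold: a noisy circle embedded in 3-D, flattened to 2-D.
rng = np.random.default_rng(0)
t = np.linspace(0, 2 * np.pi, 60, endpoint=False)
X = np.c_[np.cos(t), np.sin(t), 0.05 * rng.normal(size=60)]
Y = isomap(X, n_neighbors=4, n_components=2)
print(Y.shape)  # (60, 2)
```

Unlike PCA, the embedding here depends only on distances along the neighborhood graph, which is what lets Isomap "unroll" curved manifolds.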

Deep Learning

Deep learning's evolution is traced to its rejuvenation through Hinton's work on deep neural networks trained by layer-wise pre-training followed by fine-tuning. The paper emphasizes significant milestones such as the introduction of AlexNet, which demonstrated the large-scale capability of CNNs, and subsequent architectures like VGGNet and ResNet, which have markedly advanced image recognition. Additionally, models such as Recurrent Neural Networks (RNNs), Long Short-Term Memory (LSTM) networks, and Generative Adversarial Networks (GANs) are surveyed, underlining their importance for sequence modeling and generative tasks.
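The core operation behind CNNs such as AlexNet is the discrete convolution (implemented as cross-correlation in most deep learning libraries). A minimal NumPy sketch, using a hypothetical horizontal-gradient kernel on a toy image:

```python
import numpy as np

def conv2d(image, kernel):
    """Valid-mode 2-D cross-correlation, the building block of a CNN layer."""
    kh, kw = kernel.shape
    out_h = image.shape[0] - kh + 1
    out_w = image.shape[1] - kw + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            # Each output pixel is a weighted sum over a local patch.
            out[i, j] = (image[i:i + kh, j:j + kw] * kernel).sum()
    return out

# A vertical edge in a toy image, detected by a horizontal-gradient kernel.
image = np.zeros((5, 5))
image[:, 3:] = 1.0
kernel = np.array([[-1.0, 1.0]])
response = conv2d(image, kernel)
print(response.shape)  # (5, 4)
```

In a real CNN the kernel weights are learned from data rather than fixed by hand, and many such filters are stacked with nonlinearities and pooling.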

The section also delineates resources and toolkits essential for engaging with deep learning, including popular frameworks such as Theano, Caffe, TensorFlow, and MXNet, each providing key functionalities for model development and deployment across varied applications.

Implications and Future Directions

The implications of the surveyed methodologies stretch across theoretical and practical realms. The trajectory of deep learning demonstrates its influence beyond AI into disciplines such as finance and bioinformatics. Looking forward, the paper suggests three pivotal research avenues: advancing fundamental theories to close gaps between application and theory, innovating novel algorithms beyond current CNN and RNN paradigms, and expanding applications to address nuanced challenges within natural language processing and computer vision.

In conclusion, while highlighting the indispensable role of deep learning, the review underscores the necessity of diverse machine learning strategies to holistically address the nuanced and multifaceted problems presented by real-world data analytics.