Deep Neural Networks and Tabular Data: A Survey (2110.01889v3)

Published 5 Oct 2021 in cs.LG

Abstract: Heterogeneous tabular data are the most commonly used form of data and are essential for numerous critical and computationally demanding applications. On homogeneous data sets, deep neural networks have repeatedly shown excellent performance and have therefore been widely adopted. However, their adaptation to tabular data for inference or data generation tasks remains challenging. To facilitate further progress in the field, this work provides an overview of state-of-the-art deep learning methods for tabular data. We categorize these methods into three groups: data transformations, specialized architectures, and regularization models. For each of these groups, our work offers a comprehensive overview of the main approaches. Moreover, we discuss deep learning approaches for generating tabular data, and we also provide an overview over strategies for explaining deep models on tabular data. Thus, our first contribution is to address the main research streams and existing methodologies in the mentioned areas, while highlighting relevant challenges and open research questions. Our second contribution is to provide an empirical comparison of traditional machine learning methods with eleven deep learning approaches across five popular real-world tabular data sets of different sizes and with different learning objectives. Our results, which we have made publicly available as competitive benchmarks, indicate that algorithms based on gradient-boosted tree ensembles still mostly outperform deep learning models on supervised learning tasks, suggesting that the research progress on competitive deep learning models for tabular data is stagnating. To the best of our knowledge, this is the first in-depth overview of deep learning approaches for tabular data; as such, this work can serve as a valuable starting point to guide researchers and practitioners interested in deep learning with tabular data.

Authors (6)

Vadim Borisov (9 papers)
Tobias Leemann (14 papers)
Kathrin Seßler (7 papers)
Johannes Haug (8 papers)
Martin Pawelczyk (21 papers)
Gjergji Kasneci (69 papers)

Citations (528)

View on Semantic Scholar

Summary

Overview of "A Sample Article Using IEEEtran.cls for IEEE Journals and Transactions"

This paper provides a comprehensive guide on utilizing the IEEEtran class in LaTeX for formatting documents intended for submission to IEEE journals and transactions. The emphasis is on detailing common article elements and ensuring proper typesetting for a range of publication types offered by the IEEE, including journals, conference papers, and technical notes.

Key Contributions

The core contribution of this document is its detailed description of the IEEEtran class features. It articulates the design intent of the templates, emphasizing their utility in approximating the final appearance of published articles. The paper underscores that these templates are not the final product but serve as a preview to estimate page length and format for submission to IEEE's publication systems.

Design Intent and Limitations

A significant section of the document discusses the limitations of the templates. These templates are designed to produce outputs that mirror the final look of IEEE publications, but are not intended for the final printed versions. The document highlights the process of conversion to XML for further processing, which underpins the transition to both final print and HTML versions for IEEE Xplore.

LaTeX Support and Styling

The paper provides pointers to various LaTeX user groups and resources that can assist both novice and experienced users. This section serves as a valuable reference point for users seeking community support or additional guides on LaTeX usage.

Common Article Elements

An in-depth discussion of common article components, such as section hierarchies, citation management, lists, figures, tables, algorithms, and mathematical typography, is provided. The paper offers precise coding examples for each element, ensuring users can replicate the formatting with accuracy. This serves as a crucial resource for researchers aiming to maintain consistency with IEEE’s publication standards.

Practical Implications and Recommendations

For researchers aiming to publish with the IEEE, understanding and implementing the correct document formatting is essential. This guidance not only aids in aligning submissions with IEEE standards but also streamlines the publication process, potentially reducing back-and-forth revisions due to formatting issues.

Speculative Future Directions

Given the ongoing evolution of document preparation systems, one might speculate that future versions of these guidelines could incorporate more advanced features such as automated formatting checks or integration with collaborative platforms for improved submission efficiency. Additionally, as AI-powered tools become more prevalent, there might be scope for tools that provide real-time formatting feedback based on IEEE standards.

In conclusion, this paper serves as an essential guide for preparing IEEE submissions using LaTeX. It offers exhaustive coverage of formatting details and practical examples, making it an invaluable resource for authors in the IEEE publication process.

PDF Markdown

Related Papers

Find Related Papers

Tweets

https://twitter.com/gp_pulipaka/status/1777078923324432541