CARTE: Pretraining and Transfer for Tabular Learning (2402.16785v2)

Published 26 Feb 2024 in cs.LG

Abstract: Pretrained deep-learning models are the go-to solution for images or text. However, for tabular data the standard is still to train tree-based models. Indeed, transfer learning on tables runs into the challenge of data integration: finding correspondences in the entries (entity matching), where different words may denote the same entity, and correspondences across columns (schema matching), which may come in different orders or under different names. We propose a neural architecture that does not need such correspondences. As a result, we can pretrain it on background data that has not been matched. The architecture -- CARTE, for Context Aware Representation of Table Entries -- uses a graph representation of tabular (or relational) data to process tables with different columns, string embeddings of entries and column names to model an open vocabulary, and a graph-attentional network to contextualize entries with column names and neighboring entries. An extensive benchmark shows that CARTE facilitates learning, outperforming a solid set of baselines including the best tree-based models. CARTE also enables joint learning across tables with unmatched columns, enhancing a small table with bigger ones. CARTE opens the door to large pretrained models for tabular data.
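To make the graph representation described in the abstract concrete, below is a minimal sketch of how one table row could be turned into a small "graphlet": a center node connected to one node per cell, with cell values embedded as node features and column names embedded as edge features. The helper names (`embed_string`, `row_to_graphlet`), the hash-seeded embedder, and the mean-initialized center node are illustrative placeholders and design choices of this sketch, not the authors' implementation; CARTE itself uses pretrained language-model string embeddings and processes such graphlets with a graph-attentional network.

```python
import hashlib
import numpy as np

def embed_string(text: str, dim: int = 32) -> np.ndarray:
    """Placeholder string embedder: a hash-seeded random vector.
    Stands in for the pretrained fastText-style embeddings CARTE uses."""
    seed = int.from_bytes(hashlib.md5(text.encode()).digest()[:4], "little")
    return np.random.default_rng(seed).standard_normal(dim)

def row_to_graphlet(row: dict, dim: int = 32):
    """Turn one table row into a star-shaped graphlet.

    Each cell becomes a node whose feature is the embedding of its value
    (numeric cells scale the column-name embedding instead), and the edge
    to the center node carries the embedding of the column name."""
    node_feats, edge_feats = [], []
    for col, value in row.items():
        col_emb = embed_string(col, dim)
        if isinstance(value, (int, float)):
            node_feats.append(float(value) * col_emb)        # numeric entry
        else:
            node_feats.append(embed_string(str(value), dim))  # string entry
        edge_feats.append(col_emb)
    # Center node (index 0) initialized as the mean of the cell nodes.
    x = np.vstack([np.mean(node_feats, axis=0)] + node_feats)
    edges = np.array([[0, i + 1] for i in range(len(node_feats))])
    return x, edges, np.vstack(edge_feats)

row = {"name": "Château Margaux", "region": "Bordeaux", "vintage": 2015}
x, edges, e = row_to_graphlet(row)
print(x.shape, edges.shape, e.shape)  # (4, 32) (3, 2) (3, 32)
```

Because each graphlet carries its own column-name embeddings on the edges, two tables with different schemas land in the same representation space, which is what allows pretraining and joint learning without entity or schema matching.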

Authors (3)
  1. Myung Jun Kim (3 papers)
  2. Léo Grinsztajn (7 papers)
  3. Gaël Varoquaux (87 papers)
Citations (5)
