Capture the Flag: Uncovering Data Insights with Large Language Models (2312.13876v1)
Abstract: Extracting a small number of relevant insights from vast amounts of data is a crucial component of data-driven decision-making. However, accomplishing this task requires considerable technical skill, domain expertise, and human labor. This study explores the potential of using large language models (LLMs) to automate the discovery of insights in data, leveraging recent advances in reasoning and code-generation techniques. We propose a new evaluation methodology based on a "capture the flag" principle, which measures the ability of such models to recognize meaningful and pertinent information (flags) in a dataset. We further propose two proof-of-concept agents with different inner workings, and compare their ability to capture such flags in a real-world sales dataset. While the work reported here is preliminary, our results are sufficiently interesting to warrant future exploration by the community.
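The "capture the flag" evaluation described above can be sketched in a few lines: given a set of ground-truth insights (flags) and an agent's free-text findings, count the fraction of flags the agent captured. The names below (`Flag`, `score_flags`) and the keyword-matching criterion are illustrative assumptions, not the paper's actual scoring protocol, which could equally rely on human or LLM judging.

```python
from dataclasses import dataclass


@dataclass
class Flag:
    """A ground-truth insight planted in (or known about) the dataset."""
    description: str
    keywords: list  # terms a finding must mention to count as a capture


def score_flags(findings, flags):
    """Return the fraction of flags captured by the agent's findings.

    A flag counts as captured if any single finding mentions all of its
    keywords (a crude textual proxy for recognizing the insight).
    """
    captured = 0
    for flag in flags:
        for finding in findings:
            text = finding.lower()
            if all(kw.lower() in text for kw in flag.keywords):
                captured += 1
                break
    return captured / len(flags) if flags else 0.0


# Hypothetical flags for a sales dataset, and one agent finding.
flags = [
    Flag("Sales dropped sharply in Q3 in the West region", ["west", "q3"]),
    Flag("One retailer dominates online sales", ["retailer", "online"]),
]
findings = ["We observe a sharp Q3 decline in the West region."]
print(score_flags(findings, flags))  # 0.5 — one of two flags captured
```

In practice one would want a more robust capture criterion than keyword matching, but the metric itself stays the same: captured flags over total flags.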
Authors: Issam Laradji, Perouz Taslakian, Sai Rajeswar, Valentina Zantedeschi, Alexandre Lacoste, Nicolas Chapados, David Vazquez, Christopher Pal, Alexandre Drouin