Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

BLADE: Benchmarking Language Model Agents for Data-Driven Science (2408.09667v2)

Published 19 Aug 2024 in cs.CL

Abstract: Data-driven scientific discovery requires the iterative integration of scientific domain knowledge, statistical expertise, and an understanding of data semantics to make nuanced analytical decisions, e.g., about which variables, transformations, and statistical models to consider. LM-based agents equipped with planning, memory, and code execution capabilities have the potential to support data-driven science. However, evaluating agents on such open-ended tasks is challenging due to multiple valid approaches, partially correct steps, and different ways to express the same decisions. To address these challenges, we present BLADE, a benchmark to automatically evaluate agents' multifaceted approaches to open-ended research questions. BLADE consists of 12 datasets and research questions drawn from existing scientific literature, with ground truth collected from independent analyses by expert data scientists and researchers. To automatically evaluate agent responses, we developed corresponding computational methods to match different representations of analyses to this ground truth. Though LLMs possess considerable world knowledge, our evaluation shows that they are often limited to basic analyses. However, agents capable of interacting with the underlying data demonstrate improved, but still non-optimal, diversity in their analytical decision making. Our work enables the evaluation of agents for data-driven science and provides researchers deeper insights into agents' analysis approaches.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (16)
  1. Ken Gu (8 papers)
  2. Ruoxi Shang (3 papers)
  3. Ruien Jiang (1 paper)
  4. Keying Kuang (2 papers)
  5. Richard-John Lin (1 paper)
  6. Donghe Lyu (1 paper)
  7. Yue Mao (18 papers)
  8. Youran Pan (1 paper)
  9. Teng Wu (23 papers)
  10. Jiaqian Yu (8 papers)
  11. Yikun Zhang (18 papers)
  12. Tianmai M. Zhang (3 papers)
  13. Lanyi Zhu (1 paper)
  14. Mike A. Merrill (9 papers)
  15. Jeffrey Heer (29 papers)
  16. Tim Althoff (64 papers)
Citations (3)

Summary

A Comprehensive Guide to Using PDFLaTeX for Document Preparation

This document offers an in-depth examination of utilizing pdf\LaTeX{} as the principal engine for creating PDF files from \LaTeX{} source documents. It outlines various practical steps and considerations necessary for both reviewing and finalizing documents using pdf\LaTeX{}, while also briefly touching upon Xe\LaTeX{}.

Document Setup and Preamble

The paper begins by asserting the importance of choosing the correct document class and preamble settings to ensure optimal functionality and appearance. The recommended approach initiates with the command:

1
\documentclass[11pt]{article}
This establishes the foundation for the document. Additional guidance is provided for loading the required style files, specifically noting the distinctions between draft and final document versions.

To align the document with Times Roman fonts, authors are instructed to include respective font packages within the preamble. The paper refers users to the source for examples of multiple authors in header fields, ensuring clarity in multi-author attributions.

Footnotes, Tables, and Figures

A segment of the paper is dedicated to the inclusion and formatting of footnotes, tables, and figures:

  • Footnotes: Insertion is facilitated by the \footnote command, an essential element in academic writing.
  • Tables: Through an example, it demonstrates the creation and captioning of tables without overriding default caption sizes. This ensures consistency in document formatting.
  • Figures: The usage of the graphicx package is recommended for including and manipulating graphics. Ensuring consistency in font usage between figures and the main document is emphasized, contributing to a professional appearance. Figures, both individual and side-by-side, are illustrated with comprehensive examples.

Hyperlinks and Citations

The document provides technical resolutions for common errors encountered during compilation, such as issues arising from hyperlink nesting when using pdf\LaTeX{}. The prescribed solution involves upgrading to a more recent version of \LaTeX{}.

For citations, compatibility with natbib styles is stressed, and various citation commands (\citep, \citet, \citealp, etc.) are outlined with examples. This section ensures that the document adheres to established citation norms, promoting consistency and accuracy in referencing.

References and Bib\TeX Files

In constructing a bibliography, the document provides concise instructions for integrating personal Bib\TeX{} files and references from the ACL Anthology. This ensures a streamlined and efficient process for bibliography management, enabling hyperlinks for DOIs and URLs through the hyperref package.

An example of generating a references section using Bib\TeX{} is provided:

1
\bibliography{custom}
or integrating multiple sources:
1
\bibliography{anthology,custom}

Equations and Appendices

Equations are presented with clear labeling and referencing techniques, ensuring that mathematical expressions are both readable and easily referable within the text. An example of such an equation is:

1
A = \pi r^2
For appendices, a switch in the section numbering is advised using the \appendix command, facilitating the organization of supplementary information.

Implications and Future Directions

This comprehensive guide on the usage of pdf\LaTeX{} for document preparation offers both theoretical insights and practical steps, significantly smoothing the scholarly writing process. By adhering to these guidelines, researchers can ensure their documents are professionally formatted, accurately cited, and ready for publication. Moving forward, the seamless integration of advanced text editors and version control systems with these \LaTeX{} practices could further enhance the document preparation experience for researchers.

The document stands as a fundamental resource for researchers aiming to master the intricacies of \LaTeX{} and pdf\LaTeX{} for academic writing, ensuring high standards in document presentation and functionality.