
Meeting in the notebook: a notebook-based environment for micro-submissions in data science collaborations (2103.15787v1)

Published 29 Mar 2021 in cs.HC

Abstract: Developers in data science and other domains frequently use computational notebooks to create exploratory analyses and prototype models. However, they often struggle to incorporate existing software engineering tooling into these notebook-based workflows, leading to fragile development processes. We introduce Assemblé, a new development environment for collaborative data science projects, in which promising code fragments of data science pipelines can be contributed as pull requests to an upstream repository entirely from within JupyterLab, abstracting away low-level version control tool usage. We describe the design and implementation of Assemblé and report on a user study of 23 data scientists.

Authors (3)
  1. Micah J. Smith (6 papers)
  2. Jürgen Cito (22 papers)
  3. Kalyan Veeramachaneni (38 papers)
