StickyLand: Breaking the Linear Presentation of Computational Notebooks (2202.11086v1)

Published 22 Feb 2022 in cs.HC and cs.LG

Abstract: How can we better organize code in computational notebooks? Notebooks have become a popular tool among data scientists, as they seamlessly weave text and code together, supporting users to rapidly iterate and document code experiments. However, it is often challenging to organize code in notebooks, partially because there is a mismatch between the linear presentation of code and the non-linear process of exploratory data analysis. We present StickyLand, a notebook extension for empowering users to freely organize their code in non-linear ways. With sticky cells that are always shown on the screen, users can quickly access their notes, instantly observe experiment results, and easily build interactive dashboards that support complex visual analytics. Case studies highlight how our tool can enhance notebook users's productivity and identify opportunities for future notebook designs. StickyLand is available at https://github.com/xiaohk/stickyland.

Authors (3)

Zijie J. Wang (39 papers)
Katie Dai (1 paper)
W. Keith Edwards (1 paper)

Citations (22)

View on Semantic Scholar

Summary

Breaking the Linear Presentation of Computational Notebooks

The paper "Breaking the Linear Presentation of Computational Notebooks" addresses a significant issue encountered by many data scientists using Jupyter Notebooks and similar tools: the linear organization of code. This format does not align with the inherently non-linear progression of exploratory data analysis (EDA). The authors introduce an extended functionality through a tool, referred to as $Name$ , designed to enhance the flexibility and efficiency of data scientists' workflows.

Key Contributions

$Name$ introduces a novel interface that allows users to transcend the traditional linear constraints of computational notebooks. The main innovations presented include:

Sticky Cells Mechanism: A feature where cells can persist on the screen, allowing users to maintain their visibility while navigating through the notebook. This facilitates better accessibility of critical notes, results, or interactive dashboards without the need to scroll through the notebook.
Floating Cells and Dashboards: By enabling cells to float, users can create customizable layouts and dashboards that support complex visual analytics, offering a more dynamic and interactive workspace.
Auto-Run Functionality: This feature simplifies the execution process by automatically running specific code cells when changes are made elsewhere in the notebook, thereby enhancing workflow efficiency.

These features collectively aim to resolve issues related to code management, execution order, and maintaining context, providing an enriched EDA experience.

Implications and Future Directions

The development of $Name$ holds both practical and theoretical significance. Practically, it can substantially improve the productivity of data scientists by aligning the notebook interface with the non-linear nature of EDA workflows. Theoretically, it challenges the conventional linear assumptions inherent in the design of many computational tools, proposing an alternative that may inspire further innovations in notebook interfaces.

There are implications for future research and practical deployment in AI and data science environments. Extensions of this work could involve:

Topological Execution: Enhancing auto-run capabilities to consider dependencies among cells more effectively, thereby reducing redundant computations.
Integration with Other Platforms: Expanding compatibility with additional notebook environments like Google Colab or VSCode Notebook to broaden user adoption.
Independent States: Allowing for independent cell states could provide a safer experimentation space, minimizing unwanted interferences in the main notebook state.

Conclusion

This paper offers a robust exploration of how non-linear organization within notebooks can transform data analysis practices. $Name$ serves as an innovative tool, addressing current workflow inefficiencies and providing a platform for continued exploration in computational notebook design. This research stands as a valuable contribution to improving the efficacy and user experience of tools integral to modern data science.

PDF Markdown

Related Papers

GitHub

GitHub - xiaohk/stickyland: Break the linear presentation of Jupyter Notebooks with sticky cells! (514 stars)

Tweets

https://twitter.com/_akhaliq/status/1496309708759736324

https://twitter.com/carlcarrie/status/1591219819151777794

https://twitter.com/krassowski_m/status/1500462036702146562

https://twitter.com/MehdiAllahyari/status/1715942983386739014

https://twitter.com/neurosock/status/1504091957118787587

https://twitter.com/trending__ts/status/1505620337031065600

YouTube

Show All Videos