Dice Question Streamline Icon: https://streamlinehq.com

Prioritizing Agentic Use Cases Across Heterogeneous Lakehouse Platforms

Determine a systematic framework for prioritizing AI agent use cases within data lakehouse environments, specifying which data lifecycle tasks should be targeted first across heterogeneous lakehouse architectures and interfaces to guide safe and effective agent deployment.

Information Square Streamline Icon: https://streamlinehq.com

Background

The paper proposes a programmable, API-first lakehouse design and demonstrates a proof-of-concept for agentic repair of data pipelines. While the authors showcase one concrete use case, they explicitly note uncertainty about how to prioritize agent-driven automation across diverse, heterogeneous platforms.

This uncertainty reflects the breadth of possible agentic tasks in lakehouses—spanning data access, pipeline execution, observability, and governance—combined with differences in platform interfaces and operational constraints. A principled prioritization would help decide which agentic workflows should be addressed first in production settings.

References

On the other, it is unclear how to prioritize agentic use cases across such heterogeneous platforms.

Safe, Untrusted, "Proof-Carrying" AI Agents: toward the agentic lakehouse (2510.09567 - Tagliabue et al., 10 Oct 2025) in Introduction