Papers

Topics

Authors

Recent

View all

Detailed Answer

Quick Answer

Concise responses based on abstracts only

Detailed Answer

Well-researched responses based on abstracts and relevant paper content.

Custom Instructions Pro

Preferences or requirements that you'd like Emergent Mind to consider when generating responses

Gemini 2.5 Flash

Gemini 2.5 Flash 43 tok/s

Gemini 2.5 Pro 49 tok/s Pro

GPT-5 Medium 17 tok/s Pro

GPT-5 High 19 tok/s Pro

GPT-4o 96 tok/s Pro

Kimi K2 197 tok/s Pro

GPT OSS 120B 455 tok/s Pro

Claude Sonnet 4 37 tok/s Pro

2000 character limit reached

Generating a Low-code Complete Workflow via Task Decomposition and RAG (2412.00239v1)

Published 29 Nov 2024 in cs.SE and cs.AI

Abstract: AI technologies are moving rapidly from research to production. With the popularity of Foundation Models (FMs) that generate text, images, and video, AI-based systems are increasing their complexity. Compared to traditional AI-based software, systems employing FMs, or GenAI-based systems, are more difficult to design due to their scale and versatility. This makes it necessary to document best practices, known as design patterns in software engineering, that can be used across GenAI applications. Our first contribution is to formalize two techniques, Task Decomposition and Retrieval-Augmented Generation (RAG), as design patterns for GenAI-based systems. We discuss their trade-offs in terms of software quality attributes and comment on alternative approaches. We recommend to AI practitioners to consider these techniques not only from a scientific perspective but also from the standpoint of desired engineering properties such as flexibility, maintainability, safety, and security. As a second contribution, we describe our industry experience applying Task Decomposition and RAG to build a complex real-world GenAI application for enterprise users: Workflow Generation. The task of generating workflows entails generating a specific plan using data from the system environment, taking as input a user requirement. As these two patterns affect the entire AI development cycle, we explain how they impacted the dataset creation, model training, model evaluation, and deployment phases.

Collections

Summary

The paper formalizes Task Decomposition and Retrieval-Augmented Generation (RAG) as design patterns to enhance software engineering practices for systems built with generative AI.
Task Decomposition breaks complex workflows into simpler sub-tasks for better modularity and testing, while RAG leverages external data to improve accuracy and reduce hallucinations in model output.
A case study on low-code workflow generation demonstrates how these patterns improve system modifiability, testability, and flexibility in practical GenAI applications.

Generating a Low-code Complete Workflow via Task Decomposition and RAG

The paper "Generating a Low-code Complete Workflow via Task Decomposition and RAG" presented by Orlando Marquez Ayala and Patrice Bechard addresses the complexities inherent in designing systems that leverage Generative AI (GenAI) technologies. With the increasing deployment of foundation models (FMs) such as LLMs in software products, software engineering practices must evolve to handle these advanced capabilities effectively. The authors contribute to this discourse by formalizing two techniques—Task Decomposition and Retrieval-Augmented Generation (RAG)—as design patterns for GenAI-based systems.

In the context of software engineering for AI, the paper recognizes the divide between AI research and its practical implementation in production environments. This gap often results in AI models being developed without sufficient consideration of essential software engineering attributes such as flexibility, maintainability, safety, and security. Task Decomposition and RAG are proposed as methodologies to bridge this gap, offering a structured approach to handle AI tasks that require complex outputs and external data dependencies.

Task Decomposition and RAG as Design Patterns

Task Decomposition is highlighted as a divide-and-conquer strategy, breaking complex ML workflows into manageable sub-tasks. This approach is particularly beneficial for scenarios requiring extensive structured outputs, as it simplifies testing, increases functional correctness, and enhances the modularity of the system by enabling focused improvements on specific components. RAG, on the other hand, tackles knowledge limitations in FMs by augmenting generation processes with external data, thereby decreasing the occurrence of model hallucinations and enhancing information retrieval from vast databases.

The authors critically assess these techniques against desired software quality attributes, incorporating flexibility in AI model deployment by allowing variations in the complexity of the model tasks. For instance, the use of RAG allows for adaptive retrieval of data, thereby enabling the FM to generate content grounded in up-to-date information from its environment. The utility of RAG is underscored by its capacity to enhance security and interpretability, as the FM can select from a curated set of retrieval options to generate output that is not only correct but also traceable.

Case Study: Workflow Generation

To demonstrate the application of these design patterns, the paper presents a case paper detailing the construction of a GenAI application for enterprise users—Workflow Generation. The case paper reveals the practical challenges encountered when generating low-code workflows, utilizing Task Decomposition to sequentially develop and refine distinct sub-tasks. This methodological approach mitigates quality issues related to scalability and performance by employing a smaller FM to execute simpler tasks while leveraging RAG to stabilize output through suggestive data retrieval mechanisms.

Through the case paper, the authors elucidate the benefits of their approach in a real-world application, revealing improved system modifiability and testability. Specifically, employing a tree-structured representation of workflows enabled the utilization of tree edit distance metrics for evaluation. This not only facilitated the measurement of correctness across different workflow components but also allowed for strategic improvements based on specific sub-task performance metrics. By modularizing the system's AI components, the authors designed a flexible architecture conducive to incremental enhancements and reduced time-to-market.

Implications and Future Directions in AI

The implications of this research are multifaceted. Practically, it offers a roadmap for AI practitioners interested in deploying complex GenAI-driven products within an enterprise context. By providing detailed documentation of Task Decomposition and RAG implementation, the researchers have positioned these techniques as foundational elements necessary to optimize AI software pipelines—a crucial consideration as the industry accelerates towards a future powered increasingly by autonomous AI systems. Theoretically, this research adds to the canon of design patterns in software engineering, extending traditional methodologies to account for the unique challenges posed by GenAI.

Looking ahead, the evolution of design patterns that incorporate dynamic retrieval and modular decomposition could prove instrumental for developing AI systems capable of generalizing across diverse contexts and adapting to unpredictable changes in environmental data. The case paper invites future research in optimizing multi-FM strategies and fine-tuning RAG implementations to refine retrieval quality amid continuously shifting data landscapes. Through such endeavors, the intersection of AI research and software engineering will continue to be a fertile ground for innovation, enhancing systems' capabilities to deliver secure, maintainable, and high-quality solutions.