Polymind: Parallel Visual Diagramming with Large Language Models to Support Prewriting Through Microtasks (2502.09577v1)

Published 13 Feb 2025 in cs.HC

Abstract: Prewriting is the process of generating and organising ideas before a first draft. It consists of a combination of informal, iterative, and semi-structured strategies such as visual diagramming, which poses a challenge for collaborating with LLMs in a turn-taking conversational manner. We present Polymind, a visual diagramming tool that leverages multiple LLM-powered agents to support prewriting. The system features a parallel collaboration workflow in place of the turn-taking conversational interactions. It defines multiple ``microtasks'' to simulate group collaboration scenarios such as collaborative writing and group brainstorming. Instead of repetitively prompting a chatbot for various purposes, Polymind enables users to orchestrate multiple microtasks simultaneously. Users can configure and delegate customised microtasks, and manage their microtasks by specifying task requirements and toggling visibility and initiative. Our evaluation revealed that, compared to ChatGPT, users had more customizability over collaboration with Polymind, and were thus able to quickly expand personalised writing ideas during prewriting.

View on arXiv

Authors (4)

Qian Wan (26 papers)
Jiannan Li (10 papers)
Huanchen Wang (4 papers)
Zhicong Lu (41 papers)

Summary

An Expert Review of "Polymind: Parallel Visual Diagramming with LLMs to Support Prewriting Through Microtasks"

"Polymind: Parallel Visual Diagramming with LLMs to Support Prewriting Through Microtasks" presents a novel approach to facilitating the prewriting phase of writing. Leveraging the capabilities of LLMs, this paper introduces Polymind, a visual diagramming tool that aims to improve prewriting productivity by employing multiple LLM-enabled agents to execute microtasks in parallel. By focusing on visual diagramming, the system aids users in the generation and organization of ideas, a known challenge for traditional LLM interactive models due to their conversational, turn-taking nature.

Key Contributions and Findings

The research is grounded in the context of enhancing creativity through parallel thinking, a concept often realized through group collaboration but now innovatively applied to human-AI interaction. The authors define a set of predefined microtasks that cover both divergent (e.g., brainstorming, associating) and convergent (e.g., summarization, elaboration) thinking strategies. These microtasks are small, manageable tasks that execute over diagram nodes to simulate human collaboration scenarios, avoiding the limitations of serial interaction inherent in traditional LLM interfaces.

Prominent findings from the paper include:

User Autonomy and Control: Compared to traditional interfaces like ChatGPT, Polymind’s users experienced greater customization and control over the collaboration process. The ability to define and manage microtasks facilitated personalized idea expansion.
Enhanced Creativity and Efficiency: The parallel workflow supported by microtasks allowed for faster ideation and iteration, while maintaining a high level of user engagement and creativity during prewriting tasks.
Usability and Task Management: Despite the complexity involved in managing multiple microtasks simultaneously, Polymind was perceived as nearly as usable as standard LLM conversational interfaces. Features like initiative modes and task status notifications proved instrumental in mitigating the cognitive load associated with task management.

Practical and Theoretical Implications

From a practical standpoint, Polymind demonstrates significant potential for transforming writing processes by providing structured, yet flexible support in the ideation and planning stages. The tool's capacity to integrate multiple AI agents contrasts with the often linear, singular focus of current AI writing aids, introducing opportunities for enhanced collaborative creativity in various writing contexts.

Theoretically, the research offers insights into the interplay between human cognitive strategies and AI capabilities. By embedding microtasks within prewriting workflows, the paper aligns with current HCI research on augmenting human creativity through AI, emphasizing the importance of mixed-initiative user interfaces that allow for dynamic adjustments of AI assistance levels based on cognitive task demands.

Future Developments

The authors suggest that the framework established by Polymind could be extended beyond prewriting tasks to other diagram-based interfaces, such as those used in design or programming, offering distinct AI agents tailored to specific user needs. Future work might explore refining the task management features, integrating more sophisticated user feedback mechanisms, and further investigating the balance between AI initiative and user control, especially in complex or poorly structured tasks.

In conclusion, "Polymind: Parallel Visual Diagramming with LLMs to Support Prewriting Through Microtasks" contributes to the expanding field of human-AI collaborative creativity, offering a practical, innovative tool that enhances the capabilities of current LLM applications in the ideation phase of writing. It presents a compelling case for the integration of parallel AI processing as both a research direction and a practical framework for enhancing creative workflows.