Word2World: Generating Stories and Worlds through Large Language Models (2405.06686v1)

Published 6 May 2024 in cs.CL and cs.AI

Abstract: LLMs have proven their worth across a diverse spectrum of disciplines. LLMs have shown great potential in Procedural Content Generation (PCG) as well, but directly generating a level through a pre-trained LLM is still challenging. This work introduces Word2World, a system that enables LLMs to procedurally design playable games through stories, without any task-specific fine-tuning. Word2World leverages the abilities of LLMs to create diverse content and extract information. Combining these abilities, LLMs can create a story for the game, design narrative, and place tiles in appropriate places to create coherent worlds and playable games. We test Word2World with different LLMs and perform a thorough ablation study to validate each step. We open-source the code at https://github.com/umair-nasir14/Word2World.

Authors (3)
  1. Muhammad U. Nasir
  2. Steven James
  3. Julian Togelius

Summary

  • The paper introduces a novel method to generate playable game environments directly from narratives using a multi-step LLM process.
  • It employs a detailed pipeline for tile extraction and iterative refinement to ensure coherent and structurally sound game levels.
  • Experimental evaluations reveal that larger LLMs achieve superior novelty and coherence in procedurally generated game worlds.

Word2World: Turning Stories into Playable Game Levels Using LLMs

Understanding Word2World

Imagine being able to create an entire playable game level just from a story. That’s what the new system, Word2World, aims to achieve. It leverages the capabilities of LLMs to transform textual descriptions into coherent, playable game environments without needing task-specific fine-tuning. Word2World brings us closer to fully automating procedural content generation (PCG) in gaming, marking a significant step in the use of LLMs beyond text generation.

Breaking Down the Components

Procedural Content Generation with LLMs

The process of generating game levels in Word2World is divided into several steps:

  1. Story Creation: An LLM first generates a story, which forms the narrative foundation for the game.
  2. Information Extraction: The LLM then extracts the essential details from this story, such as character descriptions, goals, and tile information, including which tiles are critical, walkable, or interactive.
  3. World Generation: The system lays down the environment tiles in one pass, then places characters and important interactive tiles in a second pass.
  4. Algorithmic Refinements: Algorithmic checks ensure tiles are correctly placed and the map adheres to specific constraints (e.g., equal row lengths in the tile map).
  5. Feedback Loop: The process involves multiple rounds where evaluations from previous iterations are fed back to refine the world until it is coherent and playable.

This multi-step process helps ensure the generated world is not only diverse and rich in content but also structurally sound for gameplay.
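To make the flow concrete, here is a minimal sketch of how such a story-to-world pipeline could be wired together in Python. Everything below (the `call_llm` helper, the prompts, and the JSON keys) is a hypothetical stand-in for illustration, not Word2World's actual code.

```python
import json

def call_llm(prompt: str) -> str:
    """Placeholder for any chat-completion API call (hypothetical)."""
    raise NotImplementedError("Plug in your preferred LLM client here.")

def generate_story(theme: str) -> str:
    return call_llm(f"Write a short adventure story about {theme}.")

def extract_info(story: str) -> dict:
    # Ask the LLM for structured output; this schema is an assumption.
    prompt = (
        "From the story below, return JSON with keys "
        "'characters', 'goals', 'important_tiles', 'walkable_tiles':\n" + story
    )
    return json.loads(call_llm(prompt))

def generate_world(info: dict, rows: int = 15, cols: int = 20) -> list[list[str]]:
    # First pass: environment tiles only; second pass adds characters/objects.
    env = call_llm(f"Lay out a {rows}x{cols} tile map using {info['walkable_tiles']}.")
    full = call_llm(f"Place {info['characters']} and {info['important_tiles']} on:\n{env}")
    return [list(row) for row in full.splitlines() if row]

def fix_map(tile_map: list[list[str]], pad: str = ".") -> list[list[str]]:
    # Algorithmic check: pad rows so every row has the same length.
    width = max(len(r) for r in tile_map)
    return [r + [pad] * (width - len(r)) for r in tile_map]

def word2world_like_pipeline(theme: str, rounds: int = 3) -> list[list[str]]:
    story = generate_story(theme)
    info = extract_info(story)
    world = fix_map(generate_world(info))
    for _ in range(rounds):
        # Feedback loop: critique the previous world and regenerate.
        feedback = call_llm(f"Critique this map for coherence with the story:\n{world}")
        world = fix_map(generate_world({**info, "feedback": feedback}))
    return world
```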

Tile Selection Process

Choosing the right tiles is crucial to creating an engaging and coherent game world. Word2World uses a pre-defined dataset of environment and character tiles, each manually described and labeled to support retrieval. DistilBERT, a smaller, faster distillation of BERT, embeds these descriptions, and the most relevant tiles are selected via cosine similarity.
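A minimal sketch of this retrieval step, assuming Hugging Face `transformers` and a small hand-labeled tile catalogue (the tile names and descriptions are invented for illustration):

```python
import torch
from transformers import AutoTokenizer, AutoModel

# Hypothetical hand-labeled tile catalogue (description -> sprite name).
TILE_CATALOGUE = {
    "dense green forest with tall trees": "forest.png",
    "calm blue water, impassable": "water.png",
    "stone castle wall, blocks movement": "wall.png",
    "wooden treasure chest, interactive": "chest.png",
}

tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")
model = AutoModel.from_pretrained("distilbert-base-uncased")

def embed(texts: list[str]) -> torch.Tensor:
    """Mean-pool DistilBERT token embeddings into one vector per text."""
    batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**batch).last_hidden_state          # (B, T, 768)
    mask = batch["attention_mask"].unsqueeze(-1)           # (B, T, 1)
    return (hidden * mask).sum(1) / mask.sum(1)            # (B, 768)

def best_tile(llm_description: str) -> str:
    """Return the catalogue sprite whose description is most similar."""
    descriptions = list(TILE_CATALOGUE)
    sims = torch.nn.functional.cosine_similarity(
        embed([llm_description]), embed(descriptions)
    )
    return TILE_CATALOGUE[descriptions[int(sims.argmax())]]

print(best_tile("a thick, shadowy wood the hero must cross"))  # likely forest.png
```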

LLM Agents as Game Testers

An interesting aspect of Word2World is the use of LLM agents as evaluators that simulate playing the game. These agents generate action sequences (moving up, down, left, or right, picking up objects, and hitting enemies) to test whether the generated levels are playable. Their performance is assessed by the rewards they collect for correctly completing tasks in the game environment.
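One way such an evaluation loop could look is sketched below: an agent's proposed action sequence is replayed in a tiny grid-world simulator and scored by the rewards it collects. The map, action set, and reward values are illustrative assumptions, not the paper's exact protocol.

```python
# Hypothetical 2D world: '@' player, 'K' key (pick up), 'E' enemy (hit),
# 'G' goal tile, '#' wall.
WORLD = [
    "#######",
    "#@..K.#",
    "#..#..#",
    "#.E..G#",
    "#######",
]

MOVES = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}

def score_actions(actions: list[str]) -> float:
    grid = [list(row) for row in WORLD]
    r, c = next((i, j) for i, row in enumerate(grid)
                for j, ch in enumerate(row) if ch == "@")
    reward = 0.0
    for act in actions:
        if act in MOVES:
            dr, dc = MOVES[act]
            nr, nc = r + dr, c + dc
            if grid[nr][nc] != "#":          # walls block movement
                r, c = nr, nc
        elif act == "pick" and grid[r][c] == "K":
            reward += 1.0                     # picked up the key
            grid[r][c] = "."
        elif act == "hit" and any(
            grid[r + dr][c + dc] == "E" for dr, dc in MOVES.values()
        ):
            reward += 1.0                     # defeated an adjacent enemy
        if grid[r][c] == "G":
            reward += 5.0                     # reached the goal
            break
    return reward

# An action sequence as an LLM agent might propose it:
print(score_actions(["right", "right", "right", "pick",
                     "down", "down", "left", "hit", "right", "right"]))
```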

Evaluation of Word2World

The robustness of Word2World is demonstrated through various evaluations:

  • LLM-based Evaluations: These assess the coherence of the game world with the narrative.
  • Conventional PCG Checks:
    • Playability checks using an A* agent (see the sketch after this list).
    • Path length measurements.
    • Novelty assessments based on differences from previously generated worlds.
    • Accuracy checks for the placement of character and important tiles.
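Below is a minimal sketch of the playability, path-length, and novelty checks, assuming a uniform-cost tile grid with a single goal tile and a simple tile-difference novelty measure; these specifics are assumptions, not the paper's exact metrics.

```python
import heapq

def a_star_path_length(tile_map: list[str], start: tuple, goal: tuple,
                       walkable: set[str]) -> int | None:
    """Return the shortest path length from start to goal, or None if unplayable."""
    def h(p):  # Manhattan-distance heuristic
        return abs(p[0] - goal[0]) + abs(p[1] - goal[1])

    frontier = [(h(start), 0, start)]
    best = {start: 0}
    while frontier:
        _, g, (r, c) = heapq.heappop(frontier)
        if (r, c) == goal:
            return g
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if (0 <= nr < len(tile_map) and 0 <= nc < len(tile_map[0])
                    and tile_map[nr][nc] in walkable
                    and g + 1 < best.get((nr, nc), float("inf"))):
                best[(nr, nc)] = g + 1
                heapq.heappush(frontier, (g + 1 + h((nr, nc)), g + 1, (nr, nc)))
    return None  # goal unreachable -> world not playable

def novelty(world_a: list[str], world_b: list[str]) -> float:
    """Fraction of tiles that differ between two same-sized worlds (assumed metric)."""
    cells = [(a, b) for ra, rb in zip(world_a, world_b) for a, b in zip(ra, rb)]
    return sum(a != b for a, b in cells) / len(cells)

world = ["....#",
         ".##.#",
         "....."]
print(a_star_path_length(world, start=(0, 0), goal=(2, 4), walkable={"."}))  # -> 6
print(novelty(world, ["....#", ".#..#", "....."]))                           # -> ~0.067
```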

Experimental Insights

Word2World was tested across multiple runs, revealing some interesting findings:

  • The method consistently generates coherent and playable worlds.
  • Ablation studies (where specific steps are removed) show that every component in Word2World's multi-step process crucially contributes to its performance. For instance, omitting goal extraction or important tile extraction significantly hampers the quality of generated levels.
  • Different LLMs were compared, with larger models (like GPT-4 and Claude-3) generally outperforming smaller variants in terms of novelty and coherence.

Broader Implications and Future Directions

Practical Implications

The practical implications of Word2World are vast for both the gaming industry and the research community. For the gaming industry, it offers a tool to rapidly prototype game levels based on narrative inputs, significantly reducing development time and cost. For researchers, Word2World provides diverse environments that can be used for various AI and machine learning experiments, especially in reinforcement learning.

Theoretical Implications

From a theoretical perspective, Word2World bridges a gap between narrative generation and computational game creativity. It provides a new framework to explore how stories can be translated into structured, interactive environments, potentially setting new benchmarks for narrative-driven game development.

Future Developments

There's a lot of potential for expanding Word2World:

  • Different Game Genres: Adapting the system for 2D platformers or 3D environments.
  • Open-World Games: Creating expansive, open-world environments based on storybooks or other narrative formats.
  • Reinforcement Learning: As Word2World can generate diverse and coherent environments, it could be a valuable tool for open-ended learning and other advanced AI experiments.

Word2World represents a significant step forward in using LLMs for procedural content generation, making it easier than ever to turn imaginative stories into interactive game worlds. As this technology develops, we can expect even more innovative solutions for creating rich, engaging virtual environments.