Papers

Topics

Authors

Recent

View all

Detailed Answer

Quick Answer

Concise responses based on abstracts only

Detailed Answer

Well-researched responses based on abstracts and relevant paper content.

Custom Instructions Pro

Preferences or requirements that you'd like Emergent Mind to consider when generating responses

Gemini 2.5 Flash

Gemini 2.5 Flash 82 tok/s

Gemini 2.5 Pro 52 tok/s Pro

GPT-5 Medium 19 tok/s Pro

GPT-5 High 17 tok/s Pro

GPT-4o 107 tok/s Pro

Kimi K2 174 tok/s Pro

GPT OSS 120B 468 tok/s Pro

Claude Sonnet 4 37 tok/s Pro

2000 character limit reached

Eliza: A Web3 friendly AI Agent Operating System (2501.06781v2)

Published 12 Jan 2025 in cs.AI

Abstract: AI Agent, powered by LLMs as its cognitive core, is an intelligent agentic system capable of autonomously controlling and determining the execution paths under user's instructions. With the burst of capabilities of LLMs and various plugins, such as RAG, text-to-image/video/3D, etc., the potential of AI Agents has been vastly expanded, with their capabilities growing stronger by the day. However, at the intersection between AI and web3, there is currently no ideal agentic framework that can seamlessly integrate web3 applications into AI agent functionalities. In this paper, we propose Eliza, the first open-source web3-friendly Agentic framework that makes the deployment of web3 applications effortless. We emphasize that every aspect of Eliza is a regular Typescript program under the full control of its user, and it seamlessly integrates with web3 (i.e., reading and writing blockchain data, interacting with smart contracts, etc.). Furthermore, we show how stable performance is achieved through the pragmatic implementation of the key components of Eliza's runtime. Our code is publicly available at https://github.com/ai16z/eliza.

Collections

Summary

The paper presents a modular, open‐source AI agent OS, Eliza, that integrates large language models with blockchain to enable native Web3 operations.
Its design decouples functionalities into distinct modules such as Agent Runtimes, Character Files, and flexible plugins, fostering both extensibility and precise context management.
Benchmark tests using the GAIA framework, with scores averaging 19.42% across three difficulty levels, underscore Eliza’s effectiveness in decentralized applications.

The paper presents a comprehensive design and implementation of Eliza, an open‐source, Web3-friendly AI agent operating system that integrates large LLMs with blockchain functionalities. The authors outline a framework that is built entirely in TypeScript, enabling native integration with Web3 environments such as Ethereum, Solana, and others, while maintaining full user control over the underlying code. The effort is characterized by a modular design that decouples core agent functionality into distinct components such as Agent Runtimes, Character Files, Providers, Actions, Evaluators, and Plugins.

The technical contributions include:

Agent Runtimes:

The runtime is implemented as a primary class (AgentRuntime) that abstracts message processing, state management, memory operations, and action execution. A minimal code snippet is provided that demonstrates instantiation with configuration parameters such as authentication tokens, model provider selection, and custom actions. This design prioritizes both robustness and extensibility in managing autonomous agent behaviors.

Character Files and Configuration:

Character Files are JSON configurations processed via a schema system (using Zod) that define the agent’s personality, behavioral traits, and operational parameters. This facilitates fine-grained control over the agent’s output style and interaction modalities, including integration with smart contracts and other blockchain operations.

Providers and Context Management:

Providers are implemented to supply dynamic contextual data such as market information, wallet details, and temporal context. The runtime allows simple registration of custom providers. Examples include Time Provider, Facts Provider, and a “degen Boredom Provider” that influences agent engagement levels based on recent conversation analysis.

Action and Evaluator Systems:

Actions encapsulate a wide array of operations ranging from executing blockchain transactions (e.g., token swapping, NFT generation) to processing multimedia content. Evaluators complement actions by performing real-time assessment and fact extraction during agent interactions. Both of these components are designed to be extendable, relying on a strict interface definition that simplifies integration of new functionalities.

Plugin Architecture:
- A typical plugin includes optional arrays of actions, providers, evaluators, services, and clients.
- The system supports both internal enhancements (e.g., Zero-shot Chain-of-Thought style reasoning) and external augmentations (e.g., Retrieval-Augmented Generation (RAG) techniques).
Intent Recognition and Communication:

The framework incorporates a multi-layered intent recognition system that leverages hierarchical action definitions combined with symbolic action triggers and context-aware evaluation. This ensures that the agent maintains conversation coherence, adequately tracks context with long-term memory support, and adapts across various communication channels such as Discord, Twitter, and more.

Benchmarks and Performance Evaluation:
- When operating in a swarm with three homogeneous agents and employing a majority voting mechanism, Eliza achieves scores of 32.21%, 21.70%, and 4.36% across three difficulty levels with an average score of 19.42%.
- These results were compared with other methods such as GPT-4, AutoGPT, GPTSwarm, and Langfun, allowing developers to gauge the operational performance in both AI reasoning and Web3 operational scenarios.
Web3-Specific Integration:
- A Solana plugin that manages token operations, wallet interactions, and establishes a trust score mechanism for evaluating transaction safety.
- Advanced implementation examples for image generation actions, including robust configuration validation, file management, and response handling for multimedia outputs.
Design Principles and Limitations:

The system is guided by three core design principles: 1. Prioritizing Web3 Developers: Emphasizing JavaScript/TypeScript paradigms to facilitate seamless integration with existing Web3 solutions. 2. Pluggable Modular Design: Decoupling components to ensure a straightforward extension and maintenance model. 3. Simplicity Over Exhaustiveness: Advocating a “roughness is better” approach to maximize rapid feature development without sacrificing system stability.

The authors also candidly discuss limitations such as the absence of an explicit workflow system, the need for further runtime optimization especially when scaling with multiple agents, and the potential for expanded multi-language support in future iterations.

Overall, the paper provides a detailed technical description and evaluation of Eliza, demonstrating its ability to bridge advanced AI agent capabilities with the practical necessities of decentralized Web3 applications. The modularity and extensibility of the architecture, combined with benchmark evaluations and in-depth integration examples, collectively underscore the framework's potential as a versatile tool for both AI and blockchain developers.