- The paper introduces Agentic Neural Networks, a framework that models multi-agent systems as layered networks with self-evolving capabilities.
- It employs a two-phase optimization strategy with forward dynamic team selection and backward textual refinement to enhance agent collaboration.
- Experimental results on four benchmarks, including HumanEval (72.7% accuracy with GPT-3.5 and 87.8% with GPT-4), validate its superiority over static multi-agent configurations.
Agentic Neural Networks: Self-Evolving Multi-Agent Systems via Textual Backpropagation
The paper "Agentic Neural Networks: Self-Evolving Multi-Agent Systems via Textual Backpropagation" (arXiv:2506.09046) introduces the Agentic Neural Network (ANN), a framework that applies neural network principles to multi-agent systems (MAS). ANN aims to overcome the limitations of static, manually engineered multi-agent configurations by modeling collaboration as a layered network architecture: each agent acts as a node, and each layer forms a cooperative team focused on a specific subtask.
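To make the layered analogy concrete, here is a minimal sketch of the corresponding data structures in Python. The names (`Agent`, `Layer`, `AgenticNetwork`) are illustrative, not taken from the paper's implementation:

```python
from dataclasses import dataclass, field

@dataclass
class Agent:
    """A node in the network: an LLM role defined by its prompt."""
    role: str
    prompt: str

@dataclass
class Layer:
    """A cooperative team of agents assigned to one subtask."""
    subtask: str
    agents: list[Agent] = field(default_factory=list)

@dataclass
class AgenticNetwork:
    """Layers connected in sequence; each layer's output feeds the next."""
    layers: list[Layer] = field(default_factory=list)
```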
Core Methodology
The ANN methodology draws inspiration from classic neural networks, replacing numerical weight optimization with dynamic agent-based team selection and iterative textual refinement. It employs a two-phase optimization strategy: a forward phase for dynamic team selection and a backward phase for textual, gradient-like refinement.
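Read as pseudocode, the overall two-phase loop might look like the sketch below, where `forward`, `backward`, and `evaluate` are injected callables standing in for the paper's components, and the threshold and epoch cap are arbitrary assumptions:

```python
from typing import Callable

def run_ann(forward: Callable[[str], tuple[str, str]],
            backward: Callable[[str, float], None],
            evaluate: Callable[[str], float],
            task: str, threshold: float = 0.9, max_epochs: int = 5) -> str:
    """Two-phase optimization: forward team selection, then backward
    textual refinement whenever validation falls below the threshold."""
    output = ""
    for _ in range(max_epochs):
        output, trajectory = forward(task)  # phase 1: dynamic team selection
        score = evaluate(output)            # validation-based performance check
        if score >= threshold:              # threshold met: stop refining
            break
        backward(trajectory, score)         # phase 2: textual refinement
    return output
```

Here validation is reduced to a single score for brevity; the paper's performance checks are richer.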
Forward Dynamic Team Selection
In the forward phase, the framework decomposes a complex task into subtasks, assigning each to a layer of specialized agents. This process involves:
- Defining the ANN structure: The architecture mimics neural networks, where each layer consists of agent nodes connected in a sequence to facilitate information flow.
- Selecting Layer-wise Aggregation Functions: A mechanism dynamically determines the most appropriate aggregation function at each layer, combining outputs from multiple agents based on subtask requirements.
The aggregation function at layer $\ell$ is selected as
$$f_\ell = \mathrm{DynamicRoutingSelect}(F_\ell,\, \ell,\, I_\ell,\, I),$$
where $F_\ell$ is the set of candidate aggregation functions, $I_\ell$ is the input to layer $\ell$, and $I$ is the task-specific information. A minimal sketch of one way this selection could be realized follows.
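One plausible realization of $\mathrm{DynamicRoutingSelect}$ is to let an LLM choose among a small candidate set $F_\ell$. The candidate functions and prompt wording below are illustrative assumptions, not the paper's exact design:

```python
from typing import Callable

# Illustrative candidate aggregation functions (the set F_ell): each
# combines the outputs of a layer's agents into one layer output.
def concat_all(outputs: list[str]) -> str:
    return "\n".join(outputs)

def majority_vote(outputs: list[str]) -> str:
    return max(set(outputs), key=outputs.count)

def first_valid(outputs: list[str]) -> str:
    return next((o for o in outputs if o.strip()), "")

CANDIDATES = {"concat": concat_all, "vote": majority_vote, "first": first_valid}

def dynamic_routing_select(llm: Callable[[str], str],
                           layer_idx: int,
                           layer_input: str,
                           task_info: str) -> Callable[[list[str]], str]:
    """Ask an LLM to pick f_ell from the candidate set, given the
    layer input I_ell and the task-level information I."""
    prompt = (
        f"Task: {task_info}\n"
        f"Layer {layer_idx} input: {layer_input}\n"
        f"Pick the best aggregation from {list(CANDIDATES)} "
        f"and reply with its name only."
    )
    choice = llm(prompt).strip().lower()
    return CANDIDATES.get(choice, concat_all)  # fall back to concatenation
```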
Figure 1: Comparison of static and dynamic agentic teams, illustrating the adaptability of the ANN framework.
Backward Optimization
If the predefined performance thresholds are not met after the forward pass, the backward optimization phase is triggered to refine agent interactions and aggregation functions at both global (system-wide) and local (layer-specific) levels.
- Global Optimization: Analyzes inter-layer coordination, refining interconnections and data flow to improve overall system performance. The global gradient is computed as
$$G_{\text{global}} = \mathrm{ComputeGlobalGradient}(S, \tau),$$
where $S$ represents the global workflow and $\tau$ denotes the execution trajectory.
- Local Optimization: Fine-tunes agents and aggregation functions within each layer, adjusting them based on detailed performance feedback. The local gradient for layer $\ell$ at iteration $t$ is computed as
$$G_{\text{local},\ell}^{\,t} = \beta\, G_{\text{global}} + (1-\beta)\,\mathrm{ComputeLocalGradient}(\ell, f_\ell, \tau),$$
where $\beta$ is a weighting factor that balances the influence of global optimization and layer-specific gradients; a sketch of how such textual gradients might be computed follows the list.
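In a textual-backpropagation setting, these "gradients" are natural-language critiques rather than numeric vectors. The sketch below assumes an LLM callable and approximates the $\beta$-weighted combination by tagging each critique with its relative emphasis; the prompts and function names are hypothetical:

```python
from typing import Callable

def compute_global_gradient(llm: Callable[[str], str],
                            workflow: str, trajectory: str) -> str:
    """G_global: a natural-language critique of the whole workflow S
    given the execution trajectory tau."""
    return llm(
        "Review this multi-agent workflow end to end.\n"
        f"Workflow: {workflow}\nTrajectory: {trajectory}\n"
        "List concrete inter-layer coordination problems and fixes."
    )

def compute_local_gradient(llm: Callable[[str], str], layer_idx: int,
                           agg_name: str, trajectory: str) -> str:
    """Layer-specific critique of the agents and aggregation f_ell."""
    return llm(
        f"Review only layer {layer_idx} (aggregation: {agg_name}) "
        f"in this trajectory and suggest improvements:\n{trajectory}"
    )

def blend_gradients(g_global: str, g_local: str, beta: float = 0.5) -> str:
    """Textual analogue of the beta-weighted sum: both critiques are
    forwarded to the refiner, tagged with their relative emphasis."""
    return (f"[emphasis {beta:.2f}] Global feedback:\n{g_global}\n\n"
            f"[emphasis {1 - beta:.2f}] Layer feedback:\n{g_local}")
```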
To improve stability, ANN employs momentum-based optimization, which carries feedback from earlier iterations into each new update; a minimal sketch follows.
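Assuming momentum here means retaining a short window of earlier textual gradients when refining a prompt (the prompt wording and window size are assumptions), one possible realization:

```python
from typing import Callable

def refine_with_momentum(llm: Callable[[str], str], prompt: str,
                         gradient: str, history: list[str],
                         window: int = 3) -> str:
    """Rewrite an agent prompt using the new textual gradient plus a
    short window of earlier gradients, so successive edits stay
    consistent instead of oscillating between epochs."""
    history.append(gradient)
    recent = "\n---\n".join(history[-window:])  # the 'momentum' term
    return llm(
        "Rewrite the agent prompt below to address the feedback, "
        "staying consistent with the earlier feedback.\n"
        f"Prompt:\n{prompt}\n\nFeedback (oldest to newest):\n{recent}"
    )
```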
Experimental Validation
The ANN framework was evaluated on four challenging datasets: MATH (mathematical reasoning), DABench (data analysis), Creative Writing, and HumanEval (code generation). The results indicate that ANN simplifies MAS design by automating prompt tuning, role assignment, and agent collaboration, outperforming existing baselines in accuracy. For instance, on HumanEval, ANN achieved 72.7% accuracy with GPT-3.5 and 87.8% with GPT-4.
Figure 2: Ablation study results on HumanEval, Creative Writing, MATH, and DABench, demonstrating the impact of various components of the ANN framework.
The paper also presents ablation studies to quantify the contribution of each component of the ANN framework. The study compares four variants: the full ANN approach, a variant without momentum-based optimization, a variant without validation-based performance checks, and a variant without backward optimization. The results indicate that each component contributes significantly to performance, and combining them yields the most reliable and robust improvements.
Implications and Future Directions
The ANN framework marks a shift in multi-agent systems from static, manually designed architectures toward data-driven, automated approaches. Its ability to dynamically reconfigure agent teams and coordination strategies offers a promising direction for building more robust and flexible multi-agent systems.
Future work may focus on automating the generation of initial layouts from accumulated agent experience via meta-prompt learning, integrating advanced pruning techniques to enhance efficiency, introducing a dynamic role-adjustment mechanism, and combining multi-agent fine-tuning with global and local tuning of the multi-agent workflow.
Figure 3: Prompt-evolution trajectory for the HumanEval dataset.
Figure 4: Prompt-evolution trajectory for the DABench dataset.
Conclusion
The Agentic Neural Network (ANN) presents a novel approach to multi-agent systems by integrating neural network principles with LLMs. The framework's dynamic agent team formation, two-phase optimization pipeline, and self-evolving capabilities demonstrate its potential for orchestrating complex multi-agent workflows. The ANN framework effectively combines symbolic coordination with connectionist optimization, paving the way for fully automated and self-evolving multi-agent systems.