
LLMs for sensory-motor control: Combining in-context and iterative learning (2506.04867v1)

Published 5 Jun 2025 in cs.AI, cs.HC, cs.LG, and cs.RO

Abstract: We propose a method that enables LLMs to control embodied agents by directly mapping continuous observation vectors to continuous action vectors. Initially, the LLMs generate a control strategy based on a textual description of the agent, its environment, and the intended goal. This strategy is then iteratively refined through a learning process in which the LLMs are repeatedly prompted to improve the current strategy, using performance feedback and sensory-motor data collected during its evaluation. The method is validated on classic control tasks from the Gymnasium library and the inverted pendulum task from the MuJoCo library. In most cases, it successfully identifies optimal or high-performing solutions by integrating symbolic knowledge derived through reasoning with sub-symbolic sensory-motor data gathered as the agent interacts with its environment.

LLMs for Sensory-Motor Control: Combining In-Context and Iterative Learning

The paper "LLMs for Sensory-Motor Control: Combining In-Context and Iterative Learning" by Jônata Tyska Carvalho and Stefano Nolfi proposes a novel methodology for controlling embodied agents with LLMs. The approach directly maps continuous observation vectors to continuous action vectors, bypassing the reliance on predefined motor primitives. This represents a significant departure from traditional methods, which typically require extensive training data or compose partial solutions from distinct motor primitives that may not integrate effectively given the dynamical complexity of agent-environment interaction.

Methodological Approach

The authors introduce a framework where LLMs initially formulate a control strategy based on textual descriptions of the agent and its environment. This strategy is iteratively refined through subsequent prompts that incorporate performance feedback and sensory-motor data from evaluations. The methodology thus combines symbolic reasoning with sub-symbolic sensory-motor information, leveraging the reasoning and in-context learning capabilities of LLMs. The iterative learning component allows the model to enhance its control policies autonomously, optimizing strategies through continual feedback without needing large-scale, human-provided demonstrations.
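The iterative loop described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: `query_llm` is a hypothetical stand-in for a real LLM API call (stubbed here to return a proportional controller with an increasing gain), and the environment is a toy 1-D regulation task rather than a Gymnasium benchmark.

```python
# Hypothetical sketch of the paper's outer loop: the LLM proposes a control
# policy as Python code, the policy is evaluated in the environment, and the
# score plus sub-sampled (observation, action) pairs are fed back into the
# next prompt.

def query_llm(prompt, iteration):
    # Stub: a real system would send `prompt` to an LLM and parse code from
    # its reply. Here we return a proportional controller whose gain grows
    # with the iteration, mimicking iterative refinement.
    gain = 0.2 * (iteration + 1)
    return f"def policy(obs):\n    return max(-1.0, min(1.0, {gain} * -obs))"

def evaluate(policy, steps=50):
    # Toy 1-D task: drive the state to 0 from x = 1.0; reward is -|x| per step.
    x, total_reward, trace = 1.0, 0.0, []
    for _ in range(steps):
        a = policy(x)
        trace.append((round(x, 3), round(a, 3)))
        x = x + 0.5 * a  # simple linear dynamics
        total_reward += -abs(x)
    return total_reward, trace

best_reward, best_code = float("-inf"), None
feedback = ""
for it in range(5):
    prompt = f"Improve the current control strategy.\n{feedback}"
    code = query_llm(prompt, it)
    namespace = {}
    exec(code, namespace)  # compile the proposed policy function
    reward, trace = evaluate(namespace["policy"])
    # Sub-sample the sensory-motor trace (every 10th step) for the next prompt,
    # mirroring the paper's use of condensed evaluation data as feedback.
    feedback = f"reward={reward:.2f}, samples={trace[::10]}"
    if reward > best_reward:
        best_reward, best_code = reward, code

print(f"best reward: {best_reward:.2f}")
```

In this sketch the highest-gain controller wins because it drives the toy state to zero fastest; in the actual framework the LLM must infer such improvements from the textual feedback rather than from a scripted schedule.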

Results and Discussion

The framework was validated across several dynamic control tasks, including classic benchmarks from the Gymnasium library and the inverted pendulum task from MuJoCo. Four different LLMs were evaluated, with Qwen2.5:72B demonstrating superior performance and achieving optimal or high-performing solutions on most tasks. Notably, the iterative process proved effective at refining control strategies: the initial strategies served only as a baseline for subsequent improvement.

Despite encouraging results, the authors highlight key limitations. Notably, the sensory-motor data used for feedback were substantially sub-sampled, potentially omitting crucial insights needed for policy enhancement. This constraint may be particularly limiting in tasks without explicit failure conditions, such as the Pendulum, where capturing meaningful data snapshots is challenging. Future work should prioritize optimizing data usage and exploring automation in prompt construction, for instance using AutoPrompt techniques, which could further improve the efficacy and adaptability of this approach.
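The sub-sampling limitation noted above can be made concrete with a small illustration (mine, not the paper's): a uniform-stride reduction of an episode trace to fit a prompt budget can skip the short window in which a failure actually develops.

```python
# Illustrative only: sensory-motor traces must be heavily sub-sampled to fit
# an LLM's context window. A uniform stride, as below, keeps evenly spaced
# snapshots but can miss brief, decisive events between the kept steps,
# which is one way sub-sampling may omit crucial information.

def subsample(trace, max_items):
    # Keep at most `max_items` entries, evenly spaced, always including
    # the first and last step of the episode.
    if len(trace) <= max_items:
        return trace
    stride = (len(trace) - 1) / (max_items - 1)
    return [trace[round(i * stride)] for i in range(max_items)]

episode = [(t, 0.01 * t) for t in range(500)]  # 500-step (time, obs) trace
snapshot = subsample(episode, 10)
print(len(snapshot), snapshot[0][0], snapshot[-1][0])
```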

Implications and Future Directions

This research opens avenues for deploying LLMs in real-world embodied control scenarios, offering a scalable approach that minimizes the need for large, labor-intensive datasets while maintaining robust adaptability through reasoning and iterative learning. The promising results suggest potential applications in various domains, including robotics, simulated environments, and interactive agent systems. Further advancements could explore dual-agent architectures where one model critiques and improves action strategies generated by another, potentially enhancing problem-solving capabilities through feedback loops.
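The dual-agent idea mentioned above could be structured roughly as follows. This is a speculative sketch of the future direction, not anything implemented in the paper; both LLM calls are stubbed with hypothetical placeholder functions.

```python
# Hypothetical dual-agent sketch: an "actor" model proposes a control
# strategy and a "critic" model reviews and amends it before evaluation.
# Real systems would replace both stubs with LLM API calls.

def actor_llm(prompt):
    # Stub for the strategy-generating model.
    return "apply torque proportional to the negative pole angle"

def critic_llm(strategy):
    # Stub for the reviewing model, which appends a safety refinement.
    return strategy + ", clamping the torque to the actuator limits"

proposal = actor_llm("Design a controller for the inverted pendulum.")
refined = critic_llm(proposal)
print(refined)
```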

In summary, the paper underscores the potential of LLMs to synthesize complex control policies autonomously, effectively bridging symbolic and sub-symbolic reasoning. While challenges remain in optimizing sensory-motor feedback utilization and automating prompt generation, the framework presented represents a valuable contribution to the field of autonomous control systems using AI, with implications for both practical applications and theoretical advancements in machine learning and embodied intelligence.
