Overview of "Large Language Models as General Pattern Machines"
The paper "Large Language Models as General Pattern Machines" investigates the potential of LLMs as general-purpose sequence modelers, driven by their in-context learning capabilities. The authors argue that LLMs can complete not only complex linguistic sequences but also abstract, symbolic patterns. The paper explores this proposition through a series of experiments, demonstrating applications in robotics organized around three capabilities: sequence transformation, sequence completion, and sequence improvement. The implications suggest intriguing cross-domain transfer, where pattern-completion abilities learned from language data could inform robotic control policies.
Key Findings
- Pattern Completion Beyond Language: The research highlights LLMs' surprising proficiency at completing sequence patterns drawn from probabilistic context-free grammars (PCFGs) and the Abstraction and Reasoning Corpus (ARC). Notably, performance was largely preserved even when the patterns' symbols were remapped to arbitrary tokens from the model's vocabulary, indicating a degree of token invariance in how patterns are recognized and extended (a sketch of this remapping follows the list below).
- Applications in Robotics:
  - Sequence Transformation: Evaluated on ARC tasks, where LLMs generalized spatial transformations of small grids, solving more problems than some program-synthesis baselines that depend on hand-crafted domain-specific languages (DSLs). The capability was probed further with a new PCFG-based benchmark, on which accuracy improved with model scale (an illustrative ARC-style prompt appears below the list).
  - Sequence Completion: LLMs robustly extrapolated discretized periodic functions, a capability the authors carried over to robotic tasks such as continuing a table-sweeping motion and finishing a whiteboard drawing. Larger models and longer context windows generally improved accuracy (see the discretization sketch below).
  - Sequence Improvement: The paper explored improving trajectories through reward-conditioned prompting and in-context learning, exemplified by navigating a grid world and stabilizing CartPole. These tasks showed that LLMs can propose progressively better action sequences across iterative online episodes, amounting to a rudimentary form of reinforcement learning (a minimal loop is sketched below).
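
A concrete way to picture the token-invariance finding: take a few-shot symbolic pattern, remap every symbol to an arbitrary token, and check whether the completion still follows the rule. The sketch below is illustrative; `llm_complete` is a hypothetical stand-in for any text-completion API, and the reverse-the-sequence rule and word list are invented for the example.

```python
import random

def llm_complete(prompt: str) -> str:
    """Hypothetical stand-in for an LLM text-completion call."""
    raise NotImplementedError("plug in an actual LLM API here")

# A toy symbolic pattern: every input -> output pair reverses the sequence.
pairs = [("1 2 3", "3 2 1"), ("4 5 6", "6 5 4")]
query = "7 8 9"

# Remap each symbol to an arbitrary word, preserving only the structure.
words = ["foo", "bar", "baz", "qux", "zip", "zap", "lem", "tok", "wug"]
random.shuffle(words)
mapping = {str(i + 1): words[i] for i in range(9)}

def remap(seq: str) -> str:
    return " ".join(mapping[tok] for tok in seq.split())

prompt = "\n".join(f"{remap(a)} -> {remap(b)}" for a, b in pairs)
prompt += f"\n{remap(query)} ->"
print(prompt)
# The paper reports that completions often still follow the abstract rule,
# i.e. the model should continue with remap("9 8 7").
```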
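For the ARC-style transformation experiments, grids are flattened into token sequences and presented as few-shot input/output pairs. The serialization below is an illustrative guess at the general recipe, not the paper's exact prompt format; the mirror rule is a made-up stand-in for an ARC task.

```python
def serialize(grid) -> str:
    """Render a grid of integer cell values, one row per line."""
    return "\n".join(" ".join(str(cell) for cell in row) for row in grid)

# Toy rule standing in for an ARC task: mirror each row horizontally.
train = [
    ([[1, 0], [2, 3]], [[0, 1], [3, 2]]),
    ([[4, 5], [0, 6]], [[5, 4], [6, 0]]),
]
test_input = [[7, 0], [0, 8]]

parts = []
for inp, out in train:
    parts.append(f"input:\n{serialize(inp)}\noutput:\n{serialize(out)}")
parts.append(f"input:\n{serialize(test_input)}\noutput:\n")
prompt = "\n\n".join(parts)
print(prompt)  # the LLM is asked to continue with the mirrored test grid
```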
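The sequence-completion setup can be sketched as: sample a periodic signal, quantize it into integer tokens, show the model a prefix, and decode its continuation back to real values for the robot to track. The bin count, value range, and window length below are illustrative choices, not values taken from the paper.

```python
import math

N_BINS = 100  # illustrative quantization resolution

def to_token(x: float, lo: float = -1.0, hi: float = 1.0) -> int:
    """Quantize a value in [lo, hi] to an integer bin index."""
    return min(N_BINS - 1, int((x - lo) / (hi - lo) * N_BINS))

def from_token(k: int, lo: float = -1.0, hi: float = 1.0) -> float:
    """Map a bin index back to the bin's midpoint value."""
    return lo + (k + 0.5) / N_BINS * (hi - lo)

# A periodic trajectory (e.g., one axis of a sweeping motion).
trajectory = [math.sin(2 * math.pi * t / 25) for t in range(75)]
tokens = [to_token(x) for x in trajectory]

# The prompt is just the token prefix; the LLM's continuation is decoded
# with from_token() and handed to the motion controller.
prompt = ", ".join(map(str, tokens)) + ","
print(prompt[:60] + " ...")
```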
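The sequence-improvement idea can be rendered as a small online loop: keep past episodes sorted by return in the context, then prompt the model with a return slightly above the best seen, so that completing the prompt asks it to extrapolate a better trajectory. This is a minimal sketch assuming hypothetical `llm_complete` and `env_rollout` stubs, not the paper's released code; the dummy stubs just let the loop execute end to end.

```python
import random

def llm_complete(prompt: str) -> str:
    """Stub LLM: emits a random action string. Replace with a real API call."""
    return " ".join(random.choice(["L", "R"]) for _ in range(20))

def env_rollout(actions: str) -> int:
    """Stub environment: returns a dummy score. Replace with e.g. CartPole."""
    return sum(1 for a in actions.split() if a == "R")

episodes: list[tuple[int, str]] = []  # (return, action sequence)

for _ in range(10):
    episodes.sort()  # lowest-to-highest return, so the best sits last in context
    context = "\n".join(f"return {r}: {a}" for r, a in episodes[-5:])
    best = episodes[-1][0] if episodes else 0
    # Condition on a return one above the best seen, nudging the model
    # to complete the prompt with an improved action sequence.
    prompt = f"{context}\nreturn {best + 1}:"
    actions = llm_complete(prompt).strip()
    episodes.append((env_rollout(actions), actions))

print("best return found:", max(r for r, _ in episodes))
```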
Implications and Future Directions
The implications of this paper are multifaceted. The finding that LLMs can operate on non-linguistic patterns opens new avenues for transferring their capabilities to robotics and other domains that involve sequence manipulation. Their ability to recognize and extend patterns over arbitrary tokens suggests a form of general pattern reasoning that is not tied to specific surface forms in the training data, potentially leading to generalist models that serve both language and robotics applications.
However, deploying LLMs in practical systems is currently constrained by latency, limited context length, and computational cost. Overcoming these limitations will require advances in model efficiency and real-time integration. Future research might examine how multi-task pre-training on diverse modalities can further cultivate these pattern-recognition skills, potentially culminating in seamless cross-domain capabilities.
Ultimately, while LLMs exhibit these abilities out of the box, integrating them into specialized, domain-specific tasks may still require fine-tuning or joint training with domain-specific data. Exploring these models in embodied AI scenarios, such as assistive robotics or real-time control systems, presents a promising area for future work.