Medium- and long-run performance comparison of LLM-based and Q-learning pricing agents
Compare the relative performance in the medium and long run of GPT-4-based pricing agents using prompt prefixes P1 and P2 and Q-learning pricing agents in the repeated Bertrand duopoly environment with logit demand, including outcomes such as prices and profits.
References
The comparison of the relative performance of these pricing agents in the medium and long run is left for future research.
— Algorithmic Collusion by Large Language Models
(2404.00806 - Fish et al., 31 Mar 2024) in Section 6.3 (Asymmetric Pricing Algorithms), footnote