The Economic Implications of Large Language Model Selection on Earnings and Return on Investment: A Decision Theoretic Model (2405.17637v1)
Abstract: Selecting LLMs in business contexts requires a careful analysis of the final financial benefits of the investment. However, the emphasis of academia and industry analysis of LLM is solely on performance. This work introduces a framework to evaluate LLMs, focusing on the earnings and return on investment aspects that should be taken into account in business decision making. We use a decision-theoretic approach to compare the financial impact of different LLMs, considering variables such as the cost per token, the probability of success in the specific task, and the gain and losses associated with LLMs use. The study reveals how the superior accuracy of more expensive models can, under certain conditions, justify a greater investment through more significant earnings but not necessarily a larger RoI. This article provides a framework for companies looking to optimize their technology choices, ensuring that investment in cutting-edge technology aligns with strategic financial objectives. In addition, we discuss how changes in operational variables influence the economics of using LLMs, offering practical insights for enterprise settings, finding that the predicted gain and loss and the different probabilities of success and failure are the variables that most impact the sensitivity of the models.
- Christopher M. Bishop. Pattern Recognition and Machine Learning (Information Science and Statistics). Springer-Verlag, Berlin, Heidelberg, 2006. ISBN 0387310738.
- A survey on evaluation of large language models, 2023.
- Grabriel Frahm. Rational Choice and Strategic Conflict: The Subjectivistic Approach to Game Decision and Theory. De Gruiter, Berlin, 2019.
- Anuj Gupta. Ai maturity continuum: A three step framework to understand return on investment (roi) in ai. SSRN Electronic Journal, 2024. ISSN 1556-5068. doi: 10.2139/ssrn.4754694. URL http://dx.doi.org/10.2139/ssrn.4754694.
- SALib: An open-source python library for sensitivity analysis. The Journal of Open Source Software, 2(9), 1 2017. doi: 10.21105/joss.00097. URL https://doi.org/10.21105/joss.00097.
- Toward SALib 2.0: Advancing the accessibility and interpretability of global sensitivity analyses. Socio-Environmental Systems Modelling, 4:18155, 5 2022. doi: 10.18174/sesmo.18155. URL https://sesmo.org/article/view/18155.
- LLMLingua: Compressing prompts for accelerated inference of large language models. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 13358–13376. Association for Computational Linguistics, December 2023a. doi: 10.18653/v1/2023.emnlp-main.825. URL https://aclanthology.org/2023.emnlp-main.825.
- LongLLMLingua: Accelerating and enhancing llms in long context scenarios via prompt compression. ArXiv preprint, abs/2310.06839, 2023b. URL https://arxiv.org/abs/2310.06839.
- Large language models in aws. In 2024 1st International Conference on Robotics, Engineering, Science, and Technology (RESTCON), pages 112–117, 2024. doi: 10.1109/RESTCON60981.2024.10463557.
- More agents is all you need, 2024.
- OpenAI. OpenAI Pricing. https://openai.com/pricing, 2024.
- Llmlingua-2: Data distillation for efficient and faithful task-agnostic prompt compression, 2024.
- Chan S. Park. Fundamentals of Engineering Economics. Pearson, 3 edition, 2013.
- Project Management Institute. A Guide to the Project Management Body of Knowledge (PMBOK® Guide). Project Management Institute, Newtown Square, Pennsylvania, 7 edition, 2021. ISBN 978-1628256642.
- A systematic survey of prompt engineering in large language models: Techniques and applications, 2024.
- Sensitivity Analysis in Practice: A Guide to Assessing Scientific Models. John Wiley & Sons, 2004.
- Global Sensitivity Analysis: The Primer. Wiley-Interscience, 2008.
- Leonard J. Savage. The Foundations of Statistics. Dover Publications, New York, 2 edition, 1954.
- Towards optimizing the costs of llm usage, 2024.
- Ilya M Sobol. Global sensitivity indices for nonlinear mathematical models and their monte carlo estimates. Mathematics and computers in simulation, 55(1-3):271–280, 2001.
- A survey of large language models, 2023.
- Recommender systems in the era of large language models (llms), 2024.