Better Call GPT, Comparing Large Language Models Against Lawyers (2401.16212v1)
Abstract: This paper presents a groundbreaking comparison between Large Language Models (LLMs) and traditional legal contract reviewers: Junior Lawyers and Legal Process Outsourcers (LPOs). We dissect whether LLMs can outperform humans in accuracy, speed, and cost efficiency during contract review. Our empirical analysis benchmarks LLMs against a ground truth set by Senior Lawyers, uncovering that advanced models match or exceed human accuracy in determining legal issues. In speed, LLMs complete reviews in mere seconds, eclipsing the hours required by their human counterparts. In cost, LLMs operate at a fraction of the price, offering a staggering 99.97 percent reduction in cost over traditional methods. These results are not just statistics; they signal a seismic shift in legal practice. LLMs stand poised to disrupt the legal industry, enhancing the accessibility and efficiency of legal services. Our research asserts that the era of LLM dominance in legal contract review is upon us, challenging the status quo and calling for a reimagined future of legal workflows.