Automated Review Generation Method Based on Large Language Models (2407.20906v1)
Abstract: Literature research, vital for scientific advancement, is overwhelmed by the vast ocean of available information. Addressing this, we propose an automated review generation method based on LLMs to streamline literature processing and reduce cognitive load. In case study on propane dehydrogenation (PDH) catalysts, our method swiftly generated comprehensive reviews from 343 articles, averaging seconds per article per LLM account. Extended analysis of 1041 articles provided deep insights into catalysts' composition, structure, and performance. Recognizing LLMs' hallucinations, we employed a multi-layered quality control strategy, ensuring our method's reliability and effective hallucination mitigation. Expert verification confirms the accuracy and citation integrity of generated reviews, demonstrating LLM hallucination risks reduced to below 0.5% with over 95% confidence. Released Windows application enables one-click review generation, aiding researchers in tracking advancements and recommending literature. This approach showcases LLMs' role in enhancing scientific research productivity and sets the stage for further exploration.
- Lawrence S. Free online availability substantially increases a paper’s impact. Nature 411, 521 (2001).
- Lok C. Speed reading: scientists are struggling to make sense of the expanding scientific literature. Corie Lok asks whether computational tools can do the hard work for them. Nature 463, 416-419 (2010).
- Batra SR. Emerging materials intelligence ecosystems propelled by machine learning. Nature Reviews Materials, (2020).
- Boschen I. Evaluation of JATSdecoder as an automated text extraction tool for statistical results in scientific reports. Sci Rep 11, 19525 (2021).
- White AD. The future of chemistry is language. Nature Reviews Chemistry 7, 457-458 (2023).
- Sanderson K. GPT-4 is here: what scientists think. Nature 615, 773-773 (2023).