Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Large Language Models as Master Key: Unlocking the Secrets of Materials Science with GPT (2304.02213v5)

Published 5 Apr 2023 in cs.CL and cs.AI

Abstract: The amount of data has growing significance in exploring cutting-edge materials and a number of datasets have been generated either by hand or automated approaches. However, the materials science field struggles to effectively utilize the abundance of data, especially in applied disciplines where materials are evaluated based on device performance rather than their properties. This article presents a new NLP task called structured information inference (SII) to address the complexities of information extraction at the device level in materials science. We accomplished this task by tuning GPT-3 on an existing perovskite solar cell FAIR (Findable, Accessible, Interoperable, Reusable) dataset with 91.8% F1-score and extended the dataset with data published since its release. The produced data is formatted and normalized, enabling its direct utilization as input in subsequent data analysis. This feature empowers materials scientists to develop models by selecting high-quality review articles within their domain. Additionally, we designed experiments to predict the electrical performance of solar cells and design materials or devices with targeted parameters using LLMs. Our results demonstrate comparable performance to traditional machine learning methods without feature selection, highlighting the potential of LLMs to acquire scientific knowledge and design new materials akin to materials scientists.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (11)
  1. Tong Xie (18 papers)
  2. Yuwei Wan (9 papers)
  3. Wei Huang (318 papers)
  4. Yufei Zhou (9 papers)
  5. Yixuan Liu (41 papers)
  6. Qingyuan Linghu (4 papers)
  7. Shaozhou Wang (5 papers)
  8. Chunyu Kit (10 papers)
  9. Clara Grazian (29 papers)
  10. Wenjie Zhang (138 papers)
  11. Bram Hoex (9 papers)
Citations (50)