From Text to Insight: Large Language Models for Materials Science Data Extraction (2407.16867v2)

Published 23 Jul 2024 in cond-mat.mtrl-sci and cs.LG

Abstract: The vast majority of materials science knowledge exists in unstructured natural language, yet structured data is crucial for innovative and systematic materials design. Traditionally, the field has relied on manual curation and partial automation for data extraction for specific use cases. The advent of LLMs represents a significant shift, potentially enabling efficient extraction of structured, actionable data from unstructured text by non-experts. While applying LLMs to materials science data extraction presents unique challenges, domain knowledge offers opportunities to guide and validate LLM outputs. This review provides a comprehensive overview of LLM-based structured data extraction in materials science, synthesizing current knowledge and outlining future directions. We address the lack of standardized guidelines and present frameworks for leveraging the synergy between LLMs and materials science expertise. This work serves as a foundational resource for researchers aiming to harness LLMs for data-driven materials research. The insights presented here could significantly enhance how researchers across disciplines access and utilize scientific information, potentially accelerating the development of novel materials for critical societal needs.

Authors (8)
  1. Mara Schilling-Wilhelmi (5 papers)
  2. Martiño Ríos-García (5 papers)
  3. Sherjeel Shabih (1 paper)
  4. María Victoria Gil (3 papers)
  5. Santiago Miret (36 papers)
  6. Christoph T. Koch (18 papers)
  7. José A. Márquez (4 papers)
  8. Kevin Maik Jablonka (11 papers)
Citations (6)