Papers

Topics

Authors

Recent

View all

Assistant

AI Research Assistant

Well-researched responses based on relevant abstracts and paper content.

Custom Instructions Pro

Preferences or requirements that you'd like Emergent Mind to consider when generating responses.

Gemini 2.5 Flash

Gemini 2.5 Flash 70 tok/s

Gemini 2.5 Pro 48 tok/s Pro

GPT-5 Medium 27 tok/s Pro

GPT-5 High 24 tok/s Pro

GPT-4o 75 tok/s Pro

Kimi K2 175 tok/s Pro

GPT OSS 120B 447 tok/s Pro

Claude Sonnet 4 36 tok/s Pro

2000 character limit reached

Benchmarking Materials Property Prediction Methods: The Matbench Test Set and Automatminer Reference Algorithm (2005.00707v2)

Published 2 May 2020 in cond-mat.mtrl-sci and physics.comp-ph

Abstract: We present a benchmark test suite and an automated machine learning procedure for evaluating supervised ML models for predicting properties of inorganic bulk materials. The test suite, Matbench, is a set of 13 ML tasks that range in size from 312 to 132k samples and contain data from 10 density functional theory-derived and experimental sources. Tasks include predicting optical, thermal, electronic, thermodynamic, tensile, and elastic properties given a materials composition and/or crystal structure. The reference algorithm, Automatminer, is a highly-extensible, fully-automated ML pipeline for predicting materials properties from materials primitives (such as composition and crystal structure) without user intervention or hyperparameter tuning. We test Automatminer on the Matbench test suite and compare its predictive power with state-of-the-art crystal graph neural networks and a traditional descriptor-based Random Forest model. We find Automatminer achieves the best performance on 8 of 13 tasks in the benchmark. We also show our test suite is capable of exposing predictive advantages of each algorithm - namely, that crystal graph methods appear to outperform traditional machine learning methods given ~10⁴ or greater data points. The pre-processed, ready-to-use Matbench tasks and the Automatminer source code are open source and available online (http://hackingmaterials.lbl.gov/automatminer/). We encourage evaluating new materials ML algorithms on the MatBench benchmark and comparing them against the latest version of Automatminer.

Citations (241)

View on Semantic Scholar

Summary

The paper introduces Matbench, a diverse benchmark suite spanning 13 materials property tasks for standardized model evaluation.
It proposes Automatminer, an automated ML pipeline that achieves superior performance on 8 out of 13 tasks compared to state-of-the-art methods.
Empirical results highlight the predictive strength of graph neural networks on larger datasets, paving the way for advances in materials informatics.

Benchmarking Materials Property Prediction with Matbench and Automatminer

The paper "Benchmarking Materials Property Prediction: The Matbench Test Set and Automatminer Reference Algorithm" introduces a comprehensive benchmarking framework for evaluating the predictive performance of supervised ML models in materials science. The paper presents Matbench, a benchmark test suite comprising 13 distinct tasks focused on predicting properties of inorganic bulk materials, such as optical, thermal, electronic, thermodynamic, tensile, and elastic properties. Additionally, Automatminer, an automated ML pipeline, is proposed as a reference algorithm to facilitate the prediction of material properties with minimal user intervention.

Core Contributions

The authors identify two primary components required for advancing the field of materials informatics: a robust benchmark test suite and an automatic reference model. Here, Matbench serves as a diversified collection of curated datasets spanning a range of sample sizes and sources, including density functional theory and experimental data. Automatminer, the proposed reference algorithm, offers a fully automated ML pipeline capable of handling various materials prediction tasks without necessitating domain-specific expertise.

Key Findings

The empirical results underscore the efficacy of Automatminer as it demonstrates superior performance on 8 out of the 13 tasks when benchmarked against state-of-the-art models, such as crystal graph neural networks (CGCNN) and Random Forest (RF). These models were evaluated using five-fold Nested Cross Validation (NCV), ensuring robust error estimation and minimizing biases in model comparison. The paper also sheds light on the substantial predictive advantage exhibited by crystal graph methods when larger datasets are used.

Crucially, the Automatminer Express preset showed that an automated ML pipeline could achieve results comparable to those obtained through labor-intensive hand-optimized models. The performance of different algorithms was also examined relative to dataset size, revealing trends where graph neural networks predominantly excel with increasing dataset sizes.

Implications for Future Research

The introduction of Matbench and Automatminer addresses critical challenges in the materials informatics domain, such as the need for standardized benchmarking protocols and automated model development pipelines. The extensible design of Automatminer paves the way for future enhancements, including incorporation of more advanced featurization techniques and expanding the model space to include complex algorithms like deep neural networks and support vector machine kernels.

On a broader scale, the work demonstrates how standardized benchmarks can drive innovation by providing researchers with reliable platforms for model evaluation and comparison. This approach is instrumental in catalyzing developments across various subfields within materials science, including defect calculations and high-throughput experimental techniques.

Limitations and Outlook

While Automatminer and Matbench effectively serve as a baseline for materials property prediction tasks, the paper acknowledges the limitations inherent in the current framework. For instance, the reliance on NCV for error estimation may not optimally reflect real-world applications where domain-specific validation approaches could yield more accurate predictions. Additionally, Matbench is positioned as a versioned resource that will evolve, indicating that the benchmarking framework will progressively incorporate emerging datasets and tasks to remain relevant.

In conclusion, the paper presents a well-structured and detailed benchmark suite and toolset that arguably contribute to the systematic advancement of machine learning in materials science. Researchers are encouraged to utilize Matbench and Automatminer to evaluate new predictive algorithms and foster collaborative advancements in material informatics. The release of these open-source tools marks a significant step towards accelerating the discovery and development of novel materials.