Interpretable Multi-Source Data Fusion Through Latent Variable Gaussian Process (2402.04146v4)
Abstract: With the advent of artificial intelligence and machine learning, various domains of science and engineering communities have leveraged data-driven surrogates to model complex systems through fusing numerous sources of information (data) from published papers, patents, open repositories, or other resources. However, not much attention has been paid to the differences in quality and comprehensiveness of the known and unknown underlying physical parameters of the information sources, which could have downstream implications during system optimization. Additionally, existing methods cannot fuse multi-source data into a single predictive model. Towards resolving this issue, a multi-source data fusion framework based on Latent Variable Gaussian Process (LVGP) is proposed. The individual data sources are tagged as a characteristic categorical variable that are mapped into a physically interpretable latent space, allowing the development of source-aware data fusion modeling. Additionally, a dissimilarity metric based on the latent variables of LVGP is introduced to study and understand the differences in the sources of data. The proposed approach is demonstrated on and analyzed through two mathematical and two materials science case studies. From the case studies, it is observed that compared to using single-source and source unaware machine learning models, the proposed multi-source data fusion framework can provide better predictions for sparse-data problems.
- “Applications of artificial intelligence in engineering and manufacturing: A systematic review” In Journal of Intelligent Manufacturing 33.6 Springer, 2022, pp. 1581–1601
- “A survey on machine learning for data fusion” In Information Fusion 57 Elsevier, 2020, pp. 115–129
- “Data fusion” In ACM computing surveys (CSUR) 41.1 ACM New York, NY, USA, 2009, pp. 1–41
- Jingren Zhou, Xin Hong and Peiquan Jin “Information fusion for multi-source material data: Progress and challenges” In Applied Sciences 9.17 MDPI, 2019, pp. 3473
- “On Uncertainty Quantification in Materials Modeling and Discovery: Applications of GE’s BHM and IDACE” In AIAA SCITECH 2023 Forum
- S.K. Ravi, P. Dong and Z. Wei “Data-driven modeling of multiaxial fatigue in frequency domain” In Marine Structures 84 Elsevier, 2022, pp. 103201
- “Elucidating precipitation in FeCrAl alloys through explainable AI: A case study” In Computational Materials Science 230 Elsevier, 2023, pp. 112440
- “Multifidelity information fusion with machine learning: A case study of dopant formation energies in hafnia” In ACS applied materials & interfaces 11.28 ACS Publications, 2019, pp. 24906–24918
- Ghanshyam Pilania, James E Gubernatis and Turab Lookman “Multi-fidelity machine learning models for accurate bandgap predictions of solids” In Computational Materials Science 129 Elsevier, 2017, pp. 156–163
- “Data-driven materials science: status, challenges, and perspectives” In Advanced Science 6.21 Wiley Online Library, 2019, pp. 1900808
- “The materials data facility: data services to advance materials science research” In Jom 68.8 Springer, 2016, pp. 2045–2052
- “The materials commons: a collaboration platform and information repository for the global materials community” In Jom 68 Springer, 2016, pp. 2035–2044
- “A repository for the publication and sharing of heterogeneous materials data” In Scientific Data 9.1 Nature Publishing Group UK London, 2022, pp. 787
- “The Open Quantum Materials Database (OQMD): assessing the accuracy of DFT formation energies” In npj Computational Materials 1.1 Nature Publishing Group, 2015, pp. 1–15
- “Commentary: The Materials Project: A materials genome approach to accelerating materials innovation” In APL materials 1.1 AIP Publishing, 2013
- ““Everyone wants to do the model work, not the data work”: Data Cascades in High-Stakes AI” In proceedings of the 2021 CHI Conference on Human Factors in Computing Systems, 2021, pp. 1–15
- “Advances, challenges and opportunities in creating data for trustworthy AI” In Nature Machine Intelligence 4.8 Nature Publishing Group UK London, 2022, pp. 669–677
- “A latent variable approach to Gaussian process modeling with qualitative and quantitative factors” In Technometrics 62.3 Taylor & Francis, 2020, pp. 291–302
- “Data centric nanocomposites design via mixed-variable Bayesian optimization” In Molecular Systems Design & Engineering 5.8 Royal Society of Chemistry, 2020, pp. 1376–1390
- “Featureless adaptive optimization accelerates functional electronic materials design” In Applied Physics Reviews 7.4 AIP Publishing, 2020
- Yichi Zhang, Daniel W Apley and Wei Chen “Bayesian optimization for materials design with mixed quantitative and qualitative variables” In Scientific reports 10.1 Nature Publishing Group UK London, 2020, pp. 4924
- “A Latent Variable Approach for Non-Hierarchical Multi-Fidelity Adaptive Sampling” In Computer Methods in Applied Mechanics and Engineering 421, 2024, pp. 116773 DOI: https://doi.org/10.1016/j.cma.2024.116773
- Yigitcan Comlek, Liwei Wang and Wei Chen “Mixed-Variable Global Sensitivity Analysis for Knowledge Discovery And Efficient Combinatorial Materials Design (IDETC2023-110756)” In Journal of Mechanical Design, 2023, pp. 1–31
- Christopher K Williams and Carl Edward Rasmussen “Gaussian processes for machine learning” In the MIT Press 2.3, 2006, pp. 4
- David Ackley “A connectionist machine for genetic hillclimbing” Springer science & business media, 2012
- “Effect of aluminum on the FeCr (Al) alloy oxidation resistance in steam environment at low temperature (400 C) and high temperature (1200 C)” In Corrosion Science 209 Elsevier, 2022, pp. 110765
- “Mapping of 475 C embrittlement in ferritic Fe–Cr–Al alloys” In Scripta Materialia 63.11 Elsevier, 2010, pp. 1104–1107
- Kevin G Field, Kenneth C Littrell and Samuel A Briggs “Precipitation of α′superscript𝛼′\alpha^{\prime}italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT in neutron irradiated commercial FeCrAl alloys” In Scripta Materialia 142 Elsevier, 2018, pp. 41–45
- “Phase separation in PM 2000™ Fe-base ODS alloy: Experimental study at the atomic level” In Materials Science and Engineering: A 490.1-2 Elsevier, 2008, pp. 277–288
- C Capdevila, Michael K Miller and Kaye F Russell “Aluminum partitioning during phase separation in Fe–20% Cr–6% Al ODS alloy” In Journal of materials science 43 Springer, 2008, pp. 3889–3893
- “The effect of Al on the 475 C embrittlement of Fe–Cr alloys” In Computational materials science 74 Elsevier, 2013, pp. 101–106
- “Effect of Cr/Al contents on the 475ºC age-hardening in oxide dispersion strengthened ferritic steels” In Nuclear Materials and Energy 9 Elsevier, 2016, pp. 610–615
- “Effects of Al on Alpha Prime Formation in FeCrAl Alloys” In Proceedings of the TopFuel, 2021
- “Understanding oxidation of Fe-Cr-Al alloys through explainable artificial intelligence” In MRS communications Springer, 2023, pp. 1–7
- “Data-driven predictive modeling of FeCrAl oxidation” In Materials Letters: X Elsevier, 2023, pp. 100183
- “Optimizing chemistry for designing oxidation resistant FeCrAl alloys” In MRS Advances 8.1 Springer, 2023, pp. 21–26
- “400 C aging embrittlement of FeCrAl alloys: Microstructure and fracture behavior” In Materials Science and Engineering: A 743 Elsevier, 2019, pp. 159–167
- “Microstructural stability of Fe–Cr–Al alloys at 450–550 C” In Journal of Nuclear Materials 457 Elsevier, 2015, pp. 291–297
- “Aluminum suppression of α′superscript𝛼′\alpha^{\prime}italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT precipitate in model Fe–Cr–Al alloys during long-term aging at 475 C” In Materials Science and Engineering: A 772 Elsevier, 2020, pp. 138714
- Carlos Capdevila, Michael K Miller and Jesús Chao “Phase separation kinetics in a Fe–Cr–Al alloy” In Acta materialia 60.12 Elsevier, 2012, pp. 4673–4684
- “Sensitivity of thermo-electric power measurements to α𝛼\alphaitalic_α–α′superscript𝛼′\alpha^{\prime}italic_α start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT phase separation in Cr-rich oxide dispersion strengthened steels” In Journal of Materials Science 50 Springer, 2015, pp. 4629–4635
- “Current progress and future challenges in rare-earth-free permanent magnets” In Acta Materialia 158 Elsevier, 2018, pp. 118–137
- Hideyuki Yoshikawa Teruhiko Fujiwara “Rare earth-cobalt permanent magnet” Tokin Coorporation, 2015 URL: https://patents.google.com/patent/US20150262740A1/en?oq=U.S.+Patent+Application+No.+14%5C%2f643%5C%2c875.
- “Permanent-magnet properties of Sm-Ce-Co-Fe-Cu alloys with compositions between 1-5 and 2-17” In IEEE Transactions on Magnetics 10.2 IEEE, 1974, pp. 313–317