Explainable AutoML (xAutoML) with adaptive modeling for yield enhancement in semiconductor smart manufacturing (2403.12381v1)
Abstract: Enhancing yield is recognized as a paramount driver to reducing production costs in semiconductor smart manufacturing. However, optimizing and ensuring high yield rates is a highly complex and technical challenge, especially while maintaining reliable yield diagnosis and prognosis, and this shall require understanding all the confounding factors in a complex condition. This study proposes a domain-specific explainable automated machine learning technique (termed xAutoML), which autonomously self-learns the optimal models for yield prediction, with an extent of explainability, and also provides insights on key diagnosis factors. The xAutoML incorporates tailored problem-solving functionalities in an auto-optimization pipeline to address the intricacies of semiconductor yield enhancement. Firstly, to capture the key diagnosis factors, knowledge-informed feature extraction coupled with model-agnostic key feature selection is designed. Secondly, combined algorithm selection and hyperparameter tuning with adaptive loss are developed to generate optimized classifiers for better defect prediction, and adaptively evolve in response to shifting data patterns. Moreover, a suite of explainability tools is provided throughout the AutoML pipeline, enhancing user understanding and fostering trust in the automated processes. The proposed xAutoML exhibits superior performance, with domain-specific refined countermeasures, adaptive optimization capabilities, and embedded explainability. Findings exhibit that the proposed xAutoML is a compelling solution for semiconductor yield improvement, defect diagnosis, and related applications.
- Automl: A survey of the state-of-the-art, Knowledge-Based Systems 212 (2021) 106622.
- Xai:explainable artificial intelligence, Science robotics 4 (2019) eaay7120.
- Machine learning based wafer defect detection, in: Design-Process-Technology Co-optimization for Manufacturability XIII, volume 10962, SPIE, 2019, pp. 38–45.
- A data-driven approach to selection of critical process steps in the semiconductor manufacturing process considering missing and imbalanced data, Journal of Manufacturing Systems 52 (2019) 146–156.
- E. LeDell, S. Poirier, H2o automl: Scalable automatic machine learning, in: Proceedings of the AutoML Workshop at ICML, volume 2020, ICML, 2020.
- A. Płońska, P. Płoński, Mljar: State-of-the-art automated machine learning framework for tabular data, Version 0.10 3 (2021).
- A feature learning approach based on xgboost for driving assessment and risk prediction, Accident Analysis & Prevention 129 (2019) 170–179.
- Diversity techniques improve the performance of the best imbalance learning ensembles, Information Sciences 325 (2015) 98–117.
- An insight into classification with imbalanced data: Empirical results and current trends on using data intrinsic characteristics, Information sciences 250 (2013) 113–141.
- Interpretml: A unified framework for machine learning interpretability, arXiv preprint arXiv:1909.09223 (2019).
- S. J. Rigatti, Random forest, Journal of Insurance Medicine 47 (2017) 31–39.
- Efficient and robust automated machine learning, Advances in neural information processing systems 28 (2015).
- J. Bergstra, Y. Bengio, Random search for hyper-parameter optimization., Journal of machine learning research 13 (2012).
- Hyperband: A novel bandit-based approach to hyperparameter optimization, The Journal of Machine Learning Research 18 (2017) 6765–6816.
- Bohb: Robust and efficient hyperparameter optimization at scale, in: International Conference on Machine Learning, PMLR, 2018, pp. 1437–1446.
- T. Chen, C. Guestrin, Xgboost: A scalable tree boosting system, in: Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, 2016, pp. 785–794.
- Lightgbm: A highly efficient gradient boosting decision tree, Advances in neural information processing systems 30 (2017).
- Focal loss for dense object detection, in: Proceedings of the IEEE international conference on computer vision, 2017, pp. 2980–2988.
- Pyod: A python toolbox for scalable outlier detection, arXiv preprint arXiv:1901.01588 (2019).
- Automatic clustering for unsupervised risk diagnosis of vehicle driving for smart road, IEEE Transactions on Intelligent Transportation Systems 23 (2022) 17451–17465.
- An automated machine learning (automl) method of risk prediction for decision-making of autonomous vehicles, IEEE Transactions on Intelligent Transportation Systems 22 (2020) 7145–7154.
- Explainable artificial intelligence (xai): Concepts, taxonomies, opportunities and challenges toward responsible ai, Information fusion 58 (2020) 82–115.
- J. Barnes, Azure machine learning, Microsoft Azure Essentials. 1st ed, Microsoft (2015).
- Amazon sagemaker autopilot: a white box automl solution at scale, in: Proceedings of the fourth international workshop on data management for end-to-end machine learning, 2020, pp. 1–7.
- An ensembled anomaly detector for wafer fault detection, Sensors 21 (2021) 5465.
- Ped-anova: Efficiently quantifying hyperparameter importance in arbitrary subspaces, arXiv preprint arXiv:2304.10255 (2023).
- Xautoml: A visual analytics tool for understanding and validating automated machine learning, ACM Transactions on Interactive Intelligent Systems 13 (2023) 1–39.
- Autoablation: Automated parallel ablation studies for deep learning, in: Proceedings of the 1st Workshop on Machine Learning and Systems, 2021, pp. 55–61.
- Explaining hyperparameter optimization via partial dependence plots, Advances in Neural Information Processing Systems 34 (2021) 2280–2291.
- Cave: Configuration assessment, visualization and evaluation, in: Learning and Intelligent Optimization: 12th International Conference, LION 12, Kalamata, Greece, June 10–15, 2018, Revised Selected Papers 12, Springer, 2019, pp. 115–130.
- Semiconductor manufacturing automation, in: Springer Handbook of Automation, Springer, 2023, pp. 841–863.
- All models are wrong, but many are useful: Learning a variable’s importance by studying an entire class of prediction models simultaneously., J. Mach. Learn. Res. 20 (2019) 1–81.
- Hyperopt: A python library for optimizing the hyperparameters of machine learning algorithms, in: Proceedings of the 12th Python in science conference, volume 13, Citeseer, 2013, p. 20.
- Gain: Missing data imputation using generative adversarial nets, in: International conference on machine learning, PMLR, 2018, pp. 5689–5698.
- Adaptive modelling for anomaly detection and defect diagnosis in semiconductor smart manufacturing: A domain-specific automl, in: 2023 IEEE International Conference on Cybernetics and Intelligent Systems (CIS) and IEEE Conference on Robotics, Automation and Mechatronics (RAM), IEEE, 2023, pp. 198–203.
- T. Nakazawa, D. V. Kulkarni, Wafer map defect pattern classification and image retrieval using convolutional neural network, IEEE Transactions on Semiconductor Manufacturing 31 (2018) 309–314.
- Data mining for yield enhancement in semiconductor manufacturing and an empirical study, Expert Systems with Applications 33 (2007) 192–198.
- Key risk indicators for accident assessment conditioned on pre-crash vehicle trajectory, Accident Analysis & Prevention 117 (2018) 346–356.
- Convolutional neural network for wafer surface defect classification and the detection of unknown defect class, IEEE Transactions on Semiconductor Manufacturing 32 (2019) 163–170. doi:10.1109/TSM.2019.2902657.
- Explainable generative design in manufacturing for reinforcement learning based factory layout planning, Journal of Manufacturing Systems 72 (2024) 74–92. doi:https://doi.org/10.1016/j.jmsy.2023.11.012.
- Hybrid physics-based and data-driven models for smart manufacturing: Modelling, simulation, and explainability, Journal of Manufacturing Systems 63 (2022) 381–391. doi:https://doi.org/10.1016/j.jmsy.2022.04.004.
- Overview of automatic defect classification, in: Proceedings of 1994 IEEE/SEMI Advanced Semiconductor Manufacturing Conference and Workshop (ASMC), 1994, pp. 272–. doi:10.1109/ASMC.1994.588270.
- Smart manufacturing process and system automation–a critical review of the standards and envisioned scenarios, Journal of Manufacturing Systems 56 (2020) 312–325.
- Reference architectures for smart manufacturing: A critical review, Journal of manufacturing systems 49 (2018) 215–225.
- Artificial intelligence in advanced manufacturing: Current status and future outlook, Journal of Manufacturing Science and Engineering 142 (2020) 110804.
- K. A. Tasias, Integrated quality, maintenance and production model for multivariate processes: a bayesian approach, Journal of Manufacturing Systems 63 (2022) 35–51.
- Towards new-generation human-centric smart manufacturing in industry 5.0: A systematic review, Advanced Engineering Informatics 57 (2023) 102121. doi:https://doi.org/10.1016/j.aei.2023.102121.
- Explainable artificial intelligence (xai): Precepts, models, and opportunities for research in construction, Advanced Engineering Informatics 57 (2023) 102024.
- Intelligent machine agent architecture for adaptive control optimization of manufacturing processes, Advanced Engineering Informatics 25 (2011) 783–796.
- Applications of agent-based systems in intelligent manufacturing: An updated review, Advanced engineering Informatics 20 (2006) 415–431.