Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash 88 tok/s
Gemini 2.5 Pro 35 tok/s Pro
GPT-5 Medium 35 tok/s
GPT-5 High 28 tok/s Pro
GPT-4o 93 tok/s
GPT OSS 120B 474 tok/s Pro
Kimi K2 197 tok/s Pro
2000 character limit reached

Explainable data-driven modeling via mixture of experts: towards effective blending of grey and black-box models (2401.17118v1)

Published 30 Jan 2024 in cs.LG, cs.SY, and eess.SY

Abstract: Traditional models grounded in first principles often struggle with accuracy as the system's complexity increases. Conversely, machine learning approaches, while powerful, face challenges in interpretability and in handling physical constraints. Efforts to combine these models often often stumble upon difficulties in finding a balance between accuracy and complexity. To address these issues, we propose a comprehensive framework based on a "mixture of experts" rationale. This approach enables the data-based fusion of diverse local models, leveraging the full potential of first-principle-based priors. Our solution allows independent training of experts, drawing on techniques from both machine learning and system identification, and it supports both collaborative and competitive learning paradigms. To enhance interpretability, we penalize abrupt variations in the expert's combination. Experimental results validate the effectiveness of our approach in producing an interpretable combination of models closely resembling the target phenomena.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (27)
  1. R. Babuška and H. Verbruggen. Neuro-fuzzy methods for nonlinear system identification. Annual reviews in control, 27(1):73–85, 2003.
  2. A new approach to nonlinear modelling of dynamic systems based on fuzzy rules. International Journal of Applied Mathematics and Computer Science, 26(3):603–621, 2016.
  3. A. Beck and L. Tetruashvili. On the convergence of block coordinate descent type methods. SIAM Journal on Optimization, 23(4):2037–2060, 2013.
  4. A. Bemporad. A piecewise linear regression and classification algorithm with application to learning and model predictive control of hybrid systems. IEEE Transactions on Automatic Control, 68(6):3194–3209, 2023.
  5. Fitting jump models. Automatica, 96:11–21, 2018.
  6. H. Bersini and G. Bontempi. Now comes the time to defuzzify neuro-fuzzy models. Fuzzy sets and systems, 90(2):161–169, 1997.
  7. T. Bikmukhametov and J. Jäschke. Combining machine learning and process engineering physics towards enhanced accuracy and explainability of data-driven models. Computers & Chemical Engineering, 138:1–27, 2020.
  8. R. Bischof and M. A. Kraus. Mixture-of-experts-ensemble meta-learning for physics-informed neural networks. In Proceedings of 33. Forum Bauinformatik, 2022.
  9. Distributed optimization and statistical learning via the alternating direction method of multipliers. Found. Trends Mach. Learn., 3(1):1–122, 2011.
  10. S. Boyd and L. Vandenberghe. Convex Optimization. Cambridge University Press, 2004.
  11. L. Breiman. Bagging predictors. Machine learning, 24:123–140, 1996.
  12. Vehicle sideslip estimation via kernel-based lpv identification: Theory and experiments. Automatica, 122:109237, 2020.
  13. Scientific machine learning through physics–informed neural networks: where we are and what’s next. Journal of Scientific Computing, 92(3):1–62, 2022.
  14. Techniques for interpretable machine learning. Communications of the ACM, 63(1):68–77, 2019. doi: 10.1145/3359786.
  15. C. Fantuzzi and R. Rovatti. On the approximation capabilities of the homogeneous takagi-sugeno model. In Proceedings of IEEE 5th International Fuzzy Systems, volume 2, pages 1067–1072, 1996.
  16. Experiments with a new boosting algorithm. In icml, volume 96, pages 148–156. Citeseer, 1996.
  17. Research challenges in modeling and simulation for engineering complex systems. Springer, 2017.
  18. D. Gabay. Chapter ix applications of the method of multipliers to variational inequalities. In M. Fortin and R. Glowinski, editors, Augmented Lagrangian Methods: Applications to the Numerical Solution of Boundary-Value Problems, volume 15 of Studies in Mathematics and Its Applications, pages 299–331. Elsevier, 1983.
  19. F. M. Gray and M. Schmidt. A hybrid approach to thermal building modelling using a combination of gaussian processes and grey-box models. Energy and Buildings, 165:56–63, 2018.
  20. T. Johansen and R. Babuska. Multiobjective identification of takagi-sugeno fuzzy models. IEEE Transactions on Fuzzy Systems, 11(6):847–860, 2003.
  21. Hierarchical mixtures of experts and the em algorithm. Neural computation, 6(2):181–214, 1994.
  22. Physics-informed machine learning. Nature Reviews Physics, 3(6):422–440, 2021.
  23. C. Molnar. Interpretable machine learning. Springer International Publishing, 2020.
  24. A grey-box ensemble model exploiting black-box accuracy and white-box intrinsic interpretability. Algorithms, 13(1):17, 2020.
  25. Q. Xiong and A. Jutan. Grey-box modelling and control of chemical processes. Chemical Engineering Science, 57(6):1027–1039, 2002.
  26. Twenty years of mixture of experts. IEEE Transactions of Neural Networks Learning Systems, 23(8):1177–1193, 2012.
  27. L. Zadeh. Zadeh, fuzzy sets. Fuzzy sets, fuzzy logic, and fuzzy systems, pages 19–34, 1965.
Citations (1)
List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

Summary

We haven't generated a summary for this paper yet.

Dice Question Streamline Icon: https://streamlinehq.com

Follow-up Questions

We haven't generated follow-up questions for this paper yet.

Don't miss out on important new AI/ML research

See which papers are being discussed right now on X, Reddit, and more:

“Emergent Mind helps me see which AI papers have caught fire online.”

Philip

Philip

Creator, AI Explained on YouTube