Papers
Topics
Authors
Recent
Search
2000 character limit reached

Learning to extrapolate using continued fractions: Predicting the critical temperature of superconductor materials

Published 27 Nov 2020 in cs.LG, cond-mat.supr-con, cs.AI, and cs.NE | (2012.03774v3)

Abstract: In the field of AI and Machine Learning (ML), the approximation of unknown target functions $y=f(\mathbf{x})$ using limited instances $S={(\mathbf{x{(i)}},y{(i)})}$, where $\mathbf{x{(i)}} \in D$ and $D$ represents the domain of interest, is a common objective. We refer to $S$ as the training set and aim to identify a low-complexity mathematical model that can effectively approximate this target function for new instances $\mathbf{x}$. Consequently, the model's generalization ability is evaluated on a separate set $T={\mathbf{x{(j)}}} \subset D$, where $T \neq S$, frequently with $T \cap S = \emptyset$, to assess its performance beyond the training set. However, certain applications require accurate approximation not only within the original domain $D$ but also in an extended domain $D'$ that encompasses $D$. This becomes particularly relevant in scenarios involving the design of new structures, where minimizing errors in approximations is crucial. For example, when developing new materials through data-driven approaches, the AI/ML system can provide valuable insights to guide the design process by serving as a surrogate function. Consequently, the learned model can be employed to facilitate the design of new laboratory experiments. In this paper, we propose a method for multivariate regression based on iterative fitting of a continued fraction, incorporating additive spline models. We compare the performance of our method with established techniques, including AdaBoost, Kernel Ridge, Linear Regression, Lasso Lars, Linear Support Vector Regression, Multi-Layer Perceptrons, Random Forests, Stochastic Gradient Descent, and XGBoost. To evaluate these methods, we focus on an important problem in the field: predicting the critical temperature of superconductors based on physical-chemical characteristics.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (22)
  1. Kam Hamidieh. A data-driven statistical model for predicting the critical temperature of a superconductor. Comput. Mater. Sci., 154:346–354, 2018. ISSN 0927-0256. doi:https://doi.org/10.1016/j.commatsci.2018.07.052.
  2. A memetic algorithm for symbolic regression. In IEEE Congress on Evolutionary Computation, CEC 2019, Wellington, New Zealand, June 10-13, 2019, pages 2167–2174. IEEE, 2019. doi:10.1109/CEC.2019.8789889.
  3. Analytic continued fractions for regression: Results on 352 datasets from the physical sciences. In IEEE Congress on Evolutionary Computation, CEC 2020, Glasgow, United Kingdom, July 19-24, 2020, pages 1–8. IEEE, 2020. doi:10.1109/CEC48606.2020.9185564.
  4. Analytic continued fractions for regression: A memetic algorithm approach. Expert Systems with Applications, 179:115018, 2021. ISSN 0957-4174. doi:https://doi.org/10.1016/j.eswa.2021.115018.
  5. Memetic algorithms. In Teofilo F. Gonzalez, editor, Handbook of Approximation Algorithms and Metaheuristics. Chapman and Hall/CRC, 2007. doi:10.1201/9781420010749.ch27. URL https://doi.org/10.1201/9781420010749.ch27.
  6. Pablo Moscato. Memetic algorithms: The untold story. In Ferrante Neri, Carlos Cotta, and Pablo Moscato, editors, Handbook of Memetic Algorithms, volume 379 of Studies in Computational Intelligence, pages 275–309. Springer, 2012. doi:10.1007/978-3-642-23247-3_17. URL https://doi.org/10.1007/978-3-642-23247-3_17.
  7. Memetic algorithms. In Rafael Martí, Panos M. Pardalos, and Mauricio G. C. Resende, editors, Handbook of Heuristics, pages 607–638. Springer, 2018. doi:10.1007/978-3-319-07124-4_29.
  8. An accelerated introduction to memetic algorithms. In Michel Gendreau and Jean-Yves Potvin, editors, Handbook of Metaheuristics, pages 275–309. Springer International Publishing, Cham, 2019. ISBN 978-3-319-91086-4. doi:10.1007/978-3-319-91086-4_9.
  9. Memetic algorithms for business analytics and data science: A brief survey. In Pablo Moscato and Natalie Jane de Vries, editors, Business and Consumer Analytics: New Ideas, pages 545–608. Springer, 2019. doi:10.1007/978-3-030-06222-4_13.
  10. Continued fractions and the Thomson problem. Scientific Reports, 13(1):7272, 2023. ISSN 2045-2322. doi:10.1038/s41598-023-33744-5. URL https://doi.org/10.1038/s41598-023-33744-5.
  11. Data-driven discovery of formulas by symbolic regression. Materials Research Society Bulletin, 44(7):559–564, 2019. doi:10.1557/mrs.2019.156.
  12. Franky Backeljauw and Annie A. M. Cuyt. Algorithm 895: A continued fractions package for special functions. ACM Trans. Math. Softw., 36(3):15:1–15:20, 2009.
  13. Carl de Boor. A practical guide to splines, volume 27. Springer-Verlag New York, 1978.
  14. The Elements of Statistical Learning, pages 139–190. Springer, 2 edition, 2009.
  15. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’16, pages 785–794, New York, NY, USA, 2016. ACM. ISBN 978-1-4503-4232-2. doi:10.1145/2939672.2939785.
  16. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res., 12:2825–2830, 2011.
  17. pygam: Generalized additive models in python, 3 2018. URL https://doi.org/10.5281/zenodo.1208723.
  18. Milton Friedman. The use of ranks to avoid the assumption of normality implicit in the analysis of variance. Journal of the American Statistical Association, 32(200):675–701, 1937.
  19. Janez Demšar. Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res., 7(1):1–30, 2006.
  20. Orange: Data mining toolbox in python. J. Mach. Learn. Res., 14(1):2349–2353, 1 2013. ISSN 1532-4435.
  21. Jacob Cohen. A coefficient of agreement for nominal scales. Educational and Psychological Measurement, 20(1):37–46, 1960.
  22. Jerome H Friedman. Multivariate adaptive regression splines. The Annals of Statistics, 19(1):1–67, 1991.
Citations (7)

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.