Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
156 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Multi-View Symbolic Regression (2402.04298v4)

Published 6 Feb 2024 in cs.LG, astro-ph.IM, and stat.AP

Abstract: Symbolic regression (SR) searches for analytical expressions representing the relationship between a set of explanatory and response variables. Current SR methods assume a single dataset extracted from a single experiment. Nevertheless, frequently, the researcher is confronted with multiple sets of results obtained from experiments conducted with different setups. Traditional SR methods may fail to find the underlying expression since the parameters of each experiment can be different. In this work we present Multi-View Symbolic Regression (MvSR), which takes into account multiple datasets simultaneously, mimicking experimental environments, and outputs a general parametric solution. This approach fits the evaluated expression to each independent dataset and returns a parametric family of functions f(x; theta) simultaneously capable of accurately fitting all datasets. We demonstrate the effectiveness of MvSR using data generated from known expressions, as well as real-world data from astronomy, chemistry and economy, for which an a priori analytical expression is not available. Results show that MvSR obtains the correct expression more frequently and is robust to hyperparameters change. In real-world data, it is able to grasp the group behavior, recovering known expressions from the literature as well as promising alternatives, thus enabling the use of SR to a large range of experimental scenarios.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (32)
  1. 1967. A simplified synthesis for meso-tetraphenylporphine. Journal of organic chemistry. 32, 2 (1967).
  2. Louis Bachelier. 1900. Théorie de la spéculation. In Annales scientifiques de l’École normale supérieure, Vol. 17. 21–86.
  3. The core-collapse rate from the Supernova Legacy Survey. Astronomy and Astrophysics 499, 3 (June 2009), 653–660. https://doi.org/10.1051/0004-6361/200911847 arXiv:0904.1066 [astro-ph.CO]
  4. The Zwicky Transient Facility: System Overview, Performance, and First Results. Publications of the Astronomical Society of the Pacific 131, 995 (Jan. 2019), 018002. https://doi.org/10.1088/1538-3873/aaecbe arXiv:1902.01932 [astro-ph.IM]
  5. Fischer Black and Myron Scholes. 1973. The Pricing of Options and Corporate Liabilities. Journal of Political Economy 81, 3 (May 1973), 637–654. https://doi.org/10.1086/260062
  6. Jean-Philippe Bouchaud and Marc Potters. 2000. Theory of financial risks. Vol. 12. Cambridge University Press, Cambridge From Statistical Physics to Risk ….
  7. Abdürrezzak E. Bozdoğan. 2022. Polynomial Equations based on Bouguer–Lambert and Beer Laws for Deviations from Linearity and Absorption Flattening. Journal of Analytical Chemistry 77, 11 (01 Nov 2022), 1426–1432. https://doi.org/10.1134/S1061934822110028
  8. Operon C++: An Efficient Genetic Programming Framework for Symbolic Regression. In Proceedings of the 2020 Genetic and Evolutionary Computation Conference Companion (GECCO ’20). Association for Computing Machinery, internet, 1562–1570. https://doi.org/doi:10.1145/3377929.3398099
  9. Discovering symbolic models from deep learning with inductive biases. Advances in Neural Information Processing Systems 33 (2020), 17429–17442.
  10. Photometric classification and redshift estimation of LSST Supernovae. Monthly Notices of the Royal Astronomical Society 477, 3 (July 2018), 4142–4151. https://doi.org/10.1093/mnras/sty965 arXiv:1701.05689 [astro-ph.CO]
  11. Understanding conflict origin and dynamics on Twitter: A real-time detection system. Expert Systems with Applications 212 (2023), 118748.
  12. scikit-hep/iminuit. (Dec 2020). https://doi.org/10.5281/zenodo.3949207
  13. Gaia Data Release 3. Summary of the variability processing and analysis. Astronomy and Astrophysics 674, Article A13 (June 2023), A13 pages. https://doi.org/10.1051/0004-6361/202244242 arXiv:2206.06416 [astro-ph.SR]
  14. Jerome H. Friedman. 1991. Multivariate Adaptive Regression Splines. The Annals of Statistics 19, 1 (1991), 1 – 67. https://doi.org/10.1214/aos/1176347963
  15. Fast, accurate, and transferable many-body interatomic potentials by symbolic regression. npj Computational Materials 5, 1 (18 Nov 2019), 112. https://doi.org/10.1038/s41524-019-0249-1
  16. Parameter identification for symbolic regression using nonlinear least squares. Genetic Programming and Evolvable Machines 21, 3 (2020), 471–501.
  17. S. G. Kou. 2002. A Jump-Diffusion Model for Option Pricing. Management Science 48, 8 (Aug. 2002), 1086–1101. https://doi.org/10.1287/mnsc.48.8.1086.166
  18. Contemporary Symbolic Regression Methods and their Relative Performance. (07 2021).
  19. A Flexible Symbolic Regression Method for Constructing Interpretable Clinical Prediction Models. npj Digital Medicine 6, 1 (June 2023), 1–14. https://doi.org/10.1038/s41746-023-00833-8
  20. An intermediate distribution between Gaussian and Cauchy distributions. Physica A: Statistical Mechanics and its Applications 391, 22 (Nov. 2012), 5411–5421. https://doi.org/10.1016/j.physa.2012.06.035
  21. The Variance Gamma Process and Option Pricing. Review of Finance 2, 1 (April 1998), 79–105. https://doi.org/10.1023/a:1009703431535
  22. The SNAD Viewer: Everything You Want to Know about Your Favorite ZTF Object. Publications of the Astronomical Society of the Pacific 135, 1044, Article 024503 (Feb. 2023), 024503 pages. https://doi.org/10.1088/1538-3873/acb292 arXiv:2211.07605 [astro-ph.IM]
  23. Rosario N. Mantegna and H. Eugene Stanley. 1995. Scaling behaviour in the dynamics of an economic index. Nature 376, 6535 (July 1995), 46–49. https://doi.org/10.1038/376046a0
  24. Rosario N Mantegna and H Eugene Stanley. 1999. Introduction to econophysics: correlations and complexity in finance. Cambridge university press.
  25. SymPy: symbolic computing in Python. PeerJ Computer Science 3 (Jan. 2017), e103. https://doi.org/10.7717/peerj-cs.103
  26. Rainbow: a colorful approach on multi-passband light curve estimation. arXiv e-prints, Article arXiv:2310.02916 (Oct. 2023), arXiv:2310.02916 pages. https://doi.org/10.48550/arXiv.2310.02916 arXiv:2310.02916 [astro-ph.IM]
  27. Alert Classification for the ALeRCE Broker System: The Light Curve Classifier. The Astronomical Journal 161, 3, Article 141 (March 2021), 141 pages. https://doi.org/10.3847/1538-3881/abd5c1 arXiv:2008.03311 [astro-ph.IM]
  28. A tailor designed fluorescent ‘turn-on’ sensor of formaldehyde based on the BODIPY motif. Tetrahedron Letters 53, 37 (2012), 4913–4916. https://doi.org/10.1016/j.tetlet.2012.06.117
  29. Carboxylate BODIPY integrated in MOF-5: easy preparation and solid-state luminescence. J. Mater. Chem. C 11 (2023), 14896–14905. Issue 42. https://doi.org/10.1039/D3TC02581K
  30. Supernova Photometric Classification Pipelines Trained on Spectroscopically Classified Supernovae from the Pan-STARRS1 Medium-deep Survey. The Astrophysical Journal 884, 1 (oct 2019), 83. https://doi.org/10.3847/1538-4357/ab418c
  31. The International Variable Star Index (VSX). Society for Astronomical Sciences Annual Symposium 25 (May 2006), 47.
  32. A novel model extended from the Bouguer-Lambert-Beer law can describe the non-linear absorbance of potassium dichromate solutions and microalgae suspensions. Frontiers in Bioengineering and Biotechnology 11 (2023). https://doi.org/10.3389/fbioe.2023.1116735
Citations (1)

Summary

We haven't generated a summary for this paper yet.

Github Logo Streamline Icon: https://streamlinehq.com