Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 174 tok/s
Gemini 2.5 Pro 51 tok/s Pro
GPT-5 Medium 38 tok/s Pro
GPT-5 High 34 tok/s Pro
GPT-4o 91 tok/s Pro
Kimi K2 205 tok/s Pro
GPT OSS 120B 438 tok/s Pro
Claude Sonnet 4.5 36 tok/s Pro
2000 character limit reached

Evolution-based Feature Selection for Predicting Dissolved Oxygen Concentrations in Lakes (2403.18923v2)

Published 15 Feb 2024 in cs.NE, cs.AI, and cs.LG

Abstract: Accurate prediction of dissolved oxygen (DO) concentrations in lakes requires a comprehensive study of phenological patterns across ecosystems, highlighting the need for precise selection of interactions amongst external factors and internal physical-chemical-biological variables. This paper presents the Multi-population Cognitive Evolutionary Search (MCES), a novel evolutionary algorithm for complex feature interaction selection problems. MCES allows models within every population to evolve adaptively, selecting relevant feature interactions for different lake types and tasks. Evaluated on diverse lakes in the Midwestern USA, MCES not only consistently produces accurate predictions with few observed labels but also, through gene maps of models, reveals sophisticated phenological patterns of different lake types, embodying the innovative concept of "AI from nature, for nature".

Definition Search Book Streamline Icon: https://streamlinehq.com
References (66)
  1. Overcoming equifinality: Leveraging long time series for stream metabolism estimation. Journal of Geophysical Research: Biogeosciences 123, 2 (2018), 624–645.
  2. Thomas Back. 1996. Evolutionary algorithms in theory and practice: evolution strategies, evolutionary programming, genetic algorithms. Oxford University Press.
  3. Keith Beven. 2006. A manifesto for the equifinality thesis. Journal of hydrology 320, 1-2 (2006), 18–36.
  4. Edward Asahel Birge. 1906. Gases dissolved in the waters of Wisconsin lakes. Transactions of the American Fisheries Society 35, 1 (1906), 143–163.
  5. GOTM, a general ocean turbulence model: theory, implementation and test cases. Space Applications Institute.
  6. Shih-Kang Chao and Guang Cheng. 2019. A generalization of regularized dual averaging and its dynamics. arXiv preprint arXiv:1909.10072 (2019).
  7. Depth-integrated, continuous estimates of metabolism in a clear-water lake. Canadian Journal of Fisheries and Aquatic Sciences 65, 4 (2008), 712–722.
  8. Learning quality characteristics for plastic injection molding processes using a combination of simulated and measured data. Journal of Manufacturing Processes 60 (2020), 134–143.
  9. Crop yield estimation using multi-source satellite image series and deep learning. In IGARSS 2020-2020 IEEE International Geoscience and Remote Sensing Symposium. IEEE, 5163–5166.
  10. Robust inverse framework using knowledge-guided self-supervised learning: An application to hydrology. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 465–474.
  11. Delving deeper: Metabolic processes in the metalimnion of stratified lakes. Limnology and Oceanography 62, 3 (2017), 1288–1306.
  12. Application of k-epsilon turbulence models to enclosed basins: the role of internal seiches. Journal of Geophysical Research: Oceans 107, C12 (2002), 23–1.
  13. David P Hamilton and S Geoffrey Schladow. 1997. Prediction of water quality in lakes and reservoirs. Part I—Model description. Ecological modelling 96, 1-3 (1997), 91–110.
  14. Herim Han and Sunghwan Choi. 2021. Transfer Learning from Simulation to Experimental Data: NMR Chemical Shift Predictions. The Journal of Physical Chemistry Letters 12, 14 (2021), 3662–3668.
  15. Predicting lake surface water phosphorus dynamics using process-guided machine learning. 430 ([n. d.]), 109136.
  16. Jun He and Xin Yao. 2002. From an individual to a population: An analysis of the first hitting time of population-based evolutionary algorithms. IEEE Transactions on Evolutionary Computation 6, 5 (2002), 495–511.
  17. A General Lake Model (GLM 3.0) for linking with high-frequency sensor data from the Global Lake Ecological Observatory Network (GLEON). Geoscientific Model Development 12, 1 (2019), 473–523.
  18. Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation 9, 8 (1997), 1735–1780.
  19. Simultaneous quantification of aquatic ecosystem metabolism and reaeration using a Bayesian statistical model of oxygen dynamics. Limnology and Oceanography 55, 3 (2010), 1047–1063.
  20. Exploring, exploiting and evolving diversity of aquatic ecosystem models: a community perspective. Aquatic Ecology 49 (2015), 513–548.
  21. PCLake+: A process-based ecological model to assess the trophic state of stratified and non-stratified freshwater lakes worldwide. Ecological modelling 396 (2019), 23–32.
  22. Urban point sources of nutrients were the leading cause for the historical spread of hypoxia across European lakes. Proceedings of the National Academy of Sciences 113, 45 (2016), 12655–12660.
  23. Bringing automated, remote-sensed, machine learning methods to monitoring crop landscapes at scale. Agricultural Economics 50 (2019), 41–50.
  24. Field-aware factorization machines for CTR prediction. In Proceedings of the 10th ACM Conference on Recommender Systems (RecSys). 43–50.
  25. Navigation of a fuzzy-controlled wheeled robot through the combination of expert knowledge and data-driven multiobjective evolutionary learning. IEEE Transactions on Cybernetics 52, 8 (2022), 7388–7401.
  26. Autofeature: Searching for feature interactions and their architectures for click-through rate prediction. In ACM International Conference on Information and Knowledge Management (CIKM). 625–634.
  27. Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
  28. Towards learning universal, regional, and local hydrological behaviors via machine learning applied to large-sample datasets. Hydrology and Earth System Sciences 23, 12 (2019), 5089–5110.
  29. Long-term change in metabolism phenology in north temperate lakes. Limnology and Oceanography 67, 7 (2022), 1502–1521.
  30. Deep learning. nature 521, 7553 (2015), 436–444.
  31. Estimating the Autotrophic and Heterotrophic Respiration in the US Crop Fields using Knowledge Guided Machine Learning. In AGU Fall Meeting 2021. AGU.
  32. AutoGroup: Automatic feature grouping for modelling explicit high-order feature interactions in CTR prediction. In Proceedings of the 43rd international ACM SIGIR conference on Research and Development in Information Retrieval (SIGIR). 199–208.
  33. Autofis: Automatic feature interaction selection in factorization models for click-through rate prediction. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD). 2636–2645.
  34. DARTS: Differentiable architecture search. In International Conference on Learning Representations.
  35. KGML-ag: a modeling framework of knowledge-guided machine learning to simulate agroecosystems: a case study of estimating N 2 O emission using data from mesocosm experiments. Geoscientific Model Development 15, 7 (2022), 2839–2858.
  36. Autocalibration of a one-dimensional hydrodynamic-ecological model (DYRESM 4.0-CAEDYM 3.1) using a Monte Carlo approach: simulations of hypoxic events in a polymictic lake. Geoscientific Model Development 11, 3 (2018), 903–913.
  37. Long-term dynamics of lakes in the landscape: long-term ecological research on north temperate lakes. Long-Term Ecological Research.
  38. Mikko I Malinen and Pasi Fränti. 2014. Balanced k-means for clustering. In Structural, Syntactic, and Statistical Pattern Recognition: Joint IAPR International Workshop, S+ SSPR 2014, Joensuu, Finland, August 20-22, 2014. Proceedings. Springer, 32–41.
  39. Rammohan Mallipeddi and Ponnuthurai N Suganthan. 2008. Empirical study on the effect of population size on differential evolution algorithm. In 2008 IEEE Congress on Evolutionary Computation. IEEE, 3663–3670.
  40. National-scale remotely sensed lake trophic state from 1984 through 2020. Scientific Data 11, 1 (2024), 77.
  41. An open source QGIS-based workflow for model application and experimentation with aquatic ecosystems. Environmental Modelling & Software 95 (2017), 358–364.
  42. Joseph S Phillips. 2020. Time-varying responses of lake metabolism to light and temperature. Limnology and Oceanography 65, 3 (2020), 652–666.
  43. Exploring the exceptional performance of a deep learning stream temperature model and the value of streamflow data. Environmental Research Letters 16, 2 (2021), 024025.
  44. Stephan Rasp and Nils Thuerey. 2021. Data-driven medium-range weather prediction with a resnet pretrained on climate simulations: A new model for weatherbench. Journal of Advances in Modeling Earth Systems 13, 2 (2021), e2020MS002405.
  45. Process-guided deep learning predictions of lake water temperature. Water Resources Research 55, 11 (2019), 9173–9190.
  46. Steffen Rendle. 2010. Factorization machines. In 2010 IEEE International Conference on Data Mining (ICDM). IEEE, 995–1000.
  47. Depth-integrated estimates of ecosystem metabolism in a high-elevation lake (Emerald Lake, Sierra Nevada, California). Limnology and oceanography 56, 5 (2011), 1764–1780.
  48. Tuomo M Saloranta and Tom Andersen. 2007. MyLake—A multi-year lake simulation model code suitable for uncertainty and sensitivity analysis simulations. Ecological modelling 207, 1 (2007), 45–60.
  49. Ecosystem respiration: drivers of daily variability and background respiration in lakes around the globe. Limnology and Oceanography 58, 3 (2013), 849–866.
  50. Beyond the Plankton Ecology Group (PEG) model: mechanisms driving plankton succession. Annual review of ecology, evolution, and systematics 43 (2012), 429–448.
  51. Towards automated neural interaction discovery for click-through rate prediction. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD). 945–955.
  52. Autoint: Automatic feature interaction learning via self-attentive neural networks. In ACM International Conference on Information and Knowledge Management (CIKM). 1161–1170.
  53. Lake metabolism and the diel oxygen technique: state of the science. Limnology and Oceanography: Methods 8, 11 (2010), 628–644.
  54. The metabolism of aquatic ecosystems: history, applications, and future challenges. Aquatic Sciences 74 (2012), 15–29.
  55. LAKE 2.0: a model for temperature, methane, carbon dioxide and oxygen dynamics in lakes. Geoscientific Model Development 9, 5 (2016), 1977–2006.
  56. Negatively correlated search. IEEE Journal on Selected Areas in Communications 34, 3 (2016), 542–550.
  57. Evolutionary machine learning: A survey. Comput. Surveys 54, 8 (2021), 1–35.
  58. Crop yield prediction using machine learning: A systematic literature review. Computers and Electronics in Agriculture 177 (2020), 105709.
  59. Predicting Water Temperature Dynamics of Unmonitored Lakes With Meta-Transfer Learning. Water Resources Research 57, 7 (2021), e2021WR029579.
  60. P Chris Wilson. 2010. Water Quality Notes: Dissolved Oxygen: SL313/SS525, 1/2010. EDIS 2010, 2 (2010).
  61. Phenological shifts in lake stratification under climate change. Nature communications 12, 1 (2021), 2318.
  62. Lin Xiao. 2009. Dual averaging method for regularized stochastic learning and online optimization. Advances in Neural Information Processing Systems 22 (2009).
  63. A survey on evolutionary computation approaches to feature selection. IEEE Transactions on Evolutionary Computation 20, 4 (2015), 606–626.
  64. Cognitive Evolutionary Search to Select Feature Interactions for Click-Through Rate Prediction. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 3151–3161.
  65. Xcrossnet: Feature structure-oriented learning for click-through rate prediction. In Advances in Knowledge Discovery and Data Mining: 25th Pacific-Asia Conference (PAKDD). Springer, 436–447.
  66. Evolutionary learning: Advances in theories and algorithms. Springer.

Summary

We haven't generated a summary for this paper yet.

Dice Question Streamline Icon: https://streamlinehq.com

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Lightbulb Streamline Icon: https://streamlinehq.com

Continue Learning

We haven't generated follow-up questions for this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.