Papers
Topics
Authors
Recent
2000 character limit reached

Harmful algal bloom forecasting. A comparison between stream and batch learning (2402.13304v1)

Published 20 Feb 2024 in cs.LG and cs.AI

Abstract: Diarrhetic Shellfish Poisoning (DSP) is a global health threat arising from shellfish contaminated with toxins produced by dinoflagellates. The condition, with its widespread incidence, high morbidity rate, and persistent shellfish toxicity, poses risks to public health and the shellfish industry. High biomass of toxin-producing algae such as DSP are known as Harmful Algal Blooms (HABs). Monitoring and forecasting systems are crucial for mitigating HABs impact. Predicting harmful algal blooms involves a time-series-based problem with a strong historical seasonal component, however, recent anomalies due to changes in meteorological and oceanographic events have been observed. Stream Learning stands out as one of the most promising approaches for addressing time-series-based problems with concept drifts. However, its efficacy in predicting HABs remains unproven and needs to be tested in comparison with Batch Learning. Historical data availability is a critical point in developing predictive systems. In oceanography, the available data collection can have some constrains and limitations, which has led to exploring new tools to obtain more exhaustive time series. In this study, a machine learning workflow for predicting the number of cells of a toxic dinoflagellate, Dinophysis acuminata, was developed with several key advancements. Seven machine learning algorithms were compared within two learning paradigms. Notably, the output data from CROCO, the ocean hydrodynamic model, was employed as the primary dataset, palliating the limitation of time-continuous historical data. This study highlights the value of models interpretability, fair models comparison methodology, and the incorporation of Stream Learning models. The model DoME, with an average R2 of 0.77 in the 3-day-ahead prediction, emerged as the most effective and interpretable predictor, outperforming the other algorithms.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (56)
  1. Harmful algal blooms and public health. Harmful Algae, 57:2–8, 2016. Harmful Algal Blooms and Public Health.
  2. Climate variability and oceanographic settings associated with interannual variability in the initiation of Dinophysis acuminata blooms. Marine drugs, 11(8):2964–2981, 2013.
  3. The growth season of Dinophysis acuminata in an upwelling system embayment: A conceptual model based on in situ measurements. Deep-Sea Research Part II: Topical Studies in Oceanography, 101:141–151, 2014.
  4. Modelling the hydrodynamic conditions associated with dinophysis blooms in galicia (nw spain). Harmful Algae, 53:40–52, 3 2016.
  5. Lipophilic toxins in galicia (nw spain) between 2014 and 2017: Incidence on the main molluscan species and analysis of the monitoring efficiency. Toxins, 11, 2019.
  6. Fine scale physical-biological interactions during a shift from relaxation to upwelling with a focus on Dinophysis acuminata and its potential ciliate prey. Progress in Oceanography, 175(April):309–327, 2019.
  7. Assessing risks and mitigating impacts of harmful algal blooms on mariculture and marine fisheries. Reviews in Aquaculture, 12(3):1663–1688, 2020.
  8. The niche of a stress-tolerant specialist, Dinophysis acuminata, in a coastal upwelling system. Harmful Algae, 125(November 2022), 2023.
  9. Neural network modelling of coastal algal blooms. Ecological Modelling, 159(2):179–201, 2003.
  10. Dinophysis, a highly specialized mixoplanktonic protist. Frontiers in Protistology, 1:1328026, jan 2024.
  11. Coastal connectivity in the Gulf of Maine in spring and summer of 2004-2009. Deep-Sea Research Part II: Topical Studies in Oceanography, 103:199–209, 2014.
  12. Toward predicting Dinophysis blooms off NW Iberia: A decade of events. Harmful Algae, 53:17–32, 2016.
  13. L. Velo-Suárez and J. C. Gutiérrez-Estrada. Artificial neural network approaches to one-step weekly prediction of Dinophysis acuminata blooms in huelva (western andalucía, spain). Harmful Algae, 6, 2007.
  14. Forecasting harmful algae blooms: Application to Dinophysis acuminata in northern Norway. Harmful Algae, 126(October 2022):102442, 2023.
  15. A review of recent machine learning advances for forecasting harmful algal blooms and shellfish contamination. Journal of Marine Science and Engineering, 9, 2021.
  16. Ensemble models with uncertainty analysis for multi-day ahead forecasting of chlorophyll a concentration in coastal waters. Engineering Applications of Computational Fluid Mechanics, 13(1):91–101, 2019.
  17. A real time data driven algal bloom risk forecast system for mariculture management. Marine Pollution Bulletin, 161:111731, 2020.
  18. A comparative study on predicting algae blooms in douro river, portugal. Ecological Modelling, 212(1):86–91, 2008. Selected papers from the Fifth European Conference on Ecological Modelling, 19-23 September 2005, Pushchino, Russia.
  19. Harmful algal blooms prediction with machine learning models in tolo harbour. In 2014 International Conference on Smart Computing, pages 245–250, Nov 2014.
  20. Advances in forecasting harmful algal blooms using machine learning models: A case study with planktothrix rubescens in lake geneva. Harmful Algae, 99:101906, 2020.
  21. Random forest classification to determine environmental drivers and forecast paralytic shellfish toxins in southeast alaska with high temporal resolution. Harmful Algae, 99:101918, 2020.
  22. A remote sensing and machine learning-based approach to forecast the onset of harmful algal bloom. Remote Sensing, 13(19), 2021.
  23. Hybrid machine learning techniques in the management of harmful algal blooms impact. Computers and Electronics in Agriculture, 211:107988, 2023.
  24. Explainable artificial intelligence: an analytical review. WIREs Data Mining and Knowledge Discovery, 11(5):e1424, 2021.
  25. Predicting coastal harmful algal blooms using integrated data-driven analysis of environmental factors. Science of The Total Environment, 912:169253, 2024.
  26. Harmful algae and climate change on the canadian east coast: Exploring occurrence predictions of Dinophysis acuminata, D. norvegica, and Pseudo-nitzschia seriata. Harmful Algae, 112:102183, 2022.
  27. Online learning: A comprehensive survey. Neurocomputing, 459:249–289, 2021.
  28. Vision-based online learning kinematic control for soft robots using local gaussian process regression. IEEE Robotics and Automation Letters, 4(2):1194–1201, 2019.
  29. Real-time wifi localization of heterogeneous robot teams using an online random forest. Autonomous robots, 39:155–167, 2015.
  30. Dynamically balanced online random forests for interactive scribble-based segmentation. In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2016: 19th International Conference, Athens, Greece, October 17-21, 2016, Proceedings, Part II 19, pages 352–360. Springer, 2016.
  31. GEOHAB 2011. GEOHAB Modelling: A workshop Report. Technical report, IOC and SCOR, Paris and Newark, Delaware, 2011.
  32. Coastal and regional ocean community model, December 2022.
  33. Machine learning in management of precautionary closures caused by lipophilic biotoxins. Computers and Electronics in Agriculture, 197:106956, 2022.
  34. Dome: A deterministic technique for equation development and symbolic regression. Expert Systems with Applications, 198:116712, 2022.
  35. Online tree-based ensembles and option trees for regression on evolving data streams. Neurocomputing, 150:458–470, 2015. Special Issue on Information Processing and Machine Learning for Applications of Engineering Solving Complex Machine Learning Problems with Ensemble Methods Visual Analytics using Multidimensional Projections.
  36. The regional oceanic modeling system (ROMS): A split-explicit, free-surface, topography-following-coordinate oceanic model. Ocean Modelling, 9(4):347–404, 2005.
  37. A High-Resolution Modeling Study of the Circulation Patterns at a Coastal Embayment: Ría de Pontevedra (NW Spain) Under Upwelling and Downwelling Conditions. Frontiers in Marine Science, 8, jul 2021.
  38. Andrew Bakun. Coastal Upwelling Indices, West Coast of North America, 1946-71. NOAA Technical Report NMFS SSRF-671, 1973.
  39. Artificial intelligence a modern approach. London, 2010.
  40. Brett Lantz. Machine Learning with R: Second Edition. 2015.
  41. Mining high-speed data streams. In Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’00, page 71–80, New York, NY, USA, 2000. Association for Computing Machinery.
  42. Learning model trees from evolving data streams. Data mining and knowledge discovery, 23:128–168, 2011.
  43. Adaptive learning from evolving data streams. In Niall M. Adams, Céline Robardet, Arno Siebes, and Jean-François Boulicaut, editors, Advances in Intelligent Data Analysis VIII, pages 249–260, Berlin, Heidelberg, 2009. Springer Berlin Heidelberg.
  44. Improving adaptive bagging methods for evolving data streams. In Zhi-Hua Zhou and Takashi Washio, editors, Advances in Machine Learning, pages 23–37, Berlin, Heidelberg, 2009. Springer Berlin Heidelberg.
  45. Support-vector networks. Machine Learning, 20, 1995.
  46. Halbert White et al. Artificial neural networks. Blackwell Cambridge, Mass., 1992.
  47. Leo Breiman. Random forests. Machine learning, 45:5–32, 2001.
  48. Principal components analysis (pca). Computers & Geosciences, 19(3):303–342, 1993.
  49. Mesoscale dynamics and niche segregation of two Dinophysis species in Galician-Portuguese coastal waters. Toxins, 11(37):1–21, 2019.
  50. Advection and Composition of Dinophysis spp. Populations Along the European Atlantic Shelf. Frontiers in Marine Science, 9(July):1–14, 2022.
  51. An inshore poleward current in the NW of the Iberian Peninsula detected from satellite images, and its relation with G. catenatum and D. acuminata blooms in the Galician Rias. Estuarine, Coastal and Shelf Science, 53(6):787–799, 2001.
  52. Fronts, jets, and counter-flows in the Western Iberian upwelling system. Journal of Marine Systems, 35(1-2):61–77, jun 2002.
  53. Modeling the dynamics of harmful algal bloom events in two bays from the northern Chilean upwelling system. Harmful Algae, 132(January), 2024.
  54. Individual-based modelling of the development and transport of a Karenia mikimotoi bloom on the North-west European continental shelf. Harmful Algae, 53:118–134, 2016.
  55. Assessing the Performance and Application of Operational Lagrangian Transport HAB Forecasting Systems. Frontiers in Marine Science, 9(July):1–25, 2022.
  56. Rapid response to coastal upwelling in a semienclosed bay. Geophysical Research Letters, 44:1–10, 2017.

Summary

We haven't generated a summary for this paper yet.

Whiteboard

Paper to Video (Beta)

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 2 tweets with 0 likes about this paper.