Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
149 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Optimizing Heat Alert Issuance with Reinforcement Learning (2312.14196v4)

Published 21 Dec 2023 in cs.LG and stat.AP

Abstract: A key strategy in societal adaptation to climate change is using alert systems to prompt preventative action and reduce the adverse health impacts of extreme heat events. This paper implements and evaluates reinforcement learning (RL) as a tool to optimize the effectiveness of such systems. Our contributions are threefold. First, we introduce a new publicly available RL environment enabling the evaluation of the effectiveness of heat alert policies to reduce heat-related hospitalizations. The rewards model is trained from a comprehensive dataset of historical weather, Medicare health records, and socioeconomic/geographic features. We use scalable Bayesian techniques tailored to the low-signal effects and spatial heterogeneity present in the data. The transition model uses real historical weather patterns enriched by a data augmentation mechanism based on climate region similarity. Second, we use this environment to evaluate standard RL algorithms in the context of heat alert issuance. Our analysis shows that policy constraints are needed to improve RL's initially poor performance. Third, a post-hoc contrastive analysis provides insight into scenarios where our modified heat alert-RL policies yield significant gains/losses over the current National Weather Service alert policy in the United States.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (61)
  1. “PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators.” In Advances in Neural Information Processing Systems, volume 34, 18564–18576. Curran Associates, Inc. (2021).
  2. “Heat Waves in the United States: Mortality Risk during Heat Waves and Effect Modification by Heat Wave Characteristics in 43 U.S. Communities.” Environmental Health Perspectives, 119(2):210–218 (2011).
  3. “Synergistic health effects of air pollution, temperature, and pollen exposure: a systematic review of epidemiological evidence.” Environmental Health, 19:130 (2020).
  4. “Deep Reinforcement Learning: A Brief Survey.” IEEE Signal Processing Magazine, 34(6):26–38 (2017). Conference Name: IEEE Signal Processing Magazine.
  5. “Cause-Specific Risk of Hospital Admission Related to Extreme Heat in Older Adults.” JAMA, 312(24):2659–2667 (2014).
  6. “Budgeted reinforcement learning in continuous state space.” Advances in Neural Information Processing Systems, 32 (2019).
  7. CMS. “2011 CMS Statistics.” Technical report, U.S. Department of Health and Human Services (2011).
  8. “The Influence of Political Ideology and Socioeconomic Vulnerability on Perceived Health Risks of Heat Waves in the Context of Climate Change.” Weather, Climate, and Society, 10(4):731–746 (2018).
  9. “Increased frequency of and population exposure to extreme heat index days in the United States during the 21st century.” Environmental Research Communications, 1(7):075002 (2019).
  10. “Hot weather and heat extremes: health risks.” The Lancet, 398(10301):698–708 (2021).
  11. “Heat Watch/Warning Systems Save Lives: Estimated Costs and Benefits for Philadelphia 1995–98.” Bulletin of the American Meteorological Society, 85(8):1067–1074 (2004).
  12. “Sample-efficient reinforcement learning in the presence of exogenous information.” In Conference on Learning Theory, 5062–5127 (2022).
  13. “Hyperparameters in Contextual RL are Highly Situational.” (2022). ArXiv:2212.10876 [cs].
  14. “Survey of extreme heat public health preparedness plans and response activities in the most populous jurisdictions in the United States.” BMC public health, 23(1):811 (2023).
  15. “Reinforcement learning in the presence of rare events.” In Proceedings of the 25th international conference on Machine learning, 336–343 (2008).
  16. “Bayesian Reinforcement Learning: A Survey.” Foundations and Trends® in Machine Learning, 8(5-6):359–483 (2015). ArXiv:1609.04436 [cs, stat].
  17. “A Survey on Interpretable Reinforcement Learning.” (2022). ArXiv:2112.13112 [cs].
  18. “Global Estimates and Long-Term Trends of Fine Particulate Matter Concentrations (1998–2018).” Environmental Science & Technology, 54(13):7879–7890 (2020). Publisher: American Chemical Society.
  19. “A Survey on Deep Reinforcement Learning Algorithms for Robotic Manipulation.” Sensors (Basel, Switzerland), 23(7):3762 (2023).
  20. “pyro: a framework for hydrodynamics explorations and prototyping.” Journal of Open Source Software, 4(34):1265 (2019).
  21. “Assessment of NOAA National Weather Service Methods to Warn for Extreme Heat Events.” Weather, Climate, and Society, 9(1):5–13 (2017).
  22. “Comparison of health risks by heat wave definition: Applicability of wet-bulb globe temperature for heat wave criteria.” Environmental Research, 168:158–170 (2019).
  23. “Explainability in deep reinforcement learning.” Knowledge-Based Systems, 214:106685 (2021).
  24. “Spatial Analysis of United States National Weather Service Excessive Heat Warnings and Heat Advisories.” Bulletin of the American Meteorological Society, 103(9):E2017–E2031 (2022). Publisher: American Meteorological Society Section: Bulletin of the American Meteorological Society.
  25. “Identifying optimal dosage regimes under safety constraints: An application to long term opioid treatment of chronic pain.” Statistics in Medicine, 37(9):1407–1418 (2018). _eprint: https://onlinelibrary.wiley.com/doi/pdf/10.1002/sim.7566.
  26. “Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems.” arXiv:2005.01643 [cs, stat] (2020). ArXiv: 2005.01643.
  27. “Dynamic Bike Reposition: A Spatio-Temporal Reinforcement Learning Approach.” In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 1724–1733. London United Kingdom: ACM (2018).
  28. “Personalized HeartSteps: A Reinforcement Learning Algorithm for Optimizing Physical Activity.” Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 4(1):18:1–18:22 (2020).
  29. “Machine Learning Approaches to Identify Thresholds in a Heat-Health Warning System Context.” Journal of the Royal Statistical Society Series A: Statistics in Society, 184(4):1326–1346 (2021).
  30. “Summer Heat and Mortality in New York City: How Hot Is Too Hot?” Environmental Health Perspectives, 118(1):80–86 (2010).
  31. Microsoft AI for Good Research Lab. “U.S. Broadband Usage Percentages Dataset.” (2021).
  32. MIT Election Data and Science Lab. “County Presidential Election Returns 2000-2020.” (2018).
  33. “Model-based reinforcement learning: A survey.” Foundations and Trends® in Machine Learning, 16(1):1–118 (2023).
  34. “Constraint Sampling Reinforcement Learning: Incorporating Expertise For Faster Learning.” (2021). ArXiv:2112.15221 [cs].
  35. “Just-in-Time Adaptive Interventions (JITAIs) in Mobile Health: Key Components and Design Principles for Ongoing Health Behavior Support.” Annals of Behavioral Medicine: A Publication of the Society of Behavioral Medicine, 52(6):446–462 (2017).
  36. “(When) Are Contrastive Explanations of Reinforcement Learning Helpful?” (2022). ArXiv:2211.07719 [cs].
  37. “Sociogeographic Variation in the Effects of Heat and Cold on Daily Mortality in Japan.” Journal of Epidemiology, 24(1):15–24 (2014).
  38. Padakandla, S. “A survey of reinforcement learning algorithms for dynamically varying environments.” ACM Computing Surveys (CSUR), 54(6):1–25 (2021).
  39. “Explainable Reinforcement Learning: A Survey.” (2020). ArXiv:2005.06247 [cs, stat].
  40. “Stable-Baselines3: Reliable Reinforcement Learning Implementations.” Journal of Machine Learning Research, 22(268):1–8 (2021).
  41. “Reward Estimation for Variance Reduction in Deep Reinforcement Learning.” In Conference on Robot Learning, 674–699. PMLR (2018).
  42. “Reinforcement learning algorithms: A brief survey.” Expert Systems with Applications, 231:120495 (2023).
  43. “Hindsight Learning for MDPs with Exogenous Inputs.” (2022). ArXiv:2207.06272 [cs, stat].
  44. “The Evolving Role of Humans in Weather Prediction and Communication.” Bulletin of the American Meteorological Society, 103(8):E1720–E1746 (2022). Publisher: American Meteorological Society Section: Bulletin of the American Meteorological Society.
  45. Reinforcement Learning: An Introduction. The MIT Press, second edition (2018).
  46. “A Comparative Tutorial of Bayesian Sequential Design and Reinforcement Learning.” The American Statistician, 77(2):223–233 (2023).
  47. The Boards of Trustees of the Federal Hospital Insurance and Federal Supplementary Medical Insurance Trust Funds. “2023 Annual Report.” Technical report, Centers for Medicare & Medicaid Services (2023).
  48. “Gymnasium.” (2023).
  49. “A Review of Off-Policy Evaluation in Reinforcement Learning.” (2022). ArXiv:2212.06355 [cs, math, stat].
  50. U.S. Census Bureau. “2009-2013 American Community Survey 5-year County-level Estimates of Population and Median Household Income.” (2014).
  51. U.S. Energy Information Administration. “Climate Zones - DOE Building America Program.” (2020).
  52. “Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences.” (2018). ArXiv:1807.08706 [cs, stat].
  53. “Comment: Variational Autoencoders as Empirical Bayes.” Statistical Science, 34(2):229–233 (2019).
  54. “Heat warnings, mortality, and hospital admissions among older adults in the United States.” Environment International, 157:106834 (2021).
  55. “Effectiveness of National Weather Service heat alerts in preventing mortality in 20 US cities.” Environment International, 116:30–38 (2018).
  56. “Reinforcement Learning Methods in Public Health.” Clinical Therapeutics, 44(1):139–154 (2022).
  57. “Deep Reinforcement Learning With Spatio-Temporal Traffic Forecasting for Data-Driven Base Station Sleep Control.” IEEE/ACM Transactions on Networking, 29(2):935–948 (2021).
  58. “Assessing the causal effects of a stochastic intervention in time series data: are heat alerts effective in preventing deaths and hospitalizations?” Biostatistics (2023).
  59. “Constraints Penalized Q-learning for Safe Offline Reinforcement Learning.” (2022). ArXiv:2107.09003 [cs].
  60. Zajonc, T. “Bayesian Inference for Dynamic Treatment Regimes: Mobility, Equity, and Efficiency in Student Tracking.” Journal of the American Statistical Association, 107(497):80–92 (2012).
  61. “Susceptibility to Mortality in Weather Extremes: Effect Modification by Personal and Small Area Characteristics In a Multi-City Case-Only Analysis.” Epidemiology (Cambridge, Mass.), 24(6):809–819 (2013).

Summary

We haven't generated a summary for this paper yet.