2000 character limit reached
Reinforcement learning (2405.10369v1)
Published 16 May 2024 in astro-ph.IM, cs.AI, and cs.LG
Abstract: Observing celestial objects and advancing our scientific knowledge about them involves tedious planning, scheduling, data collection and data post-processing. Many of these operational aspects of astronomy are guided and executed by expert astronomers. Reinforcement learning is a mechanism where we (as humans and astronomers) can teach agents of artificial intelligence to perform some of these tedious tasks. In this paper, we will present a state of the art overview of reinforcement learning and how it can benefit astronomy.
- TensorFlow: A system for large-scale machine learning. arXiv e-prints , arXiv:1605.086951605.08695.
- A New Look at the Statistical Model Identification. IEEE Transactions on Automatic Control 19, 716–723.
- Dynamic Programming and Optimal Control. Number v. 1 in Athena Scientific optimization and computation series, Athena Scientific.
- Dynamic Programming and Optimal Control: Volume II; Approximate Dynamic Programming. Athena Scientific optimization and computation series, Athena Scientific.
- Neuro-Dynamic Programming. Athena Scientific. 1st edition.
- Chapter 3 - the cross-entropy method for optimization, in: Rao, C., Govindaraju, V. (Eds.), Handbook of Statistics. Elsevier. volume 31 of Handbook of Statistics, pp. 35–59.
- Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends® in Machine Learning 3, 1–122.
- OpenAI Gym. arXiv e-prints , arXiv:1606.015401606.01540.
- Deep reinforcement learning in a handful of trials using probabilistic dynamics models. Advances in neural information processing systems 31.
- Model-Augmented Actor-Critic: Backpropagating through Paths. arXiv e-prints , arXiv:2005.080682005.08068.
- Model-Based Reinforcement Learning via Meta-Policy Optimization. arXiv e-prints , arXiv:1809.052141809.05214.
- Discovering faster matrix multiplication algorithms with reinforcement learning. Nature 610, 47–53.
- Addressing Function Approximation Error in Actor-Critic Methods. arXiv e-prints , arXiv:1802.094771802.09477.
- Combining ADMM and the augmented Lagrangian method for efficiently handling many constraints, in: Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI-19, International Joint Conferences on Artificial Intelligence Organization. pp. 4525–4531.
- Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. arXiv e-prints , arXiv:1801.012901801.01290.
- Soft Actor-Critic Algorithms and Applications. arXiv e-prints , arXiv:1812.059051812.05905.
- Robust statistics: the approach based on influence functions. New York USA:Wiley. ID: unige:23238.
- Deep reinforcement learning that matters, in: Proceedings of the AAAI conference on artificial intelligence.
- Learning to utilize shaping rewards: A new approach of reward shaping. Advances in Neural Information Processing Systems 33, 15931–15941.
- Accelerating Quadratic Optimization with Reinforcement Learning. arXiv e-prints , arXiv:2107.108472107.10847.
- When to trust your model: Model-based policy optimization. Advances in Neural Information Processing Systems 32.
- Observation strategy optimization for distributed telescope arrays with deep reinforcement learning. The Astronomical Journal 165, 233.
- A simulation framework for telescope array and its application in distributed reinforcement learning-based scheduling of telescope arrays. Astronomy and Computing , 100732.
- Optimal control of wide field small aperture telescope arrays with reinforcement learning, in: Observatory Operations: Strategies, Processes, and Systems IX, SPIE. pp. 170–177.
- Adam: A Method for Stochastic Optimization. ArXiv e-prints 1412.6980.
- Auto-Encoding Variational Bayes. arXiv e-prints , arXiv:1312.61141312.6114.
- Understanding black-box predictions via influence functions, in: Precup, D., Teh, Y.W. (Eds.), Proceedings of the 34th International Conference on Machine Learning, PMLR, International Convention Centre, Sydney, Australia. pp. 1885–1894.
- Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles. arXiv e-prints , arXiv:1612.014741612.01474.
- Self-optimizing adaptive optics control with reinforcement learning for high-contrast imaging. Journal of Astronomical Telescopes, Instruments, and Systems 7, 039002–039002.
- Deep learning. Nature 521, 436 EP –.
- End-to-End Training of Deep Visuomotor Policies. arXiv e-prints , arXiv:1504.007021504.00702.
- Continuous control with deep reinforcement learning. arXiv e-prints , arXiv:1509.029711509.02971.
- Faster sorting algorithms discovered using deep reinforcement learning. Nature 618, 257–263.
- Human-level control through deep reinforcement learning. Nature 518, 529–533.
- Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning. arXiv e-prints , arXiv:1708.025961708.02596.
- LOFAR Self-Calibration using a Local Sky Model, in: Gabriel, C., Arviset, C., Ponz, D., Enrique, S. (Eds.), Astronomical Data Analysis Software and Systems XV, p. 291.
- Adaptive optics control using model-based reinforcement learning. Opt. Express 29, 15327–15344.
- Toward on-sky adaptive optics control using reinforcement learning-model-based policy optimization for adaptive optics. Astronomy & Astrophysics 664, A71.
- Automatic differentiation in PyTorch, in: NIPS-W.
- PyTorch: An Imperative Style, High-Performance Deep Learning Library. arXiv e-prints , arXiv:1912.017031912.01703.
- Intelligent reflecting surface-assisted interference mitigation with deep reinforcement learning for radio astronomy. IEEE Antennas and Wireless Propagation Letters 21, 1757–1761.
- MBRL-Lib: A Modular Library for Model-based Reinforcement Learning. arXiv e-prints , arXiv:2104.101592104.10159.
- Stable-baselines3: Reliable reinforcement learning implementations. Journal of Machine Learning Research 22, 1–8.
- Prioritized Experience Replay. arXiv e-prints , arXiv:1511.059521511.05952.
- Mastering the game of go with deep neural networks and tree search. Nature 529, 484–489.
- Reinforcement Learning: An Introduction. A Bradford Book, Cambridge, MA, USA.
- Algorithms for Reinforcement Learning. Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool Publishers.
- Scaling up average reward reinforcement learning by approximating the domain models and the value function, in: ICML, Citeseer. pp. 471–479.
- Gymnasium.
- On the theory of the brownian motion. Phys. Rev. 36, 823–841.
- Double q-learning. Advances in neural information processing systems 23.
- Deep reinforcement learning with double q-learning, in: Proceedings of the AAAI conference on artificial intelligence.
- Attention Is All You Need. arXiv e-prints , arXiv:1706.037621706.03762.
- Benchmarking Model-Based Reinforcement Learning. arXiv e-prints , arXiv:1907.020571907.02057.
- Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic. arXiv e-prints , arXiv:2112.105042112.10504.
- Q-learning. Machine learning 8, 279–292.
- Statistical performance of radio interferometric calibration. Monthly Notices of the Royal Astronomical Society 486, 5646–5655.
- Hint assisted reinforcement learning: an application in radio astronomy. arXiv preprint arXiv:2301.03933 .
- Deep reinforcement learning for smart calibration of radio telescopes. Monthly Notices of the Royal Astronomical Society 505, 2141–2150.
- Sarod Yatawatta (36 papers)