Enhanced Q-Learning Approach to Finite-Time Reachability with Maximum Probability for Probabilistic Boolean Control Networks (2312.06904v1)
Abstract: In this paper, we investigate the problem of controlling probabilistic Boolean control networks (PBCNs) to achieve reachability with maximum probability in the finite time horizon. We address three questions: 1) finding control policies that achieve reachability with maximum probability under fixed, and particularly, varied finite time horizon, 2) leveraging prior knowledge to solve question 1) with faster convergence speed in scenarios where time is a variable framework, and 3) proposing an enhanced Q-learning (QL) method to efficiently address the aforementioned questions for large-scale PBCNs. For question 1), we demonstrate the applicability of QL method on the finite-time reachability problem. For question 2), considering the possibility of varied time frames, we incorporate transfer learning (TL) technique to leverage prior knowledge and enhance convergence speed. For question 3), an enhanced model-free QL approach that improves upon the traditional QL algorithm by introducing memory-efficient modifications to address these issues in large-scale PBCNs effectively. Finally, we apply the proposed method to two examples: a small-scale PBCN and a large-scale PBCN, demonstrating the effectiveness of our approach.
- Stuart A Kauffman. Metabolic stability and epigenesis in randomly constructed genetic nets. Journal of theoretical biology, 22(3):437–467, 1969.
- probabilistic Boolean networks: a rule-based uncertainty model for gene regulatory networks. Bioinformatics, 18(2):261–274, 2002.
- probabilistic Boolean network analysis of brain connectivity in parkinson’s disease. IEEE Journal of selected topics in signal processing, 2(6):975–985, 2008.
- Multiple fault diagnosis in manufacturing processes and machines using probabilistic Boolean networks. In International Workshop on Soft Computing Models in Industrial and Environmental Applications, pages 355–365. Springer, 2019.
- Construction of probabilistic Boolean network for credit default data. In 2014 Seventh International Joint Conference on Computational Sciences and Optimization, pages 11–15, 2014.
- Inferring user preferences by probabilistic logical reasoning over social networks, 2014.
- Local synchronization of interconnected Boolean networks with stochastic disturbances. IEEE Transactions on Neural Networks and Learning Systems, 31(2):452–463, 2020.
- On controllability and stabilizability of probabilistic Boolean control networks. Science China Information Sciences, 57(1):1–14, 2014.
- Policy iteration approach to the infinite horizon average optimal control of probabilistic Boolean networks. IEEE Transactions on Neural Networks and Learning Systems, 32(7):2910–2924, 2021.
- The outputs robustness of Boolean control networks via pinning control. IEEE Transactions on Control of Network Systems, 7(1):201–209, 2020.
- Causal reasoning on Boolean control networks based on abduction: Theory and application to cancer drug discovery. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 16(5):1574–1585, 2019.
- On reachability and controllability of switched Boolean control networks. Automatica, 48(11):2917–2922, 2012.
- Controllability of probabilistic Boolean control networks based on transition probability matrices. Automatica, 52:340–345, 2015.
- Q-learning. Machine learning, 8:279–292, 1992.
- Martin L Puterman. Markov decision processes: discrete stochastic dynamic programming. John Wiley & Sons, 2014.
- Reinforcement learning approach to feedback stabilization problem of probabilistic Boolean control networks. IEEE Control Systems Letters, 5(1):337–342, 2020.
- Cluster synchronization of Boolean networks under state-flipped control with reinforcement learning. IEEE Transactions on Circuits and Systems II: Express Briefs, 69(12):5044–5048, 2022.
- Edge removal and q𝑞qitalic_q-learning for stabilizability of Boolean networks. IEEE Transactions on Neural Networks and Learning Systems, 2023.
- Optimal finite-horizon control for probabilistic Boolean networks with hard constraints. Lecture Notes in Operations Research 7, 2007.
- Set reachability and observability of probabilistic Boolean networks. Automatica, 106:230–241, 2019.
- Finite-time observability of probabilistic Boolean control networks. Asian Journal of Control, 25(1):325–334, 2023.
- Transfer learning with deep neural networks for model predictive control of hvac and natural ventilation in smart buildings. Journal of Cleaner Production, 254:119866, 2020.
- Transfer learning. In Handbook of research on machine learning applications and trends: algorithms, methods, and techniques, pages 242–264. IGI global, 2010.
- Sampled-data control of probabilistic Boolean control networks: A deep reinforcement learning approach. Information Sciences, 619:374–389, 2023.
- Q-learning based optimal false data injection attack on probabilistic Boolean control networks, 2023.
- Reachability, controllability, and stabilization of Boolean control networks with stochastic function perturbations. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 53(2):1198–1208, 2022.
- Weak reachability of probabilistic Boolean control networks. In 2015 International Conference on Advanced Mechatronic Systems (ICAMechS), pages 56–60, 2015.
- Reinforcement learning: An introduction. MIT press, 2018.
- Convergence of stochastic iterative dynamic programming algorithms. Advances in neural information processing systems, 6, 1993.
- Mathematical concepts and methods in modern biology: using modern discrete models. Academic Press, 2013.
- Efficient verification of observability and reconstructibility for large Boolean control networks with special structures. IEEE Transactions on Automatic Control, 65(12):5144–5158, 2020.