Learning Constrained Corner Node Trajectories of a Tether Net System for Space Debris Capture (2307.03061v1)
Abstract: Earth's orbit is becoming increasingly crowded with debris that poses significant safety risks to the operation of existing and new spacecraft and satellites. The active tether-net system, which consists of a flexible net with maneuverable corner nodes launched from a small autonomous spacecraft, is a promising solution for capturing and disposing of such space debris. The requirement of autonomous operation and the need to generalize over scenarios with debris at different rotational rates make the capture process significantly challenging. The space debris could rotate about multiple axes, which, along with sensing/estimation and actuation uncertainties, calls for a robust, generalizable approach to guiding the net launch and flight, one that can guarantee robust capture. This paper proposes a decentralized actuation system combined with reinforcement learning (RL) for planning and controlling this tether-net system. In this new system, four microsatellites with cold-gas thrusters act as the corner nodes of the net and can thus help control or correct the flight of the net after launch. The microsatellites pull the net to complete the task of approaching and capturing the space debris. The proposed method uses an RL framework that integrates proximal policy optimization (PPO) to find the optimal solution based on the dynamics simulation of the net and the microsatellites performed in Vortex Studio. The RL framework finds a trajectory that is both fuel-efficient and ensures a desired level of capture quality.
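To make the learning setup in the abstract more concrete, the following is a minimal sketch, not the authors' implementation. The function `ppo_clipped_objective` computes the standard clipped surrogate objective of proximal policy optimization. The function `shaped_reward`, its `fuel_weight` parameter, and the linear trade-off it encodes are hypothetical: the abstract states only that the learned trajectory should be fuel-efficient and reach a desired capture quality, not how the two are combined.

```python
import math


def ppo_clipped_objective(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Mean PPO clipped surrogate objective over a batch of samples.

    Each sample contributes min(r * A, clip(r, 1 - eps, 1 + eps) * A),
    where r is the probability ratio between the new and old policies.
    """
    if not (len(new_log_probs) == len(old_log_probs) == len(advantages)):
        raise ValueError("all inputs must have the same length")
    if not advantages:
        raise ValueError("inputs must be non-empty")

    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        ratio = math.exp(new_lp - old_lp)  # pi_new(a|s) / pi_old(a|s)
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Taking the minimum keeps the update pessimistic.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)


def shaped_reward(capture_quality, fuel_used, fuel_weight=0.1):
    """Hypothetical reward: capture quality minus a linear fuel penalty.

    The linear form and fuel_weight are illustrative assumptions only.
    """
    return capture_quality - fuel_weight * fuel_used


if __name__ == "__main__":
    # The new policy doubles the probability of an action with positive advantage;
    # the clip limits the credit to (1 + clip_eps) * advantage.
    print(ppo_clipped_objective([math.log(2.0)], [0.0], [1.0]))
    print(shaped_reward(capture_quality=1.0, fuel_used=2.0))
```

The clip removes the incentive to move the policy far from the one that collected the data, which matters when each sample comes from an expensive dynamics simulation. In practice a library implementation would normally be used instead of a hand-written objective.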