2000 character limit reached
Causal Explanation for Reinforcement Learning: Quantifying State and Temporal Importance (2210.13507v2)
Published 24 Oct 2022 in cs.AI and cs.LG
Abstract: Explainability plays an increasingly important role in machine learning. Furthermore, humans view the world through a causal lens and thus prefer causal explanations over associational ones. Therefore, in this paper, we develop a causal explanation mechanism that quantifies the causal importance of states on actions and such importance over time. We also demonstrate the advantages of our mechanism over state-of-the-art associational methods in terms of RL policy explanation through a series of simulation studies, including crop irrigation, Blackjack, collision avoidance, and lunar lander.
- Openai gym. arXiv preprint arXiv:1606.01540.
- Bryson, A. E. 1975. Applied optimal control: optimization, estimation and control. Boca Raton: CRC Press.
- Byrne, R. M. 2019. Counterfactuals in Explainable Artificial Intelligence (XAI): Evidence from Human Reasoning. In IJCAI, 6276–6282.
- Neural network attributions: A causal perspective. In International Conference on Machine Learning, 981–990. PMLR.
- Algorithmic transparency via quantitative input influence: Theory and experiments with learning systems. In 2016 IEEE symposium on security and privacy (SP), 598–617. IEEE.
- A survey of uncertainty in deep neural networks. arXiv preprint arXiv:2107.03342.
- Causal inference in statistics: A primer. Hoboken: John Wiley & Sons.
- Visualizing and understanding atari agents. In International Conference on Machine Learning, 1792–1801. PMLR.
- Explainability in deep reinforcement learning. Knowledge-Based Systems, 214: 106685.
- Hilton, D. 2007. Causal explanation: From social perception to knowledge-based causal attribution.
- Nonlinear causal discovery with additive noise models. Advances in neural information processing systems, 21: 689–696.
- Transparency and explanation in deep reinforcement learning neural networks. In Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society, 144–150.
- Bidirectional conditional generative adversarial networks. In Asian Conference on Computer Vision, 216–232. Springer.
- Explainable reinforcement learning via reward decomposition. In IJCAI/ECAI Workshop on Explainable Artificial Intelligence.
- Causal discovery toolbox: Uncover causal relationships in python. arXiv preprint arXiv:1903.02278.
- Discovering causal signals in images. In Proceedings of the IEEE conference on computer vision and pattern recognition, 6979–6987.
- A unified approach to interpreting model predictions. arXiv preprint arXiv:1705.07874.
- Explainable reinforcement learning through a causal lens. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 34, 2493–2500.
- Miller, T. 2019. Explanation in artificial intelligence: Insights from the social sciences. Artificial intelligence, 267: 1–38.
- Towards interpretable reinforcement learning using attention augmented agents. arXiv preprint arXiv:1906.02500.
- Counterfactual state explanations for reinforcement learning agents via generative deep learning. Artificial Intelligence, 295: 103455.
- Pearl, J. 2009. Causality. Causality: Models, Reasoning, and Inference. Cambridge: Cambridge University Press. ISBN 9780521895606.
- Causal discovery with continuous additive noise models.
- Explainable reinforcement learning: A survey. In International cross-domain conference for machine learning and knowledge extraction, 77–95. Springer.
- Explain your move: Understanding agent actions using specific and relevant feature attribution. arXiv preprint arXiv:1912.12191.
- ” Why should i trust you?” Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, 1135–1144.
- Cxplain: Causal explanations for model interpretation under uncertainty. arXiv preprint arXiv:1910.12336.
- A linear non-Gaussian acyclic model for causal discovery. Journal of Machine Learning Research, 7(10).
- Deep inside convolutional networks: Visualising image classification models and saliency maps. arXiv preprint arXiv:1312.6034.
- Deep inside convolutional networks: Visualising image classification models and saliency maps.
- Causation, prediction, and search. Cambridge: MIT press.
- Axiomatic attribution for deep networks. In International Conference on Machine Learning, 3319–3328. PMLR.
- Reinforcement learning: An introduction. Cambridge: MIT press.
- Deep reinforcement learning with double q-learning. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 30.
- Programmatically interpretable reinforcement learning. In International Conference on Machine Learning, 5045–5054. PMLR.
- Explainable ai and reinforcement learning—a systematic review of current approaches and trends. Frontiers in artificial intelligence, 4: 550030.
- The EPIC crop growth model. Transactions of the ASAE, 32(2): 497–0511.
- Causalvae: Disentangled representation learning via neural structural causal models. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 9593–9602.
- gcastle: A python toolbox for causal discovery. arXiv preprint arXiv:2111.15155.