Learning Deep Features in Instrumental Variable Regression (2010.07154v4)
Abstract: Instrumental variable (IV) regression is a standard strategy for learning causal relationships between confounded treatment and outcome variables from observational data by utilizing an instrumental variable, which affects the outcome only through the treatment. In classical IV regression, learning proceeds in two stages: stage 1 performs linear regression from the instrument to the treatment; and stage 2 performs linear regression from the treatment to the outcome, conditioned on the instrument. We propose a novel method, deep feature instrumental variable regression (DFIV), to address the case where relations between instruments, treatments, and outcomes may be nonlinear. In this case, deep neural nets are trained to define informative nonlinear features on the instruments and treatments. We propose an alternating training regime for these features to ensure good end-to-end performance when composing stages 1 and 2, thus obtaining highly flexible feature maps in a computationally efficient manner. DFIV outperforms recent state-of-the-art methods on challenging IV benchmarks, including settings involving high-dimensional image data. DFIV also exhibits competitive performance in off-policy policy evaluation for reinforcement learning, which can be understood as an IV regression task.
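The classical two-stage procedure described in the abstract can be illustrated with a minimal NumPy sketch. This is not the DFIV method and is not taken from the paper's code: the function names (`two_stage_least_squares`, `ols_slope`, `simulate`), the synthetic data-generating process, and the true effect of 2.0 are all assumptions made for illustration. Stage 2 is implemented here in the usual textbook way, by regressing the outcome on the stage 1 prediction of the treatment.

```python
import numpy as np


def two_stage_least_squares(z, x, y):
    """Classical linear 2SLS with intercepts; z, x, y are 1-D arrays."""
    n = len(z)
    Z = np.column_stack([np.ones(n), z])
    # Stage 1: linear regression from the instrument to the treatment.
    stage1_coef, *_ = np.linalg.lstsq(Z, x, rcond=None)
    x_hat = Z @ stage1_coef
    # Stage 2: linear regression from the predicted treatment to the outcome.
    X_hat = np.column_stack([np.ones(n), x_hat])
    stage2_coef, *_ = np.linalg.lstsq(X_hat, y, rcond=None)
    return stage2_coef  # [intercept, slope]


def ols_slope(x, y):
    """Naive regression of outcome on treatment, ignoring confounding."""
    X = np.column_stack([np.ones(len(x)), x])
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef[1]


def simulate(n=20000, seed=0):
    """Hypothetical confounded data: u affects both treatment and outcome."""
    rng = np.random.default_rng(seed)
    z = rng.normal(size=n)                      # instrument
    u = rng.normal(size=n)                      # hidden confounder
    x = z + u + 0.1 * rng.normal(size=n)        # treatment
    y = 2.0 * x - 3.0 * u + 0.1 * rng.normal(size=n)  # outcome, true effect 2.0
    return z, x, y


if __name__ == "__main__":
    z, x, y = simulate()
    print("OLS slope :", ols_slope(x, y))
    print("2SLS slope:", two_stage_least_squares(z, x, y)[1])
```

In this synthetic setup the naive regression is biased because the confounder enters both the treatment and the outcome, while the two-stage estimate uses only the variation in the treatment that is explained by the instrument. DFIV, as the abstract describes it, replaces the raw instrument and treatment in these two stages with learned nonlinear features.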
Authors: Liyuan Xu, Yutian Chen, Siddarth Srinivasan, Nando de Freitas, Arnaud Doucet, Arthur Gretton