Deep Ensembles Meets Quantile Regression: Uncertainty-aware Imputation for Time Series (2312.01294v3)
Abstract: Real-world time series data frequently have significant amounts of missing values, posing challenges for advanced analysis. A common approach to address this issue is imputation, where the primary challenge lies in determining the appropriate values to fill in. While previous deep learning methods have proven effective for time series imputation, they often produce overconfident imputations, which could brings a potentially overlooked risk to the reliability of the intelligent system. Diffusion methods are proficient in estimating probability distributions but face challenges with high missing rates and moreover, computationally expensive due to the nature of the generative model framework. In this paper, we propose Quantile Sub-Ensembles, a novel method to estimate uncertainty with ensemble of quantile-regression-based task networks and then incorporate Quantile Sub-Ensembles into a non-generative time series imputation method. Our method not only produces accurate imputations that is robust to high missing rates, but also is computationally efficient due to the fast training of its non-generative model. We examine the performance of the proposed method on two real-world datasets, the air quality and health-care datasets, and conduct extensive experiments to show that our method outperforms other most of the baseline methods in making deterministic and probabilistic imputations. Compared with the diffusion method, CSDI, our approach can obtain comparable forecasting results which is better when more data is missing, and moreover consumes a much smaller computation overhead, yielding much faster training and test.
- Craig F. Ansley and Robert Kohn. 1984. On the estimation of ARIMA Models with Missing Values. Springer New York (1984).
- Multiple imputation by chained equations: What is it and how does it work? International Journal of Methods in Psychiatric Research 20, 1 (March 2011), 40–49. https://doi.org/10.1002/mpr.329
- Gilberto Batres-Estrada. 2015. Deep Learning for Multivariate Financial Time Series. (2015).
- The Arrow of Time in Multivariate Time Series. In Proceedings of The 33rd International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 48), Maria Florina Balcan and Kilian Q. Weinberger (Eds.). PMLR, New York, New York, USA, 2043–2051. https://proceedings.mlr.press/v48/bauer16.html
- Multi-task Gaussian Process Prediction (Poster ID W20). Nips 20 (2008), 153–160.
- BRITS: Bidirectional Recurrent Imputation for Time Series. In Advances in Neural Information Processing Systems, S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett (Eds.), Vol. 31. Curran Associates, Inc. https://proceedings.neurips.cc/paper_files/paper/2018/file/734e6bfcd358e25ac1db0a4241b95651-Paper.pdf
- Recurrent Neural Networks for Multivariate Time Series with Missing Values. Scientific Reports 8, 1 (2018), 6085.
- A probabilistic spatial dengue fever risk assessment by a threshold-based-quantile regression method. PloS one 9, 10 (2014), e106334.
- RDIS: Random Drop Imputation with Self-Training for Incomplete Time Series Data. ArXiv abs/2010.10075 (2020). https://api.semanticscholar.org/CorpusID:224803084
- BD Craven and Sardar MN Islam. 2011. Ordinary least-squares regression. The SAGE dictionary of quantitative management research (2011), 224–228.
- Calibrated Reliable Regression using Maximum Mean Discrepancy. In Advances in Neural Information Processing Systems, H. Larochelle, M. Ranzato, R. Hadsell, M.F. Balcan, and H. Lin (Eds.), Vol. 33. Curran Associates, Inc., 17164–17175. https://proceedings.neurips.cc/paper_files/paper/2020/file/c74c4bf0dad9cbae3d80faa054b7d8ca-Paper.pdf
- SAITS: Self-attention-based imputation for time series. Expert Systems with Applications 219 (2023), 119619. https://doi.org/10.1016/j.eswa.2023.119619
- GP-VAE: Deep Probabilistic Time Series Imputation. In Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics (Proceedings of Machine Learning Research, Vol. 108), Silvia Chiappa and Roberto Calandra (Eds.). PMLR, 1651–1661. https://proceedings.mlr.press/v108/fortuin20a.html
- An adaptive deep-learning load forecasting framework by integrating transformer and domain knowledge. Advances in Applied Energy 10 (2023), 100142.
- Denoising Diffusion Probabilistic Models. In Advances in Neural Information Processing Systems, H. Larochelle, M. Ranzato, R. Hadsell, M.F. Balcan, and H. Lin (Eds.), Vol. 33. Curran Associates, Inc., 6840–6851. https://proceedings.neurips.cc/paper_files/paper/2020/file/4c5bcfec8584af0d967f1ab10179ca4b-Paper.pdf
- Nearest neighbor imputation of species-level, plot-scale forest structure attributes from LiDAR data. Remote Sensing of Environment 112, 5 (2008), 2232–2245.
- Pricing determinants in the hotel industry: Quantile regression analysis. International Journal of Hospitality Management 29, 3 (2010), 378–384.
- Changha Hwang and Jooyong Shim. 2005. A simple quantile regression via support vector machine. In Advances in Natural Computation: First International Conference, ICNC 2005, Changsha, China, August 27-29, 2005, Proceedings, Part I 1. Springer, 512–520.
- Graham JW. 2009. Missing data analysis: making it work in the real world. 60 (2009). https://doi.org/10.1146/annurev.psych.58.110405.085530
- Armen Der Kiureghian and Ove Ditlevsen. 2009. Aleatory or epistemic? Does it matter? Structural Safety 31, 2 (2009), 105–112. https://doi.org/10.1016/j.strusafe.2008.06.020 Risk Acceptance and Risk Communication.
- Roger Koenker and Gilbert Bassett. 1978. Regression Quantiles. Econometrica 46, 1 (1978), 33–50. http://www.jstor.org/stable/1913643
- Roger Koenker and Kevin F. Hallock. 2013. Journal of Economic Perspectives—Volume 15, Number 4—Fall 2001—Pages 143–156 Quantile Regression. (2013).
- Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles. In Advances in Neural Information Processing Systems, I. Guyon, U. Von Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Eds.), Vol. 30. Curran Associates, Inc. https://proceedings.neurips.cc/paper_files/paper/2017/file/9ef2ed4b7fd2c810847ffa5fa85bce38-Paper.pdf
- Joanna Małgorzata Landmesser et al. 2016. Decomposition of differences in income distributions using quantile regression. Statistics in Transition. New Series 17, 2 (2016), 331–349.
- Hunter Kenneth Lange. 2000. Quantile Regression via an MM Algorithm. Journal of Computational & Graphical Statistics 9, 1 (2000), 60–77.
- Le and Quoc. 2006. Nonparametric Quantile Estimation. Journal of Machine Learning Research 7, 2006 (2006), 1231–1264.
- StackVAE-G: An efficient and interpretable model for time series anomaly detection. AI Open 3 (2022), 101–110.
- Faming Liang. 2005. Bayesian neural networks for nonlinear time series forecasting. Statistics and computing 15 (2005), 13–29.
- Zitao Liu and Milos Hauskrecht. [n. d.]. Learning Linear Dynamical Systems from Multivariate Time Series: A Matrix Factorization Based Framework. In Siam International Conference on Data Mining. 810–818.
- James E. Matheson and Robert L. Winkler. 1976. Scoring Rules for Continuous Probability Distributions. Management Science 22, 10 (1976), 1087–1096. http://www.jstor.org/stable/2629907
- Nicolai Meinshausen and Greg Ridgeway. 2006a. Quantile Regression Forests. Journal of Machine Learning Research 7, 2 (2006), 983–999.
- Nicolai Meinshausen and Greg Ridgeway. 2006b. Quantile regression forests. Journal of machine learning research 7, 6 (2006).
- Uncertainty-Aware Variational-Recurrent Imputation Network for Clinical Time Series. IEEE Transactions on Cybernetics 52, 9 (2022), 9684–9694. https://doi.org/10.1109/TCYB.2021.3053599
- High-quality prediction intervals for deep learning: A distribution-free, ensembled approach. In International conference on machine learning. PMLR, 4075–4084.
- Sangeeta Rani and Geeta Sikka. 2012. Article: Recent Techniques of Clustering of Time Series Data: A Survey. International Journal of Computer Applications 52, 15 (August 2012), 1–9. Full text available.
- Conformalized Quantile Regression. In Advances in Neural Information Processing Systems, H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, and R. Garnett (Eds.), Vol. 32. Curran Associates, Inc. https://proceedings.neurips.cc/paper_files/paper/2019/file/5103c3584b063c431bd1268e9b5e76fb-Paper.pdf
- M. Schuster and K.K. Paliwal. 1997. Bidirectional recurrent neural networks. IEEE Transactions on Signal Processing 45, 11 (1997), 2673–2681. https://doi.org/10.1109/78.650093
- End-to-End Time Series Imputation via Residual Short Paths. In Proceedings of The 10th Asian Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 95), Jun Zhu and Ichiro Takeuchi (Eds.). PMLR, 248–263. https://proceedings.mlr.press/v95/shen18a.html
- Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting. In Advances in Neural Information Processing Systems, C. Cortes, N. Lawrence, D. Lee, M. Sugiyama, and R. Garnett (Eds.), Vol. 28. Curran Associates, Inc. https://proceedings.neurips.cc/paper_files/paper/2015/file/07563a3fe3bbe7e3ba84431ad9d055af-Paper.pdf
- Predicting In-Hospital Mortality of ICU Patients: The PhysioNet/Computing in Cardiology Challenge 2012. computing in cardiology (2013).
- Ingo Steinwart and Andreas Christmann. 2011. Estimating conditional quantiles with the help of the pinball loss. arXiv e-prints (2011).
- Functional variational Bayesian neural networks. In International Conference on Learning Representations.
- GLIMA: Global and Local Time Series Imputation with Multi-directional Attention Learning. In 2020 IEEE International Conference on Big Data (Big Data). 798–807. https://doi.org/10.1109/BigData50022.2020.9378408
- Steffi Pauli Susanti and Fazat Nur Azizah. 2017. Imputation of missing value using dynamic Bayesian network for multivariate time series data. In 2017 International Conference on Data and Software Engineering (ICoDSE). IEEE, 1–5.
- Natasa Tagasovska and David Lopez-Paz. 2018. Frequentist uncertainty estimates for deep learning. (2018).
- A tensor-based method for missing traffic data completion. Transportation Research Part C: Emerging Technologies 28 (2013), 15–27. https://doi.org/10.1016/j.trc.2012.12.007 Euro Transportation: selected paper from the EWGT Meeting, Padova, September 2009.
- CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation. In Advances in Neural Information Processing Systems, M. Ranzato, A. Beygelzimer, Y. Dauphin, P.S. Liang, and J. Wortman Vaughan (Eds.), Vol. 34. Curran Associates, Inc., 24804–24816. https://proceedings.neurips.cc/paper_files/paper/2021/file/cfe8504bda37b575c70ee1a8276f3486-Paper.pdf
- James W. Taylor. 2000. A quantile regression neural network approach to estimating the conditional density of multiperiod returns. journal of forecasting 19, 4 (2000), 299–311.
- Uncertainty-Aware Deep Ensembles for Reliable and Explainable Predictions of Clinical Time Series. IEEE Journal of Biomedical and Health Informatics 25, 7 (2021), 2435–2444. https://doi.org/10.1109/JBHI.2020.3042637
- ST-MVL: Filling Missing Values in Geo-Sensory Time Series Data. In Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (New York, New York, USA) (IJCAI’16). AAAI Press, 2704–2710.
- Multi-directional Recurrent Neural Networks : A Novel Method for Estimating Missing Data. https://api.semanticscholar.org/CorpusID:39429412