Active learning with biased non-response to label requests (2312.08150v2)
Abstract: Active learning can improve the efficiency of training prediction models by identifying the most informative new labels to acquire. However, non-response to label requests can impact active learning's effectiveness in real-world contexts. We conceptualise this degradation by considering the type of non-response present in the data, demonstrating that biased non-response is particularly detrimental to model performance. We argue that biased non-response is likely in contexts where the labelling process, by nature, relies on user interactions. To mitigate the impact of biased non-response, we propose a cost-based correction to the sampling strategy--the Upper Confidence Bound of the Expected Utility (UCB-EU)--that can, plausibly, be applied to any active learning algorithm. Through experiments, we demonstrate that our method successfully reduces the harm from labelling non-response in many settings. However, we also characterise settings where the non-response bias in the annotations remains detrimental under UCB-EU for specific sampling methods and data generating processes. Finally, we evaluate our method on a real-world dataset from an e-commerce platform. We show that UCB-EU yields substantial performance improvements to conversion models that are trained on clicked impressions. Most generally, this research serves to both better conceptualise the interplay between types of non-response and model improvements via active learning, and to provide a practical, easy-to-implement correction that mitigates model degradation.
- Audibert JY, Bubeck S, Munos R (2010) Best arm identification in multi-armed bandits. In: COLT, pp 41–53 Barbieri et al [2016] Barbieri N, Silvestri F, Lalmas M (2016) Improving post-click user engagement on native ads via survival analysis. In: Proceedings of the 25th International Conference on World Wide Web, pp 761–770 Bartók et al [2014] Bartók G, Foster DP, Pál D, et al (2014) Partial monitoring—classification, regret bounds, and algorithms. Mathematics of Operations Research 39(4):967–997 Carcillo et al [2018] Carcillo F, Le Borgne YA, Caelen O, et al (2018) Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics 5:285–300 Cortes et al [2018] Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Barbieri N, Silvestri F, Lalmas M (2016) Improving post-click user engagement on native ads via survival analysis. In: Proceedings of the 25th International Conference on World Wide Web, pp 761–770 Bartók et al [2014] Bartók G, Foster DP, Pál D, et al (2014) Partial monitoring—classification, regret bounds, and algorithms. Mathematics of Operations Research 39(4):967–997 Carcillo et al [2018] Carcillo F, Le Borgne YA, Caelen O, et al (2018) Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics 5:285–300 Cortes et al [2018] Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Bartók G, Foster DP, Pál D, et al (2014) Partial monitoring—classification, regret bounds, and algorithms. Mathematics of Operations Research 39(4):967–997 Carcillo et al [2018] Carcillo F, Le Borgne YA, Caelen O, et al (2018) Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics 5:285–300 Cortes et al [2018] Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Carcillo F, Le Borgne YA, Caelen O, et al (2018) Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics 5:285–300 Cortes et al [2018] Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Barbieri N, Silvestri F, Lalmas M (2016) Improving post-click user engagement on native ads via survival analysis. In: Proceedings of the 25th International Conference on World Wide Web, pp 761–770 Bartók et al [2014] Bartók G, Foster DP, Pál D, et al (2014) Partial monitoring—classification, regret bounds, and algorithms. Mathematics of Operations Research 39(4):967–997 Carcillo et al [2018] Carcillo F, Le Borgne YA, Caelen O, et al (2018) Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics 5:285–300 Cortes et al [2018] Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Bartók G, Foster DP, Pál D, et al (2014) Partial monitoring—classification, regret bounds, and algorithms. Mathematics of Operations Research 39(4):967–997 Carcillo et al [2018] Carcillo F, Le Borgne YA, Caelen O, et al (2018) Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics 5:285–300 Cortes et al [2018] Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Carcillo F, Le Borgne YA, Caelen O, et al (2018) Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics 5:285–300 Cortes et al [2018] Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Bartók G, Foster DP, Pál D, et al (2014) Partial monitoring—classification, regret bounds, and algorithms. Mathematics of Operations Research 39(4):967–997 Carcillo et al [2018] Carcillo F, Le Borgne YA, Caelen O, et al (2018) Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics 5:285–300 Cortes et al [2018] Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Carcillo F, Le Borgne YA, Caelen O, et al (2018) Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics 5:285–300 Cortes et al [2018] Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Carcillo F, Le Borgne YA, Caelen O, et al (2018) Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics 5:285–300 Cortes et al [2018] Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Cortes C, DeSalvo G, Gentile C, et al (2018) Online learning with abstention. In: International conference on machine learning, pp 1059–1067 Elahi et al [2016] Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Elahi M, Ricci F, Rubens N (2016) A survey of active learning in collaborative filtering recommender systems. Computer Science Review 20:29–50 Fang et al [2012] Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Fang M, Zhu X, Zhang C (2012) Active learning from oracle with knowledge blind spot. In: Twenty-Sixth AAAI Conference on Artificial Intelligence Farquhar et al [2021] Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Farquhar S, Gal Y, Rainforth T (2021) On statistical bias in active learning: How and when to fix it. arXiv preprint arXiv:210111665 Freund et al [1997] Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Freund Y, Seung HS, Shamir E, et al (1997) Selective sampling using the query by committee algorithm. Machine learning 28(2-3):133 Gardner et al [2018] Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Gardner J, Pleiss G, Weinberger KQ, et al (2018) GPyTorch: Blackbox matrix-matrix Gaussian process inference with GPU acceleration. In: Advances in neural information processing systems Hansen and Hurwitz [1946] Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Hansen MH, Hurwitz WN (1946) The problem of non-response in sample surveys. Journal of the American Statistical Association 41(236):517–529 Huang et al [2014] Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Huang SJ, Jin R, Zhou ZH (2014) Active learning by querying informative and representative examples. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(10):1936–1949 King et al [2001] King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- King G, Honaker J, Joseph A, et al (2001) Analyzing incomplete political science data: An alternative algorithm for multiple imputation. American political science review 95(1):49–69 Lall and Robinson [2022] Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Lall R, Robinson T (2022) The midas touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis 30(2):179–196 Lattimore and Szepesvári [2020] Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Lattimore T, Szepesvári C (2020) Bandit algorithms. Cambridge University Press Lewis [1995] Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Lewis DD (1995) A sequential algorithm for training text classifiers: Corrigendum and additional data. In: ACM SIGIR Forum, pp 13–19 Lin et al [2016] Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Lin C, Mausam M, Weld D (2016) Re-active learning: Active learning with relabeling. In: Proceedings of the AAAI Conference on Artificial Intelligence Lin et al [2023] Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Lin X, Chen X, Song L, et al (2023) Tree based progressive regression model for watch-time prediction in short-video recommendation. arXiv preprint arXiv:230603392 Little and Rubin [2019] Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Little RJ, Rubin DB (2019) Statistical analysis with missing data, vol 793. John Wiley & Sons Ma et al [2018] Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Ma X, Zhao L, Huang G, et al (2018) Entire space multi-task model: An effective approach for estimating post-click conversion rate. In: Proceedings of the International ACM SIGIR Conference on Research & Development in Information Retrieval, pp 1137–1140 McCallum et al [1998] McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- McCallum A, Nigam K, et al (1998) Employing EM and pool-based active learning for text classification. In: ICML, pp 350–358 Mohan et al [2013] Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Mohan K, Pearl J, Tian J (2013) Graphical models for inference with missing data Nguyen et al [2022] Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Nguyen CV, Ho LST, Xu H, et al (2022) Bayesian active learning with abstention feedbacks. Neurocomputing 471:242–250 Nguyen et al [2020] Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Nguyen VA, Shi P, Ramakrishnan J, et al (2020) CLARA: confidence of labels and raters. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2542–2552 Rosales et al [2012] Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Rosales R, Cheng H, Manavoglu E (2012) Post-click conversion modeling and analysis for non-guaranteed delivery display advertising. In: Proceedings of the fifth ACM international conference on Web search and data mining, pp 293–302 Rubin [1976] Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Rubin DB (1976) Inference and missing data. Biometrika 63(3):581–592 Settles [2009] Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Settles B (2009) Active learning literature survey technical report. University of Wisconsin-Madison Department of Computer Sciences Seung et al [1992] Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Seung HS, Opper M, Sompolinsky H (1992) Query by committee. In: Proceedings of the fifth annual workshop on Computational learning theory, pp 287–294 Sheng et al [2008] Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Sheng VS, Provost F, Ipeirotis PG (2008) Get another label? improving data quality and data mining using multiple, noisy labelers. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 614–622 Stekhoven and Bühlmann [2012] Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Stekhoven DJ, Bühlmann P (2012) Missforest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28(1):112–118 Tax et al [2021] Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Tax N, de Vries KJ, de Jong M, et al (2021) Machine learning for fraud detection in e-commerce: A research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Springer, pp 30–54 Tianchi [2018] Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Tianchi (2018) Ad display/click data on taobao.com. URL https://tianchi.aliyun.com/dataset/dataDetail?dataId=56 Yan et al [2015] Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Yan S, Chaudhuri K, Javidi T (2015) Active learning from noisy and abstention feedback. In: 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton), pp 1352–1357 Yan et al [2016] Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Yan S, Chaudhuri K, Javidi T (2016) Active learning from imperfect labelers. In: Advances in Neural Information Processing Systems Yang and Loog [2018] Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Yang Y, Loog M (2018) A benchmark and comparison of active learning for logistic regression. Pattern Recognition 83:401–415 Zhao et al [2011] Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733 Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733
- Zhao L, Sukthankar G, Sukthankar R (2011) Incremental relabeling for active learning with noisy crowdsourced annotations. In: 2011 IEEE third international conference on privacy, security, risk and trust and 2011 IEEE third international conference on social computing, IEEE, pp 728–733